Skip to content
View muupan's full-sized avatar

Organizations

@pfnet @chainer

Block or report muupan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Enable SSH access to Kuberntes pods

Rust 12 Updated Feb 2, 2026

Python bindings to the Zstandard (zstd) compression library

C 618 108 Updated Sep 14, 2025

A feature-rich command-line audio/video downloader

Python 147,312 11,925 Updated Feb 12, 2026
Python 1,095 51 Updated Jan 10, 2026

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 116,814 12,534 Updated Feb 15, 2026

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 363 36 Updated Feb 14, 2026

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,343 111 Updated Jan 16, 2026

text window manager, shell multiplexer, integrated DevOps environment

Shell 1,502 133 Updated Feb 16, 2026

CLI tool which enables you to login and retrieve AWS temporary credentials using a SAML IDP

Go 2,200 600 Updated Nov 20, 2025

Chrome extension to disable youtube video titles autotranslation

JavaScript 420 22 Updated Jan 18, 2026

Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!

Jupyter Notebook 2,058 117 Updated Dec 3, 2025

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,303 425 Updated Jan 21, 2026

A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨

Python 273 21 Updated Apr 26, 2024

Web extension to set a default speed for video and audio

TypeScript 2,360 290 Updated Feb 12, 2026

The official implementation of "Horizon Reduction Makes RL Scalable"

Python 182 11 Updated Aug 2, 2025

OfficeBench: Benchmarking Language Agents across Multiple Applications for Office Automation

Python 30 6 Updated May 23, 2025
Jupyter Notebook 10 4 Updated Jul 17, 2025

OCR Benchmark

TypeScript 614 53 Updated Oct 21, 2025

日本の祝日を取得するライブラリ

Python 241 16 Updated Feb 2, 2026

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 14,666 1,373 Updated Jan 31, 2026

MR.Q is a general-purpose model-free reinforcement learning algorithm.

Python 143 8 Updated Jun 23, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,750 1,552 Updated Apr 24, 2025

Really Fast End-to-End Jax RL Implementations

Python 1,023 83 Updated Sep 9, 2024

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 9,105 988 Updated Jul 8, 2025

Python tool for converting files and office documents to Markdown.

Python 87,133 5,063 Updated Feb 13, 2026

Open Overleaf/ShareLaTex projects in vscode, with full collaboration support.

TypeScript 1,343 44 Updated Jan 11, 2026

A small library of LLM judges

Python 323 32 Updated Jul 31, 2025

Official Code for M-RᴇᴡᴀʀᴅBᴇɴᴄʜ: Evaluating Reward Models in Multilingual Settings (ACL 2025 Main)

Python 40 4 Updated May 16, 2025

The matrix cookbook, proved in the Lean theorem prover

Lean 126 20 Updated Sep 16, 2025
Next