Organizations

@pfnet @chainer

Democratizing Reinforcement Learning for LLMs

Python 5,630 577 Updated Jun 18, 2026

Windows alt-tab on macOS

Swift 15,905 708 Updated Jun 18, 2026

A small browser extension which shows a new tab page that Vimium can control

TypeScript 19 2 Updated Nov 20, 2025

Android feed reader app

Kotlin 2,873 187 Updated Jun 19, 2026

Flexible evaluation tool for language models

Python 59 4 Updated Jun 19, 2026

Enable SSH access to Kubernetes pods

Rust 14 1 Updated May 8, 2026

Python bindings to the Zstandard (zstd) compression library

C 635 117 Updated Sep 14, 2025

A feature-rich command-line audio/video downloader

Python 171,622 14,469 Updated Jun 18, 2026
Python 1,156 55 Updated Jan 10, 2026

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 125,632 14,072 Updated Jun 17, 2026

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 425 44 Updated Jun 16, 2026

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,445 120 Updated Apr 17, 2026

text window manager, shell multiplexer, integrated DevOps environment

Shell 1,645 138 Updated Jun 13, 2026

CLI tool which enables you to login and retrieve AWS temporary credentials using a SAML IDP

Go 2,222 611 Updated Nov 20, 2025

Chrome extension to disable YouTube video title auto-translation

JavaScript 456 24 Updated Apr 30, 2026

Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!

Jupyter Notebook 2,123 123 Updated Dec 3, 2025

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,455 492 Updated Jun 9, 2026

A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨

Python 275 23 Updated Apr 26, 2024

Web extension to set a default speed for video and audio

TypeScript 2,613 315 Updated Jun 19, 2026

The official implementation of "Horizon Reduction Makes RL Scalable"

Python 196 12 Updated Aug 2, 2025

OfficeBench: Benchmarking Language Agents across Multiple Applications for Office Automation

Python 37 8 Updated Apr 1, 2026
Jupyter Notebook 12 7 Updated Jul 17, 2025

OCR Benchmark

TypeScript 635 54 Updated Oct 21, 2025

A library for retrieving Japanese public holidays

Python 254 17 Updated Feb 2, 2026

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 16,522 1,562 Updated May 26, 2026

MR.Q is a general-purpose model-free reinforcement learning algorithm.

Python 153 12 Updated Apr 7, 2026

Minimal reproduction of DeepSeek R1-Zero

Python 13,174 1,585 Updated Feb 27, 2026

Really Fast End-to-End Jax RL Implementations

Python 1,081 87 Updated Sep 9, 2024

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 9,984 1,105 Updated Apr 20, 2026

Python tool for converting files and office documents to Markdown.

Python 155,827 10,825 Updated May 26, 2026