muupan

Yasuhiro Fujita muupan

Engineer at @pfnet

219 followers · 12 following

Achievements

x3 x3 x3

Organizations

Stars

radixark / miles

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 1,796 329 Updated Jul 27, 2026

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Python 5,738 596 Updated Jul 27, 2026

lwouis / alt-tab-macos

Windows alt-tab on macOS

Swift 16,096 797 Updated Jul 9, 2026

philc / vimium-new-tab

A small browser extension which shows a new tab page that Vimium can control

TypeScript 22 2 Updated Nov 20, 2025

spacecowboy / Feeder

Android feed reader app

Kotlin 2,928 196 Updated Jul 25, 2026

sbintuitions / flexeval

Flexible evaluation tool for language models

Python 61 4 Updated Jul 23, 2026

pfnet-research / sshpod

Enable SSH access to Kubernetes pods

Rust 15 1 Updated May 8, 2026

indygreg / python-zstandard

Python bindings to the Zstandard (zstd) compression library

C 638 117 Updated Jul 25, 2026

yt-dlp / yt-dlp

A feature-rich command-line audio/video downloader

Python 180,443 15,405 Updated Jul 23, 2026

huggingface / Math-Verify

Python 1,170 58 Updated Jan 10, 2026

excalidraw / excalidraw

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 128,464 14,588 Updated Jul 24, 2026

ServiceNow / PipelineRL

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 430 50 Updated Jul 20, 2026

open-thought / reasoning-gym

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,469 122 Updated Apr 17, 2026

dustinkirkland / byobu

text window manager, shell multiplexer, integrated DevOps environment

Python 1,679 140 Updated Jul 25, 2026

Versent / saml2aws

CLI tool which enables you to login and retrieve AWS temporary credentials using a SAML IDP

Go 2,233 617 Updated Nov 20, 2025

zpix1 / yt-anti-translate

Chrome extension to disable auto-translation of YouTube video titles

JavaScript 458 24 Updated Apr 30, 2026

huggingface / evaluation-guidebook

Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!

Jupyter Notebook 2,130 124 Updated Dec 3, 2025

huggingface / lighteval

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,501 517 Updated Jun 29, 2026

ZubinGou / math-evaluation-harness

A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨

Python 278 23 Updated Apr 26, 2024

polywock / globalSpeed

Web extension to set a default speed for video and audio

TypeScript 2,672 324 Updated Jun 30, 2026

seohongpark / horizon-reduction

The official implementation of "Horizon Reduction Makes RL Scalable"

Python 200 13 Updated Aug 2, 2025

zlwang-cs / OfficeBench

OfficeBench: Benchmarking Language Agents across Multiple Applications for Office Automation

Python 41 8 Updated Apr 1, 2026

lightblue-tech / M-IFEval

Jupyter Notebook 12 7 Updated Jul 17, 2025

getomni-ai / benchmark

OCR Benchmark

TypeScript 640 53 Updated Oct 21, 2025

Lalcs / jpholiday

A library for retrieving Japanese public holidays

Python 257 17 Updated Jun 25, 2026

MathFoundationRL / Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 17,256 1,644 Updated Jul 21, 2026

facebookresearch / MRQ

MR.Q is a general-purpose model-free reinforcement learning algorithm.

Python 154 13 Updated Apr 7, 2026

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 13,207 1,582 Updated Feb 27, 2026

luchris429 / purejaxrl

Really Fast End-to-End Jax RL Implementations

Python 1,093 87 Updated Sep 9, 2024

vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 10,168 1,142 Updated Apr 20, 2026
