-
kxfan.github.io Public template
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
HTML UpdatedMay 21, 2026 -
Reagent Public
Agent-RRM: Exploring Reasoning Reward Model for Agents
-
rllm Public
Forked from rllm-org/rllmDemocratizing Reinforcement Learning for LLMs
Python Apache License 2.0 UpdatedDec 18, 2025 -
SophiaVL-R1 Public
SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward
-
EasyR1 Public
Forked from hiyouga/EasyR1EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Python Apache License 2.0 UpdatedJun 6, 2025 -
-