PhD at UChicago, Agentic RL / Post-training
Highlights
- Pro
Pinned Loading
-
Gen-Verse/OpenClaw-RL
Gen-Verse/OpenClaw-RL PublicOpenClaw-RL: Train any agent simply by talking
-
Gen-Verse/dLLM-RL
Gen-Verse/dLLM-RL Public[ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.
-
Gen-Verse/Open-AgentRL
Gen-Verse/Open-AgentRL Public[ICML 2026] RLAnything & DemyAgent: General and scalable agentic RL algorithms across terminal, GUI, SWE, and tool-call settings
-
Gen-Verse/CURE
Gen-Verse/CURE Public[NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.