🏠
Working from home
PhD student at Northwestern University. Previously at @deepseek-ai, @uiucnlp, and Renmin University.
Pinned
- mll-lab-nu/RAGEN (Public): RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
- xingyaoww/mint-bench (Public): Official repo for the ICLR 2024 paper "MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback" by Xingyao Wang*, Zihan Wang*, Jiateng Liu, Yangyi Chen, Lifan Yuan, Hao Peng and …
- mll-lab-nu/TStar (Public): TStar is a unified temporal search framework for long-form video question answering.
- yeruimeng/TraTree (Public): Trajectory optimization methods for improving LLM agents via weak-to-strong learning.