Student of RUC and undergraduate majored in AI and Fintech.
Developed a 2.4B parameter LLM that was pre-trained from scratch.
-
Renmin University of China
- Beijing
Highlights
- Pro
Pinned Loading
-
RUC-GSAI/YuLan-Mini
RUC-GSAI/YuLan-Mini PublicA highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.
-
RUCAIBox/R1-Searcher
RUCAIBox/R1-Searcher PublicR1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
-
RUCAIBox/SimpleDeepSearcher
RUCAIBox/SimpleDeepSearcher PublicSimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis
-
RUCAIBox/R1-Searcher-plus
RUCAIBox/R1-Searcher-plus PublicR1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.