- 📫 How to reach me >> mengqili1@link.cuhk.edu.cn or limq01@foxmail.com
😇
996
Highlights
- Pro
Pinned Loading
-
Ledzy/StreamBP
Ledzy/StreamBP PublicOfficial code of "StreamBP: Memory-Efficient Exact Backpropagation for Long Sequence Training of LLMs".
-
OnlineSFT
OnlineSFT PublicOfficial code of "Online SFT for LLM Reasoning: Surprising Effectiveness of Self-Tuning without Rewards".
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.