DeepSeek-native AI coding agent for your terminal. Engineered around prefix-cache stability — leave it running.
Updated May 17, 2026 · TypeScript
🚀🚀🚀 A collection of awesome public projects about Large Language Models (LLM), Vision Language Models (VLM), Vision Language Action (VLA), AI Generated Content (AIGC), and the related datasets and applications.
Latest Advances on Long Chain-of-Thought Reasoning
Explore the Multimodal “Aha Moment” on 2B Model
😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization
Model Context Protocol server for DeepSeek's advanced language models
Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".
Official implementation of GUI-R1: A Generalist R1-Style Vision-Language Action Model For GUI Agents
Doge Family of Small Language Models
Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"
A comprehensive collection of process reward models.
[AAAI-2026] Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning"
[ACL 2025] The code repository for "Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning" in PyTorch.
[AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".