r1

Here are 67 public repositories matching this topic...

esengine / DeepSeek-Reasonix

DeepSeek-native AI coding agent for your terminal. Engineered around prefix-cache stability — leave it running.

agent cli typescript terminal tui developer-tools ink r1 tool-use agent-framework ai-agent llm prompt-caching deepseek ai-coding coding-agent

Updated May 17, 2026
TypeScript

zzli2022 / Awesome-System2-Reasoning-LLM

Star

Latest Advances on System-2 Reasoning

benchmark mcts rl reasoning r1 prm o3 o1 slow-fast system-2 self-improve macro-action

Updated Jun 8, 2025
Python

RUC-NLPIR / Search-o1

Star

🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025]

math livecode amc reasoning r1 rag qwq aimo o1 gpqa

Updated Nov 17, 2025
Python

coderonion / awesome-llm-and-aigc

Star

🚀🚀🚀A collection of some awesome public projects about Large Language Model(LLM), Vision Language Model(VLM), Vision Language Action(VLA), AI Generated Content(AIGC), the related Datasets and Applications.

Updated Aug 1, 2025

LightChen233 / Awesome-Long-Chain-of-Thought-Reasoning

Star

Latest Advances on Long Chain-of-Thought Reasoning

agent reinforcement-learning rl long thinking reasoning r1 o3 o1 system-2 chain-of-thought openai-o1 reasoning-language-models deepseek-r1 long-chain-of-thought

Updated Jul 18, 2025

turningpoint-ai / VisualThinker-R1-Zero

Star

Explore the Multimodal “Aha Moment” on 2B Model

reinforcement-learning reasoning r1 post-training multimodal deepseek deepseek-r1 grpo deepseek-r1-zero r1-zero multimodal-journey multimodal-r1

Updated Mar 18, 2025
Python

modelscope / awesome-deep-reasoning

Star

Collect every awesome work about r1!

collection rl reasoning r1 o1 qwen deepseek grpo

Updated May 2, 2025
Python

XiaoYee / Awesome_Efficient_LRM_Reasoning

Star

😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond

efficient reasoning r1 slow-fast chain-of-thought long-cot efficient-reasoning reasoning-budget overthinking underthinking efficient-agents

Updated Jan 22, 2026

jingyi0000 / R1-VL

Star

R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization

reinforcement-learning reasoning r1 mllm vision-language-model multimodal-large-language-models

Updated Dec 16, 2025
Python

DMontgomery40 / deepseek-mcp-server

Star

Model Context Protocol server for DeepSeek's advanced language models

mcp r1 deepseek-chat deepseek-api model-context-protocol deepseek-v3 deepseek-r1

Updated Apr 24, 2026
TypeScript

Zeyi-Lin / Qwen3-Medical-SFT

Star

Qwen3 Fine-tuning: Medical R1 Style Chat

r1 fine-tuning sft qwen3

Updated May 31, 2025
Python

RyanLiu112 / compute-optimal-tts

Star

Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".

r1 o1 large-language-model process-reward-model test-time-scaling

Updated Feb 19, 2025
Python

ritzz-ai / GUI-R1

Star

Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents

deep-reinforcement-learning r1 multimodal o1 multimodal-large-language-models large-multimodal-models gui-agent grpo mllm-reasoning

Updated May 5, 2025
Python

SmallDoges / small-doge

Star

Doge Family of Small Language Models

python nlp natural-language-processing reinforcement-learning deep-learning pytorch transformer chinese webui attention-mechanism r1 attention-is-all-you-need mechine-learning foundation-models small-language-models dynamic-mask-attention cross-domain-mixture-of-experts deepseek-r1

Updated Jan 6, 2026
Python

CJReinforce / PURE

Star

Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"

reinforcement-learning mathematics rl reasoning r1 o1 llm reinforcement-finetuning

Updated Oct 23, 2025
Python

RyanLiu112 / Awesome-Process-Reward-Models

Star

A comprehensive collection of process reward models.

r1 o1 large-language-model process-reward-model

Updated Oct 4, 2025

lll6gg / UI-R1

Star

[AAAI-2026] Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning"

reinforcement-learning r1 multimodal-learning multimodal-large-language-models gui-agent efficient-reasoning

Updated Nov 24, 2025
Python

LLM360 / Reasoning360

Star

A repo for open research on building large reasoning models

rl reasoning r1 llm qwen

Updated Mar 3, 2026
Python

sun-hailong / TVC

Star

[ACL 2025] The code repository for "Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning" in PyTorch.

reasoning r1 cot forgetting mllms multimodel-large-language-model

Updated May 16, 2025
Python

RyanLiu112 / GenPRM

Star

[AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".

r1 o1 large-language-model process-reward-model test-time-scaling

Updated Nov 8, 2025
Python

Improve this page

Add a description, image, and links to the r1 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the r1 topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

r1

Here are 67 public repositories matching this topic...

esengine / DeepSeek-Reasonix

zzli2022 / Awesome-System2-Reasoning-LLM

RUC-NLPIR / Search-o1

coderonion / awesome-llm-and-aigc

LightChen233 / Awesome-Long-Chain-of-Thought-Reasoning

turningpoint-ai / VisualThinker-R1-Zero

modelscope / awesome-deep-reasoning

XiaoYee / Awesome_Efficient_LRM_Reasoning

jingyi0000 / R1-VL

DMontgomery40 / deepseek-mcp-server

Zeyi-Lin / Qwen3-Medical-SFT

RyanLiu112 / compute-optimal-tts

ritzz-ai / GUI-R1

SmallDoges / small-doge

CJReinforce / PURE

RyanLiu112 / Awesome-Process-Reward-Models

lll6gg / UI-R1

LLM360 / Reasoning360

sun-hailong / TVC

RyanLiu112 / GenPRM

Improve this page

Add this topic to your repo