Skip to content
View TongLi3701's full-sized avatar
๐ŸŽฏ
Focusing
๐ŸŽฏ
Focusing

Organizations

@hpcaitech

Block or report TongLi3701

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Showing results

slime is an LLM post-training framework for RL Scaling.

Python 2,921 353 Updated Dec 21, 2025

Tiny-FSDP, a minimalistic re-implementation of the PyTorch FSDP

Python 91 7 Updated Aug 20, 2025

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,180 120 Updated Nov 9, 2025

An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to inference, as well as some practical experiences and conclusions.โ€ฆ

Python 787 94 Updated Mar 13, 2025

A very simple GRPO implement for reproducing r1-like LLM thinking.

Python 1,508 120 Updated Nov 21, 2025

Fully open reproduction of DeepSeek-R1

Python 25,745 2,405 Updated Nov 24, 2025

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Python 190 6 Updated Mar 20, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,418 163 Updated Mar 20, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,659 2,860 Updated Dec 21, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,785 99 Updated Mar 18, 2025

An Open Large Reasoning Model for Real-World Solutions

Python 1,537 80 Updated May 30, 2025

A flexible and efficient training framework for large-scale alignment tasks

Python 444 39 Updated Oct 23, 2025

Large Reasoning Models

Python 806 47 Updated Dec 3, 2024

[ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

Python 411 17 Updated Apr 25, 2025
Python 130 20 Updated Jun 18, 2024

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 26,190 3,751 Updated Dec 20, 2025

(ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training

Python 282 30 Updated May 26, 2024

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,831 134 Updated Jan 17, 2025

O1 Replication Journey

2,003 63 Updated Jan 14, 2025

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)

Python 686 50 Updated Jan 20, 2025

Building open version of OpenAI o1 via reasoning traces (Groq, ollama, Anthropic, Gemini, OpenAI, Azure supported) Demo: https://huggingface.co/spaces/pseudotensor/open-strawberry

Python 187 18 Updated Oct 15, 2024

Code for Quiet-STaR

Python 742 91 Updated Aug 21, 2024

Writing AI Conference Papers: A Handbook for Beginners

3,209 117 Updated Jul 16, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 ๐Ÿ“ and reasoning techniques.

6,870 371 Updated Dec 17, 2025

Efficient Triton Kernels for LLM Training

Python 5,962 452 Updated Dec 20, 2025

Empowering RAG with a memory-based data interface for all-purpose applications!

Python 2,188 154 Updated Sep 11, 2025

A simple, easy-to-hack GraphRAG implementation

Python 3,559 376 Updated Jul 23, 2025

๐Ÿƒ MINT-1T: A one trillion token multimodal interleaved dataset.

827 18 Updated Jul 31, 2024

Biomedical Generalist Video Generation Model

Python 175 1 Updated Oct 20, 2024

[TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.

Python 1,897 188 Updated Oct 30, 2025
Next