-
slime Public
Forked from THUDM/slimeslime is an LLM post-training framework for RL Scaling.
Python Apache License 2.0 UpdatedDec 10, 2025 -
yet-another-claude-code Public
A minimal, hackable implementation of Claude’s code
-
-
deepagents Public
Forked from langchain-ai/deepagentsDeepagents is an agent harness built on langchain and langgraph. Deep agents are equipped with a planning tool, a filesystem backend, and the ability to spawn subagents - making them well-equipped …
Python MIT License UpdatedDec 3, 2025 -
rl-learned-plan-search Public
From Prompted Plan-and-Act to RL-Incentivized, In-Weights Planning for Multi-Turn RAG
Python Apache License 2.0 UpdatedNov 28, 2025 -
tau2-bench Public
Forked from sierra-research/tau2-benchτ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment
Python MIT License UpdatedNov 28, 2025 -
NeMo Public
Forked from NVIDIA-NeMo/NeMoA scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Python Apache License 2.0 UpdatedNov 26, 2025 -
Qwen3-Omni Public
Forked from QwenLM/Qwen3-OmniQwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.
Jupyter Notebook Apache License 2.0 UpdatedNov 25, 2025 -
-
trl Public
Forked from huggingface/trlTrain transformer language models with reinforcement learning.
Python Apache License 2.0 UpdatedNov 23, 2025 -
tau-retail-rl Public
End-to-end reinforcement learning for retail domain tasks focused on exchange and cancel actions, inspired by the τ-bench framework.
-
gpt-oss Public
Forked from openai/gpt-ossgpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Python Apache License 2.0 UpdatedNov 1, 2025 -
SeungyounShin.github.io Public template
Forked from alshedivat/al-folioA beautiful, simple, clean, and responsive Jekyll theme for academics
HTML MIT License UpdatedOct 27, 2025 -
-
FlashCosyVoice Public
Forked from xingchensong/FlashCosyVoiceFlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.
Python Apache License 2.0 UpdatedSep 15, 2025 -
verl Public
Forked from volcengine/verlverl: Volcano Engine Reinforcement Learning for LLMs
Python Apache License 2.0 UpdatedSep 13, 2025 -
tau-bench Public
Forked from sierra-research/tau-benchCode and Data for Tau-Bench
Python MIT License UpdatedAug 12, 2025 -
higgs-audio Public
Forked from boson-ai/higgs-audioText-audio foundation model from Boson AI
Python Apache License 2.0 UpdatedJul 31, 2025 -
Kimi-Audio Public
Forked from MoonshotAI/Kimi-AudioKimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation
Python UpdatedJul 22, 2025 -
-
EasyAgentRL Public
verl based search + code agent like o3
-
LLaSA_training Public
Forked from zhenye234/LLaSA_trainingLLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
Python Other UpdatedJun 9, 2025 -
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
-
moshi-finetune Public
Forked from nu-dialogue/moshi-finetuneFine-tuning Moshi/J-Moshi on your own spoken dialogue data
Python Apache License 2.0 UpdatedApr 11, 2025 -
Search-R1 Public
Forked from PeterGriffinJin/Search-R1Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
Python Apache License 2.0 UpdatedMar 27, 2025 -
-
-
open-r1 Public
Forked from huggingface/open-r1Fully open reproduction of DeepSeek-R1
Python Apache License 2.0 UpdatedFeb 7, 2025 -
-
search-and-learn Public
Forked from huggingface/search-and-learnPython Apache License 2.0 UpdatedDec 18, 2024