Stars
τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment
ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows.
An interface library for RL post training with environments.
"Dive into LLMs": a hands-on programming tutorial series on large language models
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
A character-level language diffusion model trained on Tiny Shakespeare
PhD/MBA-level human-annotated rubrics dataset across Physics, Chemistry, Finance and Consulting
OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.
shizhediao / nanochat
Forked from karpathy/nanochat
The best ChatGPT that $100 can buy.
Rigorous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.
An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…
An Open-Source Large-Scale Reinforcement Learning Project for Search Agents
High accuracy RAG for answering questions from scientific documents with citations
H-Net: Hierarchical Network with Dynamic Chunking
Kimi K2 is the large language model series developed by Moonshot AI team
[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards
A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.