Starred repositories
From Agent to Agency: what one AI cannot do, a group of AIs can. Multi-Agent Collaboration Platform.
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
A self-hosted dashboard that puts all your feeds in one place
Claw-Eval is a harness for evaluating LLMs as agents. All tasks are verified by humans.
Fully open reproduction of DeepSeek-R1
[ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
An agent framework based on the hello-agents tutorial
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
A set of examples based on verl for end-to-end RL training recipes.
RLAnything (ICML 2026) & AutoTool (ICML 2026), DemyAgent: Open-Source RL for LLMs and Agentic Scenarios
OpenClaw-RL: Train any agent simply by talking
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
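Dive into LLMs (动手学大模型): a series of hands-on programming tutorials
✅ (Completed) Very comprehensive deep learning notes [Tudui: PyTorch] [Mu Li: Dive into Deep Learning] [Andrew Ng: Deep Learning] [Dafei: LLM Agents]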
AI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, and more). Turn any folder of code, SQL schemas, R scripts, shell scripts, docs, papers, images, or videos into a querya…
Open-source implementation of Karpathy's LLM Wiki. Upload documents, connect your Claude account via MCP, and have it write your wiki!
LLM Wiki is a cross-platform desktop application that turns your documents into an organized, interlinked knowledge base — automatically. Instead of traditional RAG (retrieve-and-answer from scratc…
Analyze problems in AI with math and code
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
Curated list of datasets and tools for post-training.
Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
📚 Building Agents from Scratch (《从零开始构建智能体》): a from-scratch tutorial on agent principles and practice
Low-level unprivileged sandboxing tool used by Flatpak and similar projects
An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skills, subagents, and a message gateway, it handles different levels of…