-
Awesome-LLM-RAG Public
Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models
-
Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models
-
claude-code-source-code Public
Forked from sanbuphy/learn-coding-agentClaude Code v2.1.88 Source Code
TypeScript UpdatedMar 31, 2026 -
nanobot Public
Forked from HKUDS/nanobot"🐈 nanobot: The Ultra-Lightweight OpenClaw"
Python MIT License UpdatedMar 22, 2026 -
AI-Can-Learn-Scientific-Taste Public
Forked from tongjingqi/AI-Can-Learn-Scientific-TasteWe propose Reinforcement Learning from Community Feedback (RLCF), a training paradigm that uses large-scale community signals as supervision, and formulate scientific taste learning as a preference…
Apache License 2.0 UpdatedMar 22, 2026 -
NanoResearch Public
Forked from OpenRaiser/NanoResearch🦞+🔬: NanoResearch: The Autonomous AI Research Assistant
Python MIT License UpdatedMar 21, 2026 -
measuring-execution Public
Forked from long-horizon-execution/measuring-executionPython UpdatedMar 18, 2026 -
OpenClaw-RL Public
Forked from Gen-Verse/OpenClaw-RLOpenClaw-RL: Train any agent simply by talking
TypeScript MIT License UpdatedMar 15, 2026 -
-
Automodel Public
Forked from NVIDIA-NeMo/AutomodelPytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support
Python Apache License 2.0 UpdatedFeb 12, 2026 -
trl Public
Forked from huggingface/trlTrain transformer language models with reinforcement learning.
Python Apache License 2.0 UpdatedFeb 11, 2026 -
-
OpenTinker Public
Forked from open-tinker/OpenTinkerOpenTinker is an RL-as-a-Service infrastructure for foundation models
Python Apache License 2.0 UpdatedFeb 8, 2026 -
SDPO Public
Forked from lasgroup/SDPOReinforcement Learning via Self-Distillation (SDPO)
Python Apache License 2.0 UpdatedFeb 5, 2026 -
hello-agents Public
Forked from datawhalechina/hello-agents📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程
Python Other UpdatedFeb 2, 2026 -
Paper2Any Public
Forked from OpenDCAI/Paper2AnyTurn paper/text/topic into editable research figures, technical route diagrams, and presentation slides.
Python Apache License 2.0 UpdatedJan 20, 2026 -
Awesome-Agent-Memory Public
Forked from TeleAI-UAGI/Awesome-Agent-MemoryCurated systems, benchmarks, and papers etc. on memory for LLMs/MLLMs --- long-term context, retrieval, and reasoning.
Apache License 2.0 UpdatedJan 18, 2026 -
MiroThinker Public
Forked from MiroMindAI/MiroThinkerMiroThinker is an open source deep research agent optimized for research and prediction. It achieves a 60.2% Avg@8 score on the challenging GAIA benchmark.
Python MIT License UpdatedJan 16, 2026 -
BabyVision Public
Forked from UniPat-AI/BabyVisionWe introduce BabyVision, a benchmark revealing the infancy of AI vision.
Python UpdatedJan 13, 2026 -
-
AgentEvolver Public
Forked from modelscope/AgentEvolverAgentEvolver: Towards Efficient Self-Evolving Agent System
Python Apache License 2.0 UpdatedNov 21, 2025 -
enterprise-deep-research Public
Forked from SalesforceAIResearch/enterprise-deep-researchSalesforce Enterprise Deep Research
-
-
DreamGym Public
Forked from Pi3AI/DreamGymThis is AI implementation (not official) of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXiv:2511.03773).
Python UpdatedNov 9, 2025 -
DeepAgent Public
Forked from RUC-NLPIR/DeepAgent🛠️ DeepAgent: A General Reasoning Agent with Scalable Toolsets
Python MIT License UpdatedNov 2, 2025 -
torchforge Public
Forked from meta-pytorch/torchforgePyTorch-native post-training at scale
Python BSD 3-Clause "New" or "Revised" License UpdatedOct 28, 2025 -
magic-wormhole Public
Forked from magic-wormhole/magic-wormholeget things from one computer to another, safely
Python MIT License UpdatedOct 23, 2025 -
-
Agent-R Public
Forked from ByteDance-Seed/Agent-RResources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"
Python Apache License 2.0 UpdatedOct 20, 2025 -
verl-agent Public
Forked from langfengQ/verl-agentverl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"
Python Apache License 2.0 UpdatedOct 20, 2025