-
mini-swe-agent Public
Forked from SWE-agent/mini-swe-agentThe 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no crazy configs, no giant monorepo—but scores 65% on SWE-bench verified!
Python MIT License UpdatedJul 25, 2025 -
agent-lightning Public
Forked from microsoft/agent-lightningPython MIT License UpdatedJul 24, 2025 -
llm-sandbox Public
Forked from vndee/llm-sandboxLightweight and portable LLM sandbox runtime (code interpreter) Python library.
Python MIT License UpdatedJul 23, 2025 -
FinGenius Public
Forked from HuaYaoAI/FinGeniusPython GNU General Public License v3.0 UpdatedJul 22, 2025 -
HRM Public
Forked from sapientinc/HRMHierarchical Reasoning Model Official Release
Python Apache License 2.0 UpdatedJul 21, 2025 -
Awesome-ML-SYS-Tutorial Public
Forked from zhaochenyang20/Awesome-ML-SYS-TutorialMy learning notes/codes for ML SYS.
Python Apache License 2.0 UpdatedJul 21, 2025 -
Awesome-Uncertainty-based-Reinforcement-Learning Public
Forked from falonss703/Awesome-Uncertainty-based-Reinforcement-Learning🔥🔥🔥Latest Papers, Codes on Uncertainty-based RL
UpdatedJun 20, 2025 -
slime Public
Forked from THUDM/slimeslime is a LLM post-training framework aiming at scaling RL.
Python Apache License 2.0 UpdatedJun 20, 2025 -
TreeRL Public
Forked from THUDM/TreeRLTreeRL: LLM Reinforcement Learning with On-Policy Tree Search in ACL'25
Python Apache License 2.0 UpdatedJun 16, 2025 -
Meta-rater Public
Forked from opendatalab/Meta-rater[ACL 2025] This is the official implementation for the paper: "Meta-rater: A Multi-dimensional Data Selection Method for Pre-training Language Models"
-
llm-reasoners Public
Forked from maitrix-org/llm-reasonersA library for advanced large language model reasoning
Python Apache License 2.0 UpdatedJun 10, 2025 -
SWE-bench-Live Public
Forked from microsoft/SWE-bench-Live🚀 SWE-bench Goes Live!
Python MIT License UpdatedMay 30, 2025 -
-
RedTeamCUA Public
Forked from OSU-NLP-Group/RedTeamCUARedTeamCUA: Realistic Adversarial Testing of Computer-Use Agents in Hybrid Web-OS Environments
Python Apache License 2.0 UpdatedMay 29, 2025 -
EvoAgentX Public
Forked from EvoAgentX/EvoAgentX🚀 EvoAgentX: Building a Self-Evolving Ecosystem of AI Agents
Python Other UpdatedMay 28, 2025 -
SynLogic Public
Forked from MiniMax-AI/SynLogicThe official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond
Python MIT License UpdatedMay 28, 2025 -
deer-flow Public
Forked from bytedance/deer-flowDeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
TypeScript MIT License UpdatedMay 28, 2025 -
agent-distillation Public
Forked from Nardien/agent-distillationPython Apache License 2.0 UpdatedMay 26, 2025 -
One-RL-to-See-Them-All Public
Forked from MiniMax-AI/One-RL-to-See-Them-AllOne RL to See Them All: Visual Triple Unified Reinforcement Learning
MIT License UpdatedMay 25, 2025 -
InternBootcamp Public
Forked from InternLM/InternBootcampPython Apache License 2.0 UpdatedMay 23, 2025 -
-
-
WebOrganizer Public
Forked from CodeCreator/WebOrganizerOrganize the Web: Constructing Domains Enhances Pre-Training Data Curation
Jupyter Notebook Apache License 2.0 UpdatedMay 2, 2025 -
-
MM-EUREKA Public
Forked from ModalMinds/MM-EUREKAMM-EUREKA: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning
Python Apache License 2.0 UpdatedMar 8, 2025 -
AppAgentX Public
Forked from Westlake-AGI-Lab/AppAgentXOfficial implementation of AppAgentX: Evolving GUI Agents as Proficient Smartphone Users
Python UpdatedMar 6, 2025 -
-
kodcode Public
Forked from KodCode-AI/kodcodeGenerate diverse coding questions and verifiable solutions - all in one framework
Python Apache License 2.0 UpdatedMar 5, 2025 -
RAGEN Public
Forked from mll-lab-nu/RAGENRAGEN is the first open-source reproduction of DeepSeek-R1 on AGENT training.
Python Apache License 2.0 UpdatedFeb 6, 2025 -
demystify-long-cot Public
Forked from eddycmu/demystify-long-cotPython MIT License UpdatedFeb 5, 2025