Lists (12)
Sort Name ascending (A-Z)
Stars
"OpenPhone: Mobile Agentic Foundation Models for AI Phone"
🔥 OneThinker: All-in-one Reasoning Model for Image and Video
[NeurIPS 2025] Official Repo of Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration
Ola: Pushing the Frontiers of Omni-Modal Language Model
A framework for efficient model inference with omni-modality models
The parser that repairs broken JSON output for AI Agent Pipelines
🎨 NeMo Data Designer: A general library for generating high-quality synthetic data from scratch or based on seed data.
Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepowe…
Data recipes and robust infrastructure for training AI agents
The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution
ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows.
Harbor is a framework for running agent evaluations and creating and using RL environments.
This is the official repo for the paper "LongCat-Flash-Omni Technical Report"
Fara-7B: An Efficient Agentic Model for Computer Use
[Up-to-date] Large Language Model Agent: A Survey on Methodology, Applications and Challenges
[DL4C @ ICLR 2025] A Benchmark for Automated Environment Setup
Repo2Run is an LLM-based agent that automates environment configuration by generating error-free Dockerfiles for Python repositories.
The official repository of "A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications".
[DL4C @ NeurIPS 2025] On-Device Environment Setup via Online Reinforcement Learning
A Framework for LLM-based Multi-Agent Reinforced Training and Inference
The official implementation of "ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering"
Training LLMs to reason and analyze data with notebooks
A Model Context Protocol server for searching and analyzing arXiv papers
AgentScope: Agent-Oriented Programming for Building LLM Applications
DeepAnalyze is the first agentic LLM for autonomous data science. 🎈你的AI数据分析师,自动分析大量数据,一键生成专业分析报告!
Training VLM agents with multi-turn reinforcement learning