-
Institute of Automation, Chinese Academy of Sciences
- Beijing
Stars
Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike static benchmarks, this platform introduces evolving environment…
🔮 UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning (NeurIPS 2025)
Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.
A Survey of Reinforcement Learning for Large Reasoning Models
Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
A version of verl to support diverse tool use
Code and data for "Measuring and Narrowing the Compositionality Gap in Language Models"
Repository for MuSiQue: Multi-hop Questions via Single-hop Question Composition, TACL 2022
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo
Tongyi Deep Research, the Leading Open-source Deep Research Agent
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents
DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
[ICCV 2025] HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
ZeroSearch: Incentivize the Search Capability of LLMs without Searching
[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models
A Survey on Multimodal Retrieval-Augmented Generation