taoszhang

张涛 taoszhang

Institute of Automation, Chinese Academy of Sciences
Beijing

Stars

facebookresearch / meta-agents-research-environments

Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike static benchmarks, this platform introduces evolving environment…

Python 289 29 Updated Oct 8, 2025

PolyU-ChenLab / UniPixel

🔮 UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning (NeurIPS 2025)

Python 73 4 Updated Oct 8, 2025

WooooDyy / AgentGym-RL

Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.

Python 424 40 Updated Sep 11, 2025

TsinghuaC3I / Awesome-RL-for-LRMs

A Survey of Reinforcement Learning for Large Reasoning Models

1,685 95 Updated Oct 8, 2025

Mini-o3 / Mini-o3

Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"

Python 329 15 Updated Sep 15, 2025

0russwest0 / Agent-R1

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Python 823 50 Updated Jul 22, 2025

MMBrowseComp / MM-BrowseComp

Python 35 Updated Sep 22, 2025

TIGER-AI-Lab / verl-tool

A version of verl to support diverse tool use

Python 580 42 Updated Oct 6, 2025

lime-RL / DCPO

DCPO: Dynamic Adaptive Clipping for RL

Python 34 4 Updated Sep 25, 2025

xhyumiracle / Awesome-AgenticLLM-RL-Papers

882 42 Updated Sep 5, 2025

ofirpress / self-ask

Code and data for "Measuring and Narrowing the Compositionality Gap in Language Models"

Jupyter Notebook 322 36 Updated Dec 28, 2023

StonyBrookNLP / musique

Repository for MuSiQue: Multi-hop Questions via Single-hop Question Composition, TACL 2022

Python 168 18 Updated Jun 12, 2024

alibaba / ROLL

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,014 123 Updated Sep 29, 2025

ByteDance-Seed / VeOmni

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,199 71 Updated Oct 9, 2025

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 15,643 1,166 Updated Oct 5, 2025

langfengQ / verl-agent

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 987 82 Updated Oct 6, 2025

openai / simple-evals

Python 4,099 438 Updated Jul 31, 2025

Ayanami0730 / deep_research_bench

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Python 408 42 Updated Aug 3, 2025

bytedance / deer-flow

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 17,387 2,260 Updated Oct 5, 2025

ByteDance-Seed / m3-agent

Python 996 85 Updated Oct 9, 2025

QwenLM / Qwen-Image

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 5,589 301 Updated Sep 30, 2025

openai / gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 18,752 1,835 Updated Oct 6, 2025

w1oves / hqclip

[ICCV 2025] HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets

50 1 Updated Aug 6, 2025

infiniflow / ragflow

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

TypeScript 65,664 6,904 Updated Oct 9, 2025

Alibaba-NLP / ZeroSearch

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Python 1,160 109 Updated Aug 16, 2025

ysymyth / ReAct

[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models

Jupyter Notebook 3,051 313 Updated Feb 6, 2024

xorbitsai / inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …

Python 8,598 745 Updated Oct 1, 2025

hymie122 / RAG-Survey

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

1,745 121 Updated Aug 20, 2024

jxzhangjhu / Awesome-LLM-RAG

Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models

1,276 74 Updated Feb 24, 2025

llm-lab-org / Multimodal-RAG-Survey

A Survey on Multimodal Retrieval-Augmented Generation

377 15 Updated Sep 22, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly