Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike static benchmarks, this platform introduces evolving environment…

Python 477 63 Updated Apr 16, 2026

meituan-longcat / vitabench

[ICLR 2026] VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications

Python 119 12 Updated Feb 22, 2026

THUNLP-MT / StableToolBench

A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.

Python 227 22 Updated Apr 15, 2025

a-yeyang / AI-researcher

Python 75 9 Updated Mar 22, 2026

RUC-NLPIR / EnvScaler

The official implementation of "EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis".

Python 127 7 Updated Feb 12, 2026

Snowflake-Labs / agent-world-model

Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning

Python 314 36 Updated Mar 16, 2026

lbjlaq / Antigravity-Manager

Professional Antigravity Account Manager & Switcher. One-click seamless account switching for Antigravity Tools. Built with Tauri v2 + React (Rust).专业的 Antigravity 账号管理与切换工具。为 Antigravity 提供一键无缝账号切…

Rust 28,373 3,094 Updated Mar 25, 2026

inclusionAI / AReaL

The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.

Python 5,059 469 Updated Apr 18, 2026

tencent-ailab / persona-hub

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Python 1,544 126 Updated Feb 19, 2025

TheAgentArk / Toucan

Official repo of Toucan: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments

Python 236 14 Updated Dec 16, 2025

sierra-research / tau2-bench

τ-Bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains

Python 1,040 264 Updated Apr 17, 2026

modelcontextprotocol / servers

Model Context Protocol Servers

TypeScript 84,039 10,433 Updated Apr 17, 2026

Nagi-ovo / gemini-voyager

An all-in-one enhancement suite for Google Gemini & AI Studio - timeline navigation, folder management, prompt library, and chat export in one powerful extension. / Google Gemini & AI Studio 全能增强插件…

TypeScript 16,937 528 Updated Apr 18, 2026

datawhalechina / easy-vibe

💻 vibe coding 2026 | Your first modern programming course for beginners to master step by step.

JavaScript 5,873 575 Updated Apr 8, 2026

nex-agi / Nex-N1

109 4 Updated Dec 5, 2025

datawhalechina / unlock-deepseek

DeepSeek 系列工作解读、扩展和复现。

Python 727 60 Updated Mar 9, 2026

hijiangtao / resume

个人中文简历 Latex 源码 https://hijiangtao.github.io/

TeX 3,069 694 Updated Sep 4, 2024

iMoonLab / yolov13

Implementation of "YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception".

Python 1,634 171 Updated Nov 18, 2025

TawfiqMohammed / F1-RAG-Assistant

A Retrieval-Augmented Generation system built on 50k+ Formula 1 records — ask natural language questions, get fast, accurate, confidence-scored answers.

Python 2 Updated Sep 27, 2025

SupritYoung / RLHF-Label-Tool

用于大模型 RLHF 进行人工数据标注排序的工具。A tool for manual response data annotation sorting in RLHF stage.

Python 256 19 Updated Aug 1, 2023

SupritYoung / Zhongjing

A Chinese medical ChatGPT based on LLaMa, training from large-scale pretrain corpus and multi-turn dialogue dataset.

Python 392 38 Updated Dec 12, 2023

FreedomIntelligence / CMB

CMB, A Comprehensive Medical Benchmark in Chinese

Python 237 21 Updated Mar 27, 2025

FreedomIntelligence / HuatuoGPT

HuatuoGPT, Towards Taming Language Models To Be a Doctor. (An Open Medical GPT)

Python 1,304 165 Updated Dec 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WaterRainting

Block or report WaterRainting

Stars

D2I-CUHKSZ / MicroWorld

openai / codex

1rgs / claude-code-proxy

666ghj / MiroFish

666ghj / BettaFish

ShishirPatil / gorilla

hkust-nlp / Toolathlon

facebookresearch / meta-agents-research-environments