Stars
"AI-Trader: Can AI Beat the Market?" Live Trading Bench: https://ai4trade.ai
"LightAgent: Lightweight and Cost-Effective Mobile Agents"
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus Agent Tools, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae…
"LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?"
"DeepResearch-Eval: An End-to-End Evaluation Framework for DeepResearch Systems"
[ACM TIST] "LLM4Urban: Urban Computing in the Era of Large Language Models"
[Survey] A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems
MiroMind Research Agent: Fully Open-Source Deep Research Agent with Reproducible State-of-the-Art Performance on FutureX, GAIA, HLE, BrowserComp and xBench.
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
"VideoAgent: All-in-One Agentic Framework for Video Understanding, Editing, and Remaking"
"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"
Awesome Deep Research list! For more details, please refer to our survey paper -- A Comprehensive Survey of Deep Research: Systems, Methodologies, and Applications
"RAG-Anything: All-in-One RAG Framework"
🚀 The fast, Pythonic way to build MCP servers and clients
Official Repository of Absolute Zero Reasoner
ACI.dev is the open source tool-calling platform that hooks up 600+ tools into any agentic IDE or custom AI agent through direct function calling or a unified MCP server. The birthplace of VibeOps.
Define, Prompt and Test MCP enabled Agents and Workflows
Model Context Protocol(MCP) 编程极速入门
Model Context Protocol Servers
"ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
An accurate GUI element detection approach based on old-fashioned CV algorithms [Upgraded on 5/July/2021]
[NeurIPS2025] "AI-Researcher: Autonomous Scientific Innovation" -- A production-ready version: https://novix.science/chat
PodAgent: A Comprehensive Framework for Podcast Generation
[CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents
Building a comprehensive and handy list of papers for GUI agents