Stars
Fast, Attemptable Route Planner for Navigation in Known and Unknown Environments
A simple screen parsing tool towards pure vision based GUI agent
FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion
Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.
[CVPR 2025] UniGoal: Towards Universal Zero-shot Goal-oriented Navigation
[CoRL 2023] XSkill: cross embodiment skill discovery
[ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
The official repository of the first version of ACE-Brain foundation model.
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2026.
Official implementation of "OptMerge: Unifying Multimodal LLM Capabilities and Modalities via Model Merging".
Github repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025)
Tools for merging pretrained large language models.
The paper list of "Memory in the Age of AI Agents: A Survey"
A modular graph-based Retrieval-Augmented Generation (RAG) system
ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works…
AI agents running research on single-GPU nanochat training automatically
340 plugins + 1367 agent skills for Claude Code. Open-source marketplace with CCPI package manager, interactive tutorials, and production orchestration patterns.
Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety
Claude Code skill that removes signs of AI-generated writing from text
CVPR 2026 - MSGNav: Unleashing the Power of Multi-modal 3D Scene Graph for Zero-Shot Embodied Navigation
[ICLR 2026] π^3: Permutation-Equivariant Visual Geometry Learning
📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程
AgentFlow: In-the-Flow Agentic System Optimization
构建生产级AI智能体的12个工程原则。提供实用模式和最佳实践,帮助开发者构建可靠、可扩展的LLM驱动应用程序。(中文翻译版 by 云中江树)