Stars
MMSkills: Towards Multimodal Skills for General Visual Agents
HyperEyes is a parallel multimodal search agent that fuses visual grounding and retrieval into a single atomic action, enabling concurrent search across multiple entities while treating inference e…
An agentic skills framework & software development methodology that works.
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
DeepSeek 4 Flash local inference engine for Metal and CUDA
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
The agent that grows with you
The best-benchmarked open-source AI memory system. And it's free.
原汁原昧 Claude Code 可运行,可构建, 可调试版; 生产级工程化, 企业级可靠性; 安全无毒, 内存泄露修复
Lightweight coding agent that runs in your terminal
GlyphBanana: Advancing Precise Text Rendering Through Agentic Workflows
A Multimodal Reasoning Agent with Stateful Experiences
AI agents running research on single-GPU nanochat training automatically
OpenClaw skills for deep search — multi-source search, content extraction, and structured research reports.
Fully autonomous & self-evolving research from idea to paper. Chat an Idea. Get a Paper. 🦞
Tool-Genesis: A Task-Driven Tool Creation Benchmark for Self-Evolving Language Agent
Rewards as Labels: Revisiting RLVR from a Classification Perspective
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
The open agent skills tool - npx skills
A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
"🐈 nanobot: The Ultra-Lightweight Personal AI Agent"
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
SimpleMem: Efficient Lifelong Memory for LLM Agents — Text & Multimodal