Lists (16)
Sort Name ascending (A-Z)
Stars
Your Personal AI super intelligence. Private, Simple and extremely powerful.
WebWorld is a large-scale web world model that helps train web agents in a simulated browser, avoiding the latency and safety issues of the real web.
VisualWebArena is a benchmark for multimodal agents.
Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"
[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
Code for "WebVoyager: WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models"
Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"
[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist web agents
A high-throughput and memory-efficient inference and serving engine for LLMs
AI generates natively editable PPTX from any document — real PowerPoint shapes with native animations, not images · by Hugo He
Academic Research Skills for Claude Code: research → write → review → revise → finalize
A library that integrates different MIL methods into a unified framework
💻 vibe coding 2026 | Your first modern Coding course for beginners to master step by step.
📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
Systematic framework for planning and writing academic papers using Claude Code. Includes strategist (planning) and composer (writing) skills with quality checkpoints.
Implementation of the paper "Improving Multi-step RAG with Hypergraph-based Memory for Long-context Complex Relational Modeling"
CLI proxy that reduces LLM token consumption by 60-90% on common dev commands. Single Rust binary, zero dependencies
Attention-Challenging Multiple Instance Learning for Whole Slide Image Classification (ECCV2024)
A curated list of awesome mathematics resources