Starred repositories
BEDC: Binary Emission Discovery Calculus (mathlib-free Lean 4 + LaTeX paper)
Turn any document or a whole zip into an interactive knowledge graph, using a self-hosted Qwen3.6-35B-A3B-MTP on a single NVIDIA L4
A benchmark for evaluating AI agents on frontier ultra long-horizon auto research tasks.
DeepSeek 4 Flash and PRO local inference engine for Metal, CUDA and ROCm
The 100-line AI agent that solves GitHub issues or helps you in your command line. Radically simple, with no huge configs and no giant monorepo, but scores >74% on SWE-bench Verified!
Fast and memory-efficient classical machine learning operators
Winner 🏆 (Agent-only) MLSys 2026 - FlashInfer AI Kernel Generation Contest for the DeepSeek Sparse Attention (DSA) track with an average speedup of 34.93x
A high-performance toolkit for atomistic simulations in JAX.
Adaptive Chunking: automatically select the best chunking method per document for RAG. Accepted at LREC 2026.
A plugin for your agentic framework that optimizes code using the GEPA algorithm (Genetic-Pareto LLM-driven search).
Turns your codebase into an autoresearch loop: discovers what to measure, instruments the benchmark, then runs tree search with parallel subagents.
Browser Harness | Self-healing harness that enables LLMs to complete any task.
An open-source Skill collection for GEO content and workflows, continuously updated.
GEO experiment data reports and a curated GEO/AEO/AI search paper library.
Production-focused, self-harnessed LM runtime (RLM) that allows the LM to call its sub-LM with DSPy signatures. Define your inputs, outputs, and tools; the model handles its own control flow. Get f…
AI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, and more). Turn any folder of code, SQL schemas, R scripts, shell scripts, docs, papers, images, or videos into a querya…
FlyDSL is the Python front‑end of the project: Flexible LaYout DSL.
A curated list of awesome skills, tools, integrations, and resources for Hermes Agent by Nous Research
A Systematic Analysis and Discussion of Claude Code for Designing Today's and Future AI Agent Systems
Kronos: A Foundation Model for the Language of Financial Markets
TreeSearch: Search your codebase like a human, not like a vector database. No embeddings. No chunking. Just millisecond search over structured documents and large codebases. No embeddings needed, no document splitting needed, …
OpenHarness: Open Agent Harness with a Built-in Personal Agent, Ohmo!
Agent Skill for exploring Obsidian vaults with Enzyme — self-contained, cross-agent compatible
Reverse-engineering Claude Code's 512K+ lines of TypeScript — architecture, design decisions, and 11 transferable patterns for building AI Agents
Memento-Skills: Let Agents Design Agents
The agent that grows with you