Lists (3)
Sort Name ascending (A-Z)
Starred repositories
Capability-based sandboxes with fine-grained policies . Brokering access directly within the agent's operating context, with zero setup and zero latency
Make any LLM talk like a normal person. A system prompt that removes AI slop.
The design language that makes your AI harness better at design.
A Claude Code skill that 10x's your effective context window by dispatching tasks to background AI workers.
These are commands I use with agents, mostly Claude
Event-driven memory for reliable, self-improving AI agents
AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods
jless is a command-line JSON viewer designed for reading, exploring, and searching through JSON data.
Rule-based double-entry bookkeeping importer (from Alipay/WeChat/Huobi etc. to Beancount/Ledger).
Context engineering is the new vibe coding - it's the way to actually make AI coding assistants work. Claude Code is the best for this so that's what this repo is centered around, but you can apply…
The AI framework that adds the engineering to prompt engineering (Python/TS/Ruby/Java/C#/Rust/Go compatible)
Modal command dispatch that speaks native Emacs keybindings
[ICLR 2025 Oral] Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
Rule Snippet & Rule Set for Surge / Mihomo (Clash.Meta) / Clash Premium (Dreamacro) / sing-box / Surfboard for Android / Stash
Testing Language Models for Memorization of Tabular Datasets.
Touying is a powerful package for creating presentation slides in Typst.
AI observability platform for production LLM and agent systems.
[ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI
[NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?