Stars
Harbor is a framework for running agent evaluations and creating and using RL environments.
Makes your AI agent think like the laziest senior dev in the room. The best code is the code you never wrote.
Start a run in your terminal and walk away. Get pinged when it finishes, or needs you. Every step is a readable trace you check before anything ships.
Public ant-irys code and Harvey LAB benchmark results
Continuous background code review database for agents. Work faster and smarter with accountability for every line of generated code.
Open-source & free. Battle-tested at Alibaba's scale. Hybrid architecture code review tool: deterministic pipelines + LLM Agent, precise line-level comments, built-in fine-tuned ruleset (NPE, thre…
Skills for threat modeling, scanning, triage, patching, plus an autonomous scanning harness you can /customize
Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.
Free and open source database client built natively for developers
Multi-harness agentic plugin marketplace for Claude Code, Codex CLI, Cursor, OpenCode, GitHub Copilot, and Gemini CLI
A GPU-accelerated cross-platform terminal emulator and multiplexer written by @wez and implemented in Rust
AutoEvals is a tool for quickly and easily evaluating AI model outputs using best practices.
AI-powered QA testing framework that uses LLMs (Claude or GPT) to test web apps, CLI tools, and TUI programs from markdown story cards, returning structured pass/fail verdicts with evidence.
A lightweight alternative to OpenClaw that runs in containers for security. Connects to WhatsApp, Telegram, Slack, Discord, Gmail and other messaging apps, has memory, scheduled jobs, and runs dir…
Open-source credential gateway with a built-in vault. Give your AI agents access to services without exposing keys.
LLM-supervised persistent memory for AI agents: graph-based recall, cross-session knowledge, single binary. Works with Claude Code, OpenClaw, and any CLI agent.
Agent multiplexer that lives in your terminal.
Lightweight and memory-efficient terminal for Mac built with SwiftUI and libghostty
A native macOS terminal for agent-driven development, built on Ghostty.
SwiftUI component for displaying rich release notes inside an app
The headless browser for AI agents and web scraping
Clean, uninstall, analyze, optimize, and monitor your Mac from the terminal.
A fast, out-of-the-box terminal built for AI coding.
The blackboard for coding agents: a multi-session tool for Claude Code, Cursor, Codex, and Gemini
Clone any website with one command using AI coding agents
A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls.
A Git worktree workflow tool for AI coding agents. Enables parallel development with isolated environments.
Worktrunk is a CLI for Git worktree management, designed for parallel AI agent workflows
Bring your own agent and build a self-improving agentic system. Automatically mine failures, optimize the agent harness, and gate against regressions.