Stars
SWE-bench: Can Language Models Resolve Real-world GitHub Issues?
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
Official plugin for OpenClaw that exports agent traces to Opik. See and monitor agent behaviour, cost, tokens, errors and more.
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
An agentic skills framework & software development methodology that works.
Model Context Protocol Servers
Standard Go Project Layout
Scalable datastore for metrics, events, and real-time analytics
High-performance proxy for MySQL and PostgreSQL