Stars
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
Inspect: A framework for large language model evaluations
An open-source visual programming environment for battle-testing prompts to LLMs.
Open-source library for scalable, reproducible evaluation of AI models and benchmarks.
Provider-agnostic, open-source evaluation infrastructure for language models
React Flow | Svelte Flow - Powerful open source libraries for building node-based UIs with React (https://reactflow.dev) or Svelte (https://svelteflow.dev). Ready out-of-the-box and infinitely cust…
SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.
FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…
Robustness with frontier LLMs: R Software and Paper
Deepagents is an agent harness built on LangChain and LangGraph. Deep agents are equipped with a planning tool, a filesystem backend, and the ability to spawn subagents - making them well-equipped …
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
Integrate cutting-edge LLM technology quickly and easily into your apps
Supercharge Your LLM Application Evaluations 🚀
DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL
Reached #13 on Stanford's Terminal Bench leaderboard. Orchestrator, explorer & coder agents working together with intelligent context sharing.
Get your documents ready for gen AI
Get insights from your research papers with LlamaExtract
🪄 Create rich visualizations with AI
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
Cost-efficient and pluggable Infrastructure components for GenAI inference
Kura is a simple reproduction of the CLIO paper, which uses language models to label user behaviour before recursively clustering the labels based on embeddings. This helps us understand user behaviour on…
Langtrace 🔍 is an open-source, OpenTelemetry-based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vector…
A terminal spinner for tasks that have a non-deterministic time frame.