- Roma
Lists (5)
Sort Name ascending (A-Z)
Starred repositories
Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
DeepSeek 4 Flash and PRO local inference engine for Metal, CUDA and ROCm
The agent that grows with you
Next Generation Agentic Proxy for AI Agents and MCP servers
OpenBao is a software solution to manage, store, and distribute sensitive data including secrets, certificates, and keys.
Cluster API implementation for OpenStack
llm-d Router: The intelligent entry point for inference requests
Gateway API Inference Extension
1 place to call all your agents - OpenCode, Hermes, Claude Managed Agents, Cursor Agents API, DeepAgents.
Terraform Cisco IOS-XR Network-as-Code Module
The Cloud-Native API Gateway and AI Gateway
Beginner, advanced, expert level Rust training material
Tooling for optimized, validated, and reproducible GPU-accelerated AI runtime in Kubernetes
Connect your devices into a secure WireGuard®-based overlay network with SSO, MFA and granular access controls.
Hundreds of models & providers. One command to find what runs on your hardware.
Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.
A modern static site generator by the Material for MkDocs team
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
🪢 Open source AI engineering platform: LLM evals, observability, metrics, prompt management, playground, datasets. Integrates with OpenTelemetry, LangChain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
🎒 Token-Oriented Object Notation (TOON) – Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK.
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.