Skip to content
View zvfvrv's full-sized avatar
☁️
☁️

Organizations

@netgroup @superfluidity @EveryUP @5GEVE

Block or report zvfvrv

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

Python 45,877 3,188 Updated Jun 22, 2026

Self-hosted AI workspace.

Python 75,974 9,881 Updated Jun 22, 2026

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 3,427 547 Updated Jun 22, 2026

DeepSeek 4 Flash and PRO local inference engine for Metal, CUDA and ROCm

C 14,950 1,305 Updated Jun 17, 2026

The agent that grows with you

Python 199,486 35,429 Updated Jun 22, 2026

Next Generation Agentic Proxy for AI Agents and MCP servers

Rust 3,418 567 Updated Jun 19, 2026

OpenBao is a software solution to manage, store, and distribute sensitive data including secrets, certificates, and keys.

Go 6,413 459 Updated Jun 22, 2026

Cluster API implementation for OpenStack

Go 360 302 Updated Jun 18, 2026

llm-d Router: The intelligent entry point for inference requests

Go 225 246 Updated Jun 22, 2026

Gateway API Inference Extension

Jupyter Notebook 695 293 Updated Jun 17, 2026

1 place to call all your agents - OpenCode, Hermes, Claude Managed Agents, Cursor Agents API, DeepAgents.

Rust 982 106 Updated Jun 20, 2026

Terraform Cisco IOS-XR Network-as-Code Module

HCL 2 30 Updated Jun 18, 2026

The Cloud-Native API Gateway and AI Gateway

Go 5,574 770 Updated Jun 18, 2026

Common recipes to run vLLM

JavaScript 876 309 Updated Jun 18, 2026
Python 55 21 Updated Jun 11, 2026

Beginner, advanced, expert level Rust training material

Rust 14,627 1,146 Updated Jun 11, 2026

Find secrets with Gitleaks 🔑

Go 27,809 2,124 Updated Jun 13, 2026

Tooling for optimized, validated, and reproducible GPU-accelerated AI runtime in Kubernetes

Go 333 61 Updated Jun 22, 2026

A computer you can curl ⚡

Python 2,742 210 Updated Apr 17, 2026

Connect your devices into a secure WireGuard®-based overlay network with SSO, MFA and granular access controls.

Go 26,156 1,427 Updated Jun 22, 2026

Hundreds of models & providers. One command to find what runs on your hardware.

Rust 28,478 1,746 Updated Jun 22, 2026

Build resilient agents.

Python 35,419 5,943 Updated Jun 21, 2026

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

Python 67,078 6,021 Updated Jun 22, 2026
Shell 83 5 Updated Feb 18, 2026

A modern static site generator by the Material for MkDocs team

Rust 5,007 113 Updated Jun 21, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 379,902 79,533 Updated Jun 22, 2026

🪢 Open source AI engineering platform: LLM evals, observability, metrics, prompt management, playground, datasets. Integrates with OpenTelemetry, LangChain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

TypeScript 29,511 3,069 Updated Jun 22, 2026

🎒 Token-Oriented Object Notation (TOON) – Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK.

TypeScript 24,635 1,093 Updated Jun 12, 2026

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

Python 19,713 1,526 Updated Jun 22, 2026
Next