Skip to content
View prravda's full-sized avatar
🔥
🔥

Highlights

  • Pro

Block or report prravda

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

An open-source, AI-integrated, cross-platform terminal for seamless workflows

Go 19,405 895 Updated Apr 10, 2026

A terminal for a more modern age

TypeScript 70,248 3,949 Updated Mar 20, 2026

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …

Go 15,985 1,249 Updated Apr 10, 2026

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 2,956 400 Updated Apr 10, 2026

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 2,269 386 Updated Apr 9, 2026

eBPF-based autoinstrumentation of web applications and network metrics

Go 1,964 169 Updated Apr 10, 2026

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,558 1,002 Updated Apr 7, 2026

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,801 1,028 Updated Mar 30, 2026

rvLLM: High-performance LLM inference in Rust. Drop-in vLLM replacement.

Rust 414 44 Updated Apr 8, 2026

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 6,320 857 Updated Mar 22, 2026

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of…

Python 60,220 7,662 Updated Apr 10, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,069 662 Updated Apr 10, 2026

Machine Learning Engineering Open Book

Python 17,657 1,119 Updated Mar 16, 2026

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 16,946 1,260 Updated Apr 9, 2026

Lightpanda: the headless browser designed for AI and automation

Zig 28,251 1,189 Updated Apr 10, 2026

A vulnerability scanner for container images and filesystems

Go 12,002 779 Updated Apr 10, 2026

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

Python 8,573 946 Updated Apr 8, 2026

ODIN [for Codex CLI as a plugin] - Outline Driven development approach for agentic INtelligence

Python 10 2 Updated Mar 20, 2026

AI agents running research on single-GPU nanochat training automatically

Python 70,016 10,199 Updated Mar 26, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 76,038 15,430 Updated Apr 10, 2026

AI Observability & Evaluation

Jupyter Notebook 9,236 805 Updated Apr 10, 2026

Lightweight static analysis for many languages. Find bug variants with patterns that look like source code.

OCaml 14,754 908 Updated Apr 10, 2026

🔥 xCrash provides the Android app with the ability to capture java crash, native crash and ANR. No root permission or any system permissions are required.

C 3,932 654 Updated Jun 27, 2025

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

Python 18,750 1,432 Updated Apr 10, 2026

⚡A CLI tool for code structural search, lint and rewriting. Written in Rust

Rust 13,375 341 Updated Apr 9, 2026

Web-based SQLite database browser written in Python

Python 4,060 392 Updated Apr 8, 2026

📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG

Python 24,899 2,091 Updated Apr 10, 2026

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

Python 35,571 2,413 Updated Apr 10, 2026

Financial data platform for analysts, quants and AI agents.

Python 65,668 6,522 Updated Apr 10, 2026

Fastest enterprise AI gateway (50x faster than LiteLLM) with adaptive load balancer, cluster mode, guardrails, 1000+ models support & <100 µs overhead at 5k RPS.

Go 3,673 408 Updated Apr 10, 2026
Next