Skip to content
View prravda's full-sized avatar
🔥
🔥

Highlights

  • Pro

Block or report prravda

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Benchmark suite for LLMs from Fireworks.ai

Python 99 34 Updated Apr 16, 2026

An open-source, AI-integrated, cross-platform terminal for seamless workflows

Go 19,534 898 Updated Apr 16, 2026

A terminal for a more modern age

TypeScript 70,415 3,955 Updated Apr 13, 2026

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …

Go 16,015 1,255 Updated Apr 16, 2026

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 2,993 414 Updated Apr 15, 2026

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 2,278 388 Updated Apr 15, 2026

eBPF-based autoinstrumentation of web applications and network metrics

Go 1,967 169 Updated Apr 15, 2026

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,557 1,007 Updated Apr 7, 2026

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,806 1,033 Updated Mar 30, 2026

rvLLM: High-performance LLM inference in Rust. Drop-in vLLM replacement.

Rust 427 47 Updated Apr 16, 2026

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 6,335 860 Updated Apr 16, 2026

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of…

Python 61,912 7,980 Updated Apr 16, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,103 679 Updated Apr 16, 2026

Machine Learning Engineering Open Book

Python 17,703 1,122 Updated Mar 16, 2026

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 16,982 1,264 Updated Apr 14, 2026

Lightpanda: the headless browser designed for AI and automation

Zig 28,793 1,226 Updated Apr 16, 2026

A vulnerability scanner for container images and filesystems

Go 12,032 781 Updated Apr 15, 2026

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

Python 8,581 950 Updated Apr 16, 2026

ODIN [for Codex CLI as a plugin] - Outline Driven development approach for agentic INtelligence

Python 10 2 Updated Mar 20, 2026

AI agents running research on single-GPU nanochat training automatically

Python 72,994 10,651 Updated Mar 26, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 76,852 15,676 Updated Apr 16, 2026

AI Observability & Evaluation

Python 9,303 821 Updated Apr 16, 2026

Lightweight static analysis for many languages. Find bug variants with patterns that look like source code.

OCaml 14,810 912 Updated Apr 16, 2026

🔥 xCrash provides the Android app with the ability to capture java crash, native crash and ANR. No root permission or any system permissions are required.

C 3,931 654 Updated Jun 27, 2025

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

Python 18,864 1,436 Updated Apr 16, 2026

⚡A CLI tool for code structural search, lint and rewriting. Written in Rust

Rust 13,430 343 Updated Apr 14, 2026

Web-based SQLite database browser written in Python

Python 4,066 392 Updated Apr 8, 2026

📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG

Python 25,324 2,146 Updated Apr 10, 2026

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

Python 35,658 2,424 Updated Apr 16, 2026

Financial data platform for analysts, quants and AI agents.

Python 65,947 6,567 Updated Apr 15, 2026
Next