Skip to content
View kumare3's full-sized avatar
👋
Check out https://flyte.org
👋
Check out https://flyte.org

Block or report kumare3

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A small Mac app that displays the health of your Union.ai cluster as a menu bar icon.

Python 5 1 Updated Apr 27, 2026

eBPF-based GPU causal observability agent

Go 62 6 Updated Apr 26, 2026

Reverse engineering NVIDIA SASS instruction dictionary, kernel audits and pattern recognition across GPU architectures.

Sass 193 10 Updated Apr 28, 2026

TriAttention — Efficient long reasoning with trigonometric KV cache compression. Enables OpenClaw local deployment on memory-constrained GPUs.

Python 660 53 Updated Apr 23, 2026

Sub-millisecond VM sandboxes for AI agents via copy-on-write forking

Rust 2,253 96 Updated Mar 21, 2026

FastAPI-compatible Python framework with Zig HTTP core; 7x faster, free-threading native

Zig 963 27 Updated Apr 27, 2026

Type-safe, distributed orchestration of agents, ML pipelines, and real-time inference — in pure Python with async/await.

Python 112 37 Updated Apr 29, 2026

Nsight Python is a Python kernel profiling interface based on NVIDIA Nsight Tools

Python 197 13 Updated Apr 24, 2026

Rust based high-performance Apache Uniffle shuffle-server

Rust 64 5 Updated Apr 24, 2026

JAX in JavaScript – ML library for the web, running on WebGPU & Wasm

TypeScript 799 47 Updated Apr 15, 2026

Open-Source Frontier Voice AI

Python 45,090 4,996 Updated Apr 24, 2026

🚀2.3x faster than MinIO for 4KB object payloads. RustFS is an open-source, S3-compatible high-performance object storage system supporting migration and coexistence with other S3-compatible platfor…

Rust 26,763 1,143 Updated Apr 29, 2026

Autocomp: Optimize any AI kernel, anywhere.

Python 126 8 Updated Apr 26, 2026

Read-through cache for object storage

Rust 581 13 Updated Apr 26, 2026

Up to 100x faster strings for C, C++, CUDA, Python, Rust, Swift, JS, & Go, leveraging NEON, AVX2, AVX-512, SVE, GPGPU, & SWAR to accelerate search, hashing, sorting, edit distances, sketches, and m…

C 3,444 123 Updated Mar 23, 2026

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …

Python 2,581 371 Updated Apr 29, 2026

BARCH is a local l1 + remote l2 cache with valkey and multilanguage l1 interface providing low latency ordered access

C++ 15 Updated Apr 28, 2026

[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

Python 10,930 957 Updated Apr 24, 2026

Official inference framework for 1-bit LLMs

Python 38,646 3,503 Updated Mar 10, 2026

MLX: An array framework for Apple silicon

C++ 25,842 1,732 Updated Apr 28, 2026

Nano vLLM

Python 13,169 2,012 Updated Apr 26, 2026

Ecosystem of libraries and tools for writing and executing fast GPU code fully in Rust.

Rust 5,170 229 Updated Apr 29, 2026

Format click help output nicely with rich.

Python 801 48 Updated Jan 31, 2026

🧱 secure, local and programmable sandboxes for AI agents

Rust 5,876 283 Updated Apr 29, 2026

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 3,096 441 Updated Apr 29, 2026

A unified inference and post-training framework for accelerated video generation.

Python 3,439 324 Updated Apr 28, 2026

An extremely fast Python type checker and language server, written in Rust.

Python 18,447 282 Updated Apr 28, 2026

Programmatic sandboxing tool

Rust 277 15 Updated Apr 23, 2026

A Datacenter Scale Distributed Inference Serving Framework

Rust 6,693 1,069 Updated Apr 29, 2026

Hybrid in-memory and disk cache in Rust

Rust 1,701 82 Updated Apr 24, 2026
Next