Skip to content
View flozi00's full-sized avatar
  • Germany
  • 23:25 (UTC +01:00)

Organizations

@Hugging-Face-Supporter

Block or report flozi00

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

CLI proxy that reduces LLM token consumption by 60-90% on common dev commands. Single Rust binary, zero dependencies

Rust 12,017 613 Updated Mar 22, 2026

The agent that grows with you

Python 10,218 1,277 Updated Mar 22, 2026

🚀2.3x faster than MinIO for 4KB object payloads. RustFS is an open-source, S3-compatible high-performance object storage system supporting migration and coexistence with other S3-compatible platfor…

Rust 23,616 1,009 Updated Mar 22, 2026

GitNexus: The Zero-Server Code Intelligence Engine - GitNexus is a client-side knowledge graph creator that runs entirely in your browser. Drop in a GitHub repo or ZIP file, and get an interactive …

TypeScript 18,799 2,184 Updated Mar 22, 2026

State-of-the-Art Text Embeddings

Python 18,435 2,767 Updated Mar 12, 2026

Privacy first, AI meeting assistant with 4x faster Parakeet/Whisper live transcription, speaker diarization, and Ollama summarization built on Rust. 100% local processing. no cloud required. Meetil…

Rust 10,577 989 Updated Mar 16, 2026

ripgrep recursively searches directories for a regex pattern while respecting your gitignore

Rust 61,243 2,440 Updated Feb 27, 2026

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 4,868 421 Updated Mar 20, 2026

incubator repo for CUDA-TileIR backend

MLIR 122 8 Updated Mar 18, 2026

🚀 The fast, Pythonic way to build MCP servers and clients.

Python 23,897 1,843 Updated Mar 22, 2026

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 81,287 6,748 Updated Mar 20, 2026

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Python 128,275 18,123 Updated Mar 22, 2026

A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.

Python 809 132 Updated Mar 22, 2026

Windows inside a Docker container.

Shell 50,681 4,154 Updated Mar 9, 2026

#1 PDF Application on GitHub that lets you edit PDFs on any device anywhere

TypeScript 75,675 6,441 Updated Mar 21, 2026

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 2,163 184 Updated Mar 21, 2026

ArcticInference: vLLM plugin for high-throughput, low-latency inference

Python 415 53 Updated Mar 3, 2026

CUDA Python: Performance meets Productivity

Cython 3,192 260 Updated Mar 22, 2026

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,354 443 Updated Mar 9, 2026

Firmware replacement for Growatt ShineWiFi-S

C++ 424 118 Updated Feb 5, 2026

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…

Python 39,950 6,626 Updated Mar 22, 2026

A generative world for general-purpose robotics & embodied AI learning.

Python 28,322 2,629 Updated Mar 22, 2026

An extremely fast Python package and project manager, written in Rust.

Rust 81,750 2,829 Updated Mar 22, 2026

Get your documents ready for gen AI

Python 56,299 3,821 Updated Mar 20, 2026

Material for gpu-mode lectures

Jupyter Notebook 5,869 587 Updated Feb 1, 2026

PyTorch native quantization and sparsity for training and inference

Python 2,741 465 Updated Mar 22, 2026

Multi-GPU CUDA stress test

C++ 2,135 399 Updated Nov 4, 2025

Efficient Triton Kernels for LLM Training

Python 6,226 504 Updated Mar 20, 2026

Machine Learning Engineering Open Book

Python 17,482 1,108 Updated Mar 16, 2026

FlashInfer: Kernel Library for LLM Serving

Python 5,199 820 Updated Mar 22, 2026
Next