Lists (4)
Sort Name ascending (A-Z)
Starred repositories
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
Cost-efficient and pluggable Infrastructure components for GenAI inference
Community maintained hardware plugin for vLLM on Ascend
Fast and memory-efficient exact attention
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
The definitive Web UI for local AI, with powerful features and easy setup.
Fast & memory efficient hashtable based on robin hood hashing for C++11/14/17/20
You like pytorch? You like micrograd? You love tinygrad! ❤️
⚡ A Fast, Extensible Progress Bar for Python and CLI
A high-throughput and memory-efficient inference and serving engine for LLMs
The official gpt4free repository | various collection of powerful language models | o4, o3 and deepseek r1, gpt-4.1, gemini 2.5
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
🐶 Kubernetes CLI To Manage Your Clusters In Style!
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
Tapir extension to LLVM for optimizing Parallel Programs