Starred repositories
Cost-efficient and pluggable Infrastructure components for GenAI inference
Community maintained hardware plugin for vLLM on Ascend
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.
Accessible large language models via k-bit quantization for PyTorch.
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch
A high-throughput and memory-efficient inference and serving engine for LLMs
Lantern官方版本下载 蓝灯 翻墙 代理 科学上网 外网 加速器 梯子 路由 - Быстрый, надежный и безопасный доступ к открытому интернету - lantern proxy vpn censorship-circumvention censorship gfw accelerator پراکسی لنترن، ضدسانسور…
Lantern官方版本下载 蓝灯 翻墙 代理 科学上网 外网 加速器 梯子 路由 proxy vpn circumvention gfw
A collection of (mostly) technical things every software developer should know about
⚡ A Fast, Extensible Progress Bar for Python and CLI