Starred repositories
A framework for efficient model inference with omni-modality models
Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
Cost-efficient and pluggable Infrastructure components for GenAI inference
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
Ohayou(おはよう), HTTP load generator, inspired by rakyll/hey with tui animation.
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
Large-scale text-video dataset. 10 million captioned short videos.
Accessible large language models via k-bit quantization for PyTorch.
A high-throughput and memory-efficient inference and serving engine for LLMs
🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
Elegant and Powerfull. Powered by OpenAI and Vercel.
Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.
TensorFlow, TensorFlow-Lite Pytorch, Torchvision, TensorRT Benchmarks
📚 Freely available programming books
Your ultimate Go microservices framework for the cloud-native era.
Bear is a tool that generates a compilation database for clang tooling.
Sol3 (sol2 v3.0) - a C++ <-> Lua API wrapper with advanced features and top notch performance - is here, and it's great! Documentation:
✅ Solutions to LeetCode by Go, 100% test coverage, runtime beats 100% / LeetCode 题解
A General-purpose Task-parallel Programming System using Modern C++
分享 GitHub 上有趣、入门级的开源项目。Share interesting, entry-level open source projects on GitHub.
📚 Modern C++ Tutorial: C++11/14/17/20 On the Fly | https://changkun.de/modern-cpp/