Starred repositories
A high-throughput and memory-efficient inference and serving engine for LLMs
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
You like pytorch? You like micrograd? You love tinygrad! ❤️
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
Time series forecasting with PyTorch
https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
32 times longer context window than vanilla Transformers and up to 4 times longer than memory-efficient Transformers.
Implementation of Qformer from BLIP2 in Zeta Lego blocks.