Stars
A high-throughput and memory-efficient inference and serving engine for LLMs
A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, with automatic mixed precision (including fp8) and easy-to-configure FSDP and DeepSpeed support
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy-to-use hardware optimization tools
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.