eecs498
Fast inference from large language models via speculative decoding
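Several entries in this list build on speculative decoding, so a minimal greedy draft-then-verify sketch may be useful context; `target_logits_fn`, `draft_logits_fn`, and the helper name are illustrative placeholders, not any repo's API:

```python
import torch

def greedy_speculative_step(target_logits_fn, draft_logits_fn, tokens, k=4):
    """One draft-then-verify step of greedy speculative decoding.

    target_logits_fn / draft_logits_fn (placeholder names) map a 1-D
    LongTensor of token ids to per-position logits of shape (len, vocab).
    """
    # Draft: the small model proposes k tokens autoregressively.
    draft = tokens.clone()
    for _ in range(k):
        draft = torch.cat([draft, draft_logits_fn(draft)[-1].argmax().view(1)])

    # Verify: a single target forward pass scores all k proposals at once.
    target_preds = target_logits_fn(draft).argmax(dim=-1)

    # Accept the longest prefix the target agrees with, then append the
    # target's own next token, so each step always yields >= 1 new token.
    n, accepted = len(tokens), 0
    while accepted < k and draft[n + accepted] == target_preds[n + accepted - 1]:
        accepted += 1
    return torch.cat([draft[: n + accepted],
                      target_preds[n + accepted - 1].view(1)])
```

The key property: one target-model forward pass verifies k drafted tokens, so each step emits between 1 and k+1 tokens while exactly matching greedy decoding of the target model.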
[ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models
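For background on the linearized-attention side, below is the generic O(T) recurrent form of causal linear attention with the elu(x) + 1 feature map from Katharopoulos et al. (2020); this is a background sketch, not this paper's specific method:

```python
import torch

def causal_linear_attention(q, k, v, eps=1e-6):
    """O(T) recurrent form of causal linear attention; q, k, v are (T, d)."""
    phi_q = torch.nn.functional.elu(q) + 1
    phi_k = torch.nn.functional.elu(k) + 1
    T, d = q.shape
    S = torch.zeros(d, d)            # running sum of phi(k_s) v_s^T
    z = torch.zeros(d)               # running normalizer sum of phi(k_s)
    out = torch.empty_like(v)
    for t in range(T):               # constant per-step state, unlike softmax attention
        S = S + torch.outer(phi_k[t], v[t])
        z = z + phi_k[t]
        out[t] = (phi_q[t] @ S) / (phi_q[t] @ z + eps)
    return out
```

The fixed-size state (S, z) is what makes autoregressive decoding O(1) per token, versus the growing KV cache of softmax attention.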
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
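For a flavor of that Python API, a sketch along the lines of the high-level `LLM` interface documented in recent releases; import paths, the model name, and output fields vary by version, so treat this as a version-dependent sketch:

```python
from tensorrt_llm import LLM, SamplingParams

# Model name is just an example; the engine is built on first use.
llm = LLM(model="TinyLlama/TinyLlama-1.1B-Chat-v1.0")
params = SamplingParams(max_tokens=32, temperature=0.8)
for out in llm.generate(["Speculative decoding is"], params):
    print(out.outputs[0].text)
```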
Large Language Model Text Generation Inference
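A running TGI server is queried over HTTP; one way from Python is `huggingface_hub.InferenceClient` (the local URL and prompt below are placeholders):

```python
from huggingface_hub import InferenceClient

# Assumes a TGI server is already running locally on port 8080.
client = InferenceClient("http://localhost:8080")
print(client.text_generation("What is speculative decoding?", max_new_tokens=64))
```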
Machine Learning Engineering Open Book
Building Transformer Models with PyTorch 2.0, by BPB Publications
Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).
Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
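A minimal usage sketch via the `pipeline` API (the model choice is arbitrary):

```python
from transformers import pipeline

generator = pipeline("text-generation", model="gpt2")  # model choice is arbitrary
print(generator("Speculative decoding is", max_new_tokens=20)[0]["generated_text"])
```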
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
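A self-contained toy training loop showing the usual `prepare`/`backward` pattern; the tiny model and data are stand-ins:

```python
import torch
from torch.utils.data import DataLoader, TensorDataset
from accelerate import Accelerator

accelerator = Accelerator()  # device placement and precision come from the launch config
model = torch.nn.Linear(8, 1)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
loader = DataLoader(TensorDataset(torch.randn(64, 8), torch.randn(64, 1)), batch_size=16)
model, optimizer, loader = accelerator.prepare(model, optimizer, loader)

for x, y in loader:
    optimizer.zero_grad()
    loss = torch.nn.functional.mse_loss(model(x), y)
    accelerator.backward(loss)  # replaces loss.backward()
    optimizer.step()
```

The same script then scales from a laptop to multi-GPU or TPU via `accelerate launch`, with no code changes.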
Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.
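The whole API is essentially one call; the `speedup` function below is an arbitrary example object:

```python
import fire

def speedup(baseline_ms: float, new_ms: float) -> float:
    """Arbitrary example function; any Python object can be exposed."""
    return baseline_ms / new_ms

if __name__ == "__main__":
    fire.Fire(speedup)  # usage: python speedup.py 90 30  (or --baseline_ms=90 --new_ms=30)
```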
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
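A minimal task-parallel sketch using Ray core (the `square` task is a placeholder):

```python
import ray

ray.init()  # local Ray runtime; connects to a cluster if one is configured

@ray.remote
def square(x):  # placeholder task
    return x * x

# Tasks are scheduled in parallel; ray.get blocks until results arrive.
print(ray.get([square.remote(i) for i in range(8)]))
```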
Unsupervised text tokenizer for Neural Network-based text generation.
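A toy sketch of training and using a SentencePiece model entirely in memory, following the Python bindings' iterator-based training interface; the corpus and vocabulary size are arbitrary:

```python
import io
import sentencepiece as spm

# Toy corpus and vocab size; real training uses a large text file.
lines = ["speculative decoding is fast", "draft then verify", "verify fast drafts"] * 10
model = io.BytesIO()
spm.SentencePieceTrainer.train(sentence_iterator=iter(lines), model_writer=model,
                               vocab_size=50, model_type="bpe")
sp = spm.SentencePieceProcessor(model_proto=model.getvalue())
print(sp.encode("speculative decoding", out_type=str))
```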
[CCS 2024] Optimization-based Prompt Injection Attack to LLM-as-a-Judge
📰 Must-read papers and blogs on Speculative Decoding ⚡️
Adaptive Draft-Verification for Efficient Large Language Model Decoding (AAAI 2025 Oral)
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
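A minimal single-process sketch of the `deepspeed.initialize` pattern, with a toy model and an inline config; real jobs are usually launched with the `deepspeed` CLI and a JSON config file:

```python
import torch
import deepspeed

model = torch.nn.Linear(8, 1)  # toy model
ds_config = {"train_batch_size": 16,
             "optimizer": {"type": "Adam", "params": {"lr": 1e-3}}}
# initialize() wraps the model in DeepSpeed's engine, which owns the optimizer step.
engine, optimizer, _, _ = deepspeed.initialize(model=model,
                                               model_parameters=model.parameters(),
                                               config=ds_config)
x = torch.randn(16, 8).to(engine.device)
loss = engine(x).pow(2).mean()
engine.backward(loss)  # replaces loss.backward()
engine.step()          # replaces optimizer.step()
```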