LLM
Distribute and run LLMs with a single file.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
SGLang is a high-performance serving framework for large language models and multimodal models.
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤
A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).
Train transformer language models with reinforcement learning.
Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.
[ICLR-2025-SLLM Spotlight 🔥]MobiLlama : Small Language Model tailored for edge devices
Robust recipes to align language models with human and AI preferences
Distributed LLM inference. Connect home devices into a powerful cluster to accelerate LLM inference. More devices means faster inference.
DSPy: The framework for programming—not prompting—language models
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
Sparsify transformers with SAEs and transcoders