Stars
Inspect: A framework for large language model evaluations
DFlash: Block Diffusion for Flash Speculative Decoding
Elegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/
A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!
The official repository of Mozilla's Firefox web browser.
🤗 smolagents: a barebones library for agents that think in code.
Parameterized testing with any Python test framework
Build compute kernels and load them from the Hub.
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
SGLang is a high-performance serving framework for large language models and multimodal models.
Janus-Series: Unified Multimodal Understanding and Generation Models
Fully open reproduction of DeepSeek-R1
The package used to build the documentation of our Hugging Face repos
The official repo of MiniMax-Text-01 and MiniMax-VL-01, a large language model and a vision-language model based on linear attention
A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.
scikit-learn: machine learning in Python
Entropy Based Sampling and Parallel CoT Decoding
A unified evaluation framework for large language models
The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching of inference workloads.
PyTorch native quantization and sparsity for training and inference
Formatron empowers everyone to control the format of language models' output with minimal overhead.
Efficient and general syntactical decoding for Large Language Models