-
peft Public
Forked from huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Python Apache License 2.0 Updated Jun 9, 2026 -
kernels-community Public
Forked from huggingface/kernels-community
Kernel sources for https://huggingface.co/kernels-community
Python Updated Jun 7, 2026 -
transformers Public
Forked from huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 Updated Jun 4, 2026 -
axolotl-dev Public
Forked from axolotl-ai-cloud/axolotl
Go ahead and axolotl questions
-
dflash Public
Forked from z-lab/dflash
DFlash: Block Diffusion for Flash Speculative Decoding
Python MIT License Updated May 10, 2026 -
trl Public
Forked from huggingface/trl
Train transformer language models with reinforcement learning.
Python Apache License 2.0 Updated Apr 20, 2026 -
sglang Public
Forked from sgl-project/sglang
SGLang is a high-performance serving framework for large language models and multimodal models.
Python Apache License 2.0 Updated Apr 20, 2026 -
hermes-agent Public
Forked from NousResearch/hermes-agent
The agent that grows with you
Python MIT License Updated Apr 14, 2026 -
distil Public
Forked from unarbos/distil
Distil SN97 - Competitive Model Distillation on Bittensor
Python MIT License Updated Apr 7, 2026 -
ao Public
Forked from pytorch/ao
PyTorch native quantization and sparsity for training and inference
Python Other Updated Apr 2, 2026 -
parameter-golf Public
Forked from openai/parameter-golf
Train the smallest LM you can that fits in 16MB. Best model wins!
Python MIT License Updated Mar 24, 2026 -
modded-nanogpt Public
Forked from KellerJordan/modded-nanogpt
NanoGPT (124M) in 2 minutes
Python MIT License Updated Mar 11, 2026 -
kernels Public
Forked from huggingface/kernels
Load compute kernels from the Hub
Python Apache License 2.0 Updated Feb 15, 2026 -
bitsandbytes Public
Forked from bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
Python MIT License Updated Jan 7, 2026 -
OpenEnv Public
Forked from huggingface/OpenEnv
An interface library for RL post training with environments.
Python BSD 3-Clause "New" or "Revised" License Updated Oct 21, 2025 -
flash-attention Public
Forked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Python BSD 3-Clause "New" or "Revised" License Updated Oct 21, 2025 -
accelerate Public
Forked from huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
-
MoMoE-impl Public
Forked from tilde-research/momoe-release
Memory optimized Mixture of Experts
Python Updated Jul 25, 2025 -
ring-flash-attention Public
Forked from zhuzilin/ring-flash-attention
Ring attention implementation with flash attention
Python MIT License Updated Jul 23, 2025 -
Liger-Kernel Public
Forked from linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
Python BSD 2-Clause "Simplified" License Updated Jul 2, 2025 -
lighteval Public
Forked from huggingface/lighteval
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
Python MIT License Updated May 23, 2025 -
mamba Public
Forked from state-spaces/mamba
Mamba SSM architecture
Python Apache License 2.0 Updated May 17, 2025 -
triton_eval Public
Forked from tcapelle/triton_eval
A simple way of measuring triton kernels
-
atropos Public
Forked from NousResearch/atropos
Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments
Python MIT License Updated May 8, 2025 -
Absolute-Zero-Reasoner Public
Forked from LeapLabTHU/Absolute-Zero-Reasoner
Python Updated May 7, 2025 -
KernelBench Public
Forked from ScalingIntelligence/KernelBench
KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA problems
Python Other Updated Apr 7, 2025