Stars
Visual Studio Code extension generator
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.
A safetensors extension to efficiently store sparse quantized tensors on disk
kubectl plugin to list allocations (cpu, memory, gpu,... X utilization, requested, limit, allocatable,...)
Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).
Track emissions from Compute and recommend ways to reduce their impact on the environment.
iKeramat / HoRNDIS
Forked from jwise/HoRNDIS
Android USB tethering driver for Mac OS X
A framework for few-shot evaluation of language models.
LLM model quantization (compression) toolkit with HW acceleration support for Nvidia, AMD, Intel GPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
DLRover: An Automatic Distributed Deep Learning System
Code and documentation to train Stanford's Alpaca models, and generate the data.
RewardBench: the first evaluation tool for reward models.
TVM Documentation in Chinese Simplified / TVM 中文文档
Given an existing docker container, prints the command line necessary to run a copy of it.
You like pytorch? You like micrograd? You love tinygrad! ❤️
Reference implementations of MLPerf® inference benchmarks
NVIDIA Linux open GPU with P2P support
NVIDIA Linux open GPU kernel module source
A toolkit to run Ray applications on Kubernetes
The Arcade Learning Environment (ALE) -- a platform for AI research.
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)