The repo is finally unlocked. Enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd Built in Rust using oh-my-codex.

Rust 183,853 107,980 Updated Apr 13, 2026

clash-verge-rev / clash-verge-rev

A modern GUI client based on Tauri, designed to run on Windows, macOS, and Linux for a tailored proxy experience

TypeScript 110,508 8,038 Updated Apr 14, 2026

OnlyTerp / kvtc

First open-source KVTC implementation (NVIDIA, ICLR 2026) -- 8-32x KV cache compression via PCA + adaptive quantization + entropy coding

Python 12 2 Updated Apr 1, 2026

wangplin / CAL-UniCNet

UniCNet is a cycle-accurate simulator supporting efficient simulation of composable chiplet networks.

C++ 4 Updated Jan 29, 2026

xuanyuanzhifeng / code-panorama

TypeScript 118 21 Updated Mar 18, 2026

gregjhogan / nvml-debug-log-decrypt

decrypt NVML debug log files

Python 1 Updated Mar 29, 2024

MoonshotAI / checkpoint-engine

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 941 82 Updated Feb 28, 2026

aibrix / PrisKV

High Performance KV Cache Store for LLM

C 53 8 Updated Apr 6, 2026

vllm-project / aibrix

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go 4,721 548 Updated Apr 13, 2026

TheNetAdmin / NVLeak

NVLeak: Off-Chip Side-Channel Attacks via Non-Volatile Memory Systems [USENIX Security '23]

TeX 20 2 Updated Nov 17, 2022

JetBrains / go-modern-guidelines

615 19 Updated Apr 8, 2026

NVIDIA-AI-IOT / jetson_dla_tutorial

A tutorial for getting started with the Deep Learning Accelerator (DLA) on NVIDIA Jetson

Python 367 37 Updated May 19, 2022

eunomia-bpf / eunomia.dev

https://github.com/eunomia-bpf homepage, documents and blogs

TypeScript 209 36 Updated Mar 17, 2026

cfregly / ai-performance-engineering

Python 1,303 185 Updated Mar 31, 2026

huggingface / transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 159,352 32,868 Updated Apr 14, 2026

datawhalechina / self-llm

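"Open-Source LLM Usage Guide": a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment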

Jupyter Notebook 29,709 2,934 Updated Apr 13, 2026

NVIDIA / Nvidia-Comms-Perf-Suite

A comprehensive toolkit for GPU Communications Libraries performance testing and data analysis.

Python 10 1 Updated Jan 6, 2026

alibaba / clusterdata

cluster data collected from production clusters in Alibaba for cluster management research

Jupyter Notebook 2,021 461 Updated Mar 12, 2026

fujitsu / A64FX

475 34 Updated Nov 3, 2023

MARD1NO / CUDA-PPT

121 19 Updated Apr 2, 2025

bytedance / vArmor

vArmor is a cloud native container sandbox system based on AppArmor/BPF/Seccomp. It also includes multiple built-in protection rules that are ready to use out of the box.

Go 456 53 Updated Apr 13, 2026

ZJU-LLMs / Foundations-of-LLMs

A book for Learning the Foundations of LLMs

16,037 1,530 Updated Dec 12, 2025

stas00 / ml-engineering

Machine Learning Engineering Open Book

Python 17,692 1,121 Updated Mar 16, 2026

jinbooooom / ai-infra-hpc

HPC tutorials covering collective communication (MPI, NCCL), CUDA programming, SIMD vectorization, RDMA communication, and more

Cuda 408 44 Updated Apr 7, 2026

intel / ipmctl

C 202 67 Updated Mar 9, 2026

ypluo / PMRAccess

Using the Persistent Memory Region in NVMe SSDs to speed up KVStore access

C++ 2 1 Updated Jul 15, 2024

OpenMPDK / SMDK

SMDK (Scalable Memory Development Kit) is developed for the Samsung CXL (Compute Express Link) Memory Expander to enable a full-stack Software-Defined Memory system

C 319 66 Updated Dec 9, 2024

cpp-projects

learn-cpp

Virtual reality

Unreal Engine

Unity

Ubuntu

Terminal

Operating system

OpenGL

MongoDB
