FrozenGene

🎯

Focusing

Zhao Wu FrozenGene

🎯

Focusing

1.7k followers · 40 following

Shanghai
08:11 (UTC +08:00)

Achievements

x2 x2

Achievements

x2 x2

Organizations

Stars

llvm / torch-mlir

The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.

C++ 1,672 612 Updated Nov 7, 2025

vllm-project / llm-compressor

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,210 279 Updated Nov 7, 2025

quic / aimet

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

Python 2,492 424 Updated Nov 7, 2025

lutzroeder / netron

Visualizer for neural network, deep learning and machine learning models

JavaScript 31,744 3,021 Updated Nov 7, 2025

mlc-ai / xgrammar

Fast, Flexible and Portable Structured Generation

C++ 1,352 98 Updated Nov 7, 2025

EnzymeAD / Enzyme

High-performance automatic differentiation of LLVM and MLIR.

LLVM 1,487 144 Updated Nov 7, 2025

apache / tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Python 12,797 3,693 Updated Nov 7, 2025

NVIDIA / cutlass

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 8,741 1,520 Updated Nov 7, 2025

thu-ml / SageAttention

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 2,634 259 Updated Nov 6, 2025

mirage-project / mirage

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 1,938 148 Updated Nov 5, 2025

mlc-ai / mlc-llm

Universal LLM Deployment Engine with ML Compilation

Python 21,576 1,851 Updated Nov 4, 2025

iggredible / Learn-Vim

Learning Vim and Vimscript doesn't have to be hard. This is the guide that you're looking for 📖

14,751 1,128 Updated Oct 27, 2025

chrislgarry / Apollo-11

Original Apollo 11 Guidance Computer (AGC) source code for the command and lunar modules.

Assembly 63,826 7,357 Updated Oct 22, 2025

ossu / computer-science

🎓 Path to a free self-taught education in Computer Science!

HTML 197,223 24,586 Updated Aug 23, 2025

xlite-dev / Awesome-LLM-Inference

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 4,668 319 Updated Aug 19, 2025

vicky002 / AlgoWiki

Repository which contains links and resources on different topics of Computer Science.

CSS 4,224 1,161 Updated Aug 15, 2025

microsoft / hummingbird

Hummingbird compiles trained ML models into tensor computation for faster inference.

Python 3,496 286 Updated Jul 17, 2025

karpathy / llm.c

LLM training in simple, raw C/CUDA

Cuda 28,099 3,267 Updated Jun 26, 2025

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,929 286 Updated May 15, 2025

ZhangGe6 / onnx-modifier

A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.

JavaScript 1,572 192 Updated Feb 25, 2025

gpgpu-sim / gpgpu-sim_distribution

GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as…

C++ 1,483 583 Updated Feb 15, 2025