Skip to content
View FrozenGene's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Shanghai
  • 16:15 (UTC +08:00)

Organizations

@apache @DougongAI

Block or report FrozenGene

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
12 results for source starred repositories written in Python
Clear filter

Universal LLM Deployment Engine with ML Compilation

Python 21,578 1,851 Updated Nov 4, 2025

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Python 12,800 3,693 Updated Nov 7, 2025

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 4,672 319 Updated Aug 19, 2025

Hummingbird compiles trained ML models into tensor computation for faster inference.

Python 3,496 286 Updated Jul 17, 2025

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

Python 2,492 424 Updated Nov 8, 2025

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,211 279 Updated Nov 7, 2025

Dive into Deep Learning Compiler

Python 646 95 Updated Jun 19, 2022
Python 619 65 Updated Jun 4, 2024

A library for syntactically rewriting Python programs, pronounced (sinner).

Python 68 11 Updated Feb 22, 2022

An extention of TVMScript to write simple and high performance GPU kernels with tensorcore.

Python 51 3 Updated Jul 23, 2024
Python 42 3 Updated Sep 8, 2023

TFLite python API package for parsing TFLite model

Python 12 6 Updated Jan 20, 2020