gushiqiao
  • SenseTime
  • Beijing, China


📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda · 8,999 stars · 881 forks · Updated Dec 4, 2025

Light-tts is a lightweight TTS inference framework optimized for CosyVoice2, enabling fast and scalable speech synthesis in Python.

Python · 13 stars · 1 fork · Updated Nov 28, 2025

Light Video Generation Inference Framework

Python · 1,256 stars · 79 forks · Updated Dec 19, 2025

[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLM, VLM, and video generation models.

Python · 642 stars · 64 forks · Updated Nov 19, 2025

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python · 12,439 stars · 1,970 forks · Updated Dec 21, 2025

Development repository for the Triton language and compiler

MLIR · 17,893 stars · 2,462 forks · Updated Dec 21, 2025

🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools

Python · 3,222 stars · 611 forks · Updated Dec 19, 2025

Offline Quantization Tools for Deploy.

Python · 1 star · Updated Dec 28, 2023

PaddleFormers is an easy-to-use library of pre-trained large language model zoo based on PaddlePaddle.

Python · 12,948 stars · 2,159 forks · Updated Dec 19, 2025