-
SenseTime
- Beijing, China
-
Wan2.2 Public
Forked from Wan-Video/Wan2.2Wan: Open and Advanced Large-Scale Video Generative Models
Python Apache License 2.0 UpdatedOct 12, 2025 -
llmc Public
Forked from ModelTC/LightCompressllmc is an efficient LLM compression tool with multiple advanced compression methods.
Python Apache License 2.0 UpdatedMay 26, 2025 -
-
TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLMTensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
C++ Apache License 2.0 UpdatedSep 18, 2024 -
-
Dipoorlet Public
Forked from ModelTC/DipoorletOffline Quantization Tools for Deploy.
-
lightllm Public
Forked from ModelTC/LightLLMLightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Python Apache License 2.0 UpdatedSep 25, 2023 -
triton Public
Forked from triton-lang/tritonDevelopment repository for the Triton language and compiler
C++ MIT License UpdatedSep 25, 2023 -
nncf Public
Forked from openvinotoolkit/nncfNeural Network Compression Framework for enhanced OpenVINO™ inference
Python Apache License 2.0 UpdatedJul 27, 2023 -
PaddleSlim Public
Forked from PaddlePaddle/PaddleSlimPaddleSlim is an open-source library for deep model compression and architecture search.
Python Apache License 2.0 UpdatedNov 17, 2022 -
Paddle Public
Forked from PaddlePaddle/PaddlePArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
C++ Apache License 2.0 UpdatedNov 10, 2022