-
-
vllm_1 Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedJul 14, 2025 -
-
-
-
CosyVoice Public
Forked from FunAudioLLM/CosyVoiceMulti-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Python Apache License 2.0 UpdatedMay 6, 2025 -
-
iree Public
Forked from iree-org/ireeA retargetable MLIR-based machine learning compiler and runtime toolkit.
C++ Apache License 2.0 UpdatedMay 10, 2024 -
iree-turbine Public
Forked from iree-org/iree-turbineIREE's PyTorch Frontend, based on Torch Dynamo.
Python Apache License 2.0 UpdatedMay 10, 2024 -
torch-mlir Public
Forked from iree-org/torch-mlirFork of the torch-mlir project for carrying pre-integrate patches and branches.
C++ Other UpdatedMay 10, 2024 -
weight-sharing-quant Public
Forked from shuzhangzhong/weight-sharing-quantPython Other UpdatedJul 5, 2023 -
HMCOS Public
Forked from wzh99/HMCOSImplementation of DAC'22 paper: Hierarchical Memory-Constrained Operator Scheduling of Neural Architecture Search Networks.
C++ UpdatedSep 5, 2022 -
-
FastSpeech2 Public
Forked from ming024/FastSpeech2An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"