Stars
A PyTorch coding practice platform — covering LLM, Diffusion, PEFT, and more. A friendly environment to help you deeply understand deep learning components through hands-on practice. Like LeetCode, …
🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.
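The entry above centers on implementing building blocks like softmax from scratch; a minimal, numerically stable sketch in plain Python (the function name and pure-Python form are illustrative — the platform itself grades PyTorch solutions):

```python
import math

def softmax(logits):
    # Subtract the max logit before exponentiating so exp() cannot overflow;
    # this shift leaves the final probabilities unchanged.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

probs = softmax([1.0, 2.0, 3.0])
```

The resulting probabilities sum to 1, with the largest logit receiving the largest mass — exactly the invariants an auto-grader for such an exercise would check.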
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime.
Self-Supervised Speech Pre-training and Representation Learning Toolkit
VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Real-time global intelligence dashboard. AI-powered news aggregation, geopolitical monitoring, and infrastructure tracking in a unified situational awareness interface
Official Repository for "Global Rotation Equivariant Phase Modeling for Speech Enhancement with Deep Magnitude-Phase Interaction"
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
One minute of voice data is enough to train a good TTS model! (few-shot voice cloning)
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-Step High-Fidelity Audio Generation
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
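Several of the TTS entries above (F5R-TTS and the two flow-matching repos) train a network to regress a velocity field along an interpolation path between noise and data. A minimal sketch of how one training pair is formed under the common straight-line (rectified-flow) path — the function name and plain-Python vectors are illustrative, not taken from any of these repos:

```python
def cfm_training_pair(x0, x1, t):
    # Straight-line probability path: x_t = (1 - t) * x0 + t * x1,
    # where x0 is a noise sample and x1 a data sample.
    xt = [(1 - t) * a + t * b for a, b in zip(x0, x1)]
    # Regression target for the network: the constant velocity v = x1 - x0.
    v = [b - a for a, b in zip(x0, x1)]
    return xt, v

xt, v = cfm_training_pair([0.0, 0.0], [2.0, 4.0], 0.5)
```

At inference the learned velocity field is integrated from noise to data with an ODE solver, which is what lets these models synthesize speech in few steps.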
Sharing AI Infra knowledge & code exercises: introductions to the PyTorch/vLLM/SGLang frameworks ⚡️, performance acceleration 🚀, LLM fundamentals 🧠, AI hardware and software 🔧, and more
Efficient Triton Kernels for LLM Training
Minimalistic 4D-parallelism distributed training framework for education purpose
My learning notes for ML SYS.
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
Use PEFT or full-parameter training for CPT/SFT/DPO/GRPO on 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …
Robust Speech Recognition via Large-Scale Weak Supervision