Skip to content
View GavinPayne-lab's full-sized avatar

Block or report GavinPayne-lab

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A PyTorch coding practice platform — covering LLM, Diffusion, PEFT, and more A friendly environment to help you deeply understand deep learning components through hands-on practice. Like LeetCode, …

Jupyter Notebook 421 20 Updated Apr 3, 2026

🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.

Jupyter Notebook 3,960 328 Updated Mar 27, 2026

SOTA Open Source TTS

Python 30,366 2,576 Updated May 12, 2026

A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)

Python 1,002 102 Updated Apr 2, 2023

A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime.

Python 3,895 506 Updated Mar 12, 2026

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,554 534 Updated Mar 12, 2026
Python 207 40 Updated May 8, 2026

VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning

Python 19,009 2,262 Updated May 11, 2026

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 16,094 1,675 Updated Mar 17, 2026

Real-time global intelligence dashboard. AI-powered news aggregation, geopolitical monitoring, and infrastructure tracking in a unified situational awareness interface

TypeScript 54,312 8,739 Updated May 17, 2026

Official Repository for "Global Rotation Equivariant Phase Modeling for Speech Enhancement with Deep Magnitude-Phase Interaction"

Python 16 3 Updated May 15, 2026

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 6,256 683 Updated Aug 10, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 57,541 6,278 Updated Apr 30, 2026

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 45,322 6,097 Updated Aug 16, 2024

Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"

Python 158 18 Updated Mar 3, 2026

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Jupyter Notebook 604 130 Updated Sep 18, 2023

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 21,069 2,428 Updated May 3, 2026
Python 1,412 411 Updated May 2, 2026

Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-Step High-Fidelity Audio Generation

Python 142 8 Updated Mar 8, 2026

Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching

Python 977 142 Updated Dec 2, 2025

分享AI Infra知识&代码练习:PyTorch/vLLM/SGLang框架入门⚡️、性能加速🚀、大模型基础🧠、AI软硬件🔧等

Jupyter Notebook 2,277 193 Updated May 8, 2026

Leetcode for Pytorch

Jupyter Notebook 2,045 262 Updated Jan 19, 2026

Efficient Triton Kernels for LLM Training

Python 6,358 528 Updated May 16, 2026

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 2,182 184 Updated Aug 26, 2025

My learning notes for ML SYS.

Python 6,316 417 Updated Apr 23, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,186 650 Updated May 10, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …

Python 14,143 1,416 Updated May 17, 2026
Python 311 51 Updated Jun 21, 2025

Robust Speech Recognition via Large-Scale Weak Supervision

Python 99,615 12,197 Updated Apr 15, 2026

GLM-4-Voice | 端到端中英语音对话模型

Python 3,177 278 Updated Dec 5, 2024
Next