Skip to content
View KdaiP's full-sized avatar

Block or report KdaiP

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3

Python 432 46 Updated Sep 13, 2024

A curated list of my favourite music DSP and audio programming resources

2,812 98 Updated Mar 19, 2025

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…

875 85 Updated Jul 8, 2025

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

2,099 255 Updated Jun 6, 2024

Reference-aware automatic speech evaluation toolkit

Python 172 14 Updated Dec 5, 2024

小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫

Python 40,434 9,094 Updated Dec 18, 2025

Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.

TypeScript 40,080 7,795 Updated Dec 19, 2025

JiOu-LLM: 基于llama2的奇偶数判别模型

Python 5 1 Updated Mar 11, 2024

Implementation of Differentiable Digital Signal Processing (DDSP) in Pytorch

C 500 60 Updated Oct 28, 2023

DDSP: Differentiable Digital Signal Processing

Python 3,174 369 Updated Sep 30, 2025

逐行解释的pytorch自编码器实现,使用MNIST数据集进行训练,保证代码简单。

Python 18 1 Updated Feb 9, 2024

An Open Source text-to-speech system built by inverting Whisper.

Jupyter Notebook 4,540 263 Updated Dec 14, 2025

Train the next generation of TTS systems.

Python 170 17 Updated Sep 13, 2024

distortion/saturation plugin

C++ 60 1 Updated Jul 12, 2024

JS Inflator is a copy of Sonox Inflator.

C++ 385 34 Updated Jan 11, 2025

Conditioning and feature fusion methods such as FiLM, Conditional Layer Norm and AdaIN.

Python 11 2 Updated Feb 10, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

70,839 8,105 Updated Dec 21, 2025

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,849 139 Updated Jul 5, 2024

A curated list of JUCE modules, templates, plugins, oh my!

Ruby 1,109 53 Updated Dec 21, 2025

Collection of tutorials & resources for the C++ library JUCE

Makefile 125 11 Updated Jun 7, 2020

Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。

Python 4,149 381 Updated Aug 13, 2024

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

Python 313 49 Updated Aug 25, 2021

Finetune MobileSAM with Less Than 4GB RAM!

Jupyter Notebook 32 6 Updated Nov 12, 2023

a huggingface mirror site.

321 45 Updated Mar 18, 2024

An Efficient Lexical Analyzer for Chinese

Python 2,094 338 Updated Jan 31, 2022

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 96,064 26,317 Updated Dec 21, 2025

Vector (and Scalar) Quantization, in Pytorch

Python 3,777 309 Updated Dec 16, 2025

Extract the voice and corresponding text

C# 88 9 Updated Jan 20, 2025

Fast Segment Anything

Python 8,203 746 Updated Jul 30, 2024

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 9,757 622 Updated Feb 21, 2025
Next