chenchy

aaronchen chenchy

dasp, mir, machine learning

92 followers · 1.5k following

diffrhythm2 Public
Forked from xiaomi-research/diffrhythm2

Python Apache License 2.0 Updated Oct 27, 2025
indexTTS2 Public
Forked from iszhanjiawei/indexTTS2

Python Apache License 2.0 Updated Sep 6, 2025
ChinaTextbook Public
Forked from TapXWorld/ChinaTextbook

All PDF textbooks for primary school, middle school, high school, and university.

Roff Updated May 15, 2025
ACE-Step Public
Forked from ace-step/ACE-Step

ACE-Step: A Step Towards Music Generation Foundation Model

Python Apache License 2.0 Updated May 8, 2025
MusicInfuser Public
Forked from SusungHong/MusicInfuser

Python Apache License 2.0 Updated Mar 19, 2025
discoder Public
Forked from ETH-DISCO/discoder

Python MIT License Updated Feb 24, 2025
free-svc Public
Forked from freds0/free-svc

[ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion

Python MIT License Updated Jan 7, 2025
SALMONN Public
Forked from bytedance/SALMONN

SALMONN: Speech Audio Language Music Open Neural Network

Python Apache License 2.0 Updated Dec 12, 2024
flow_matching Public
Forked from facebookresearch/flow_matching

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python Other Updated Dec 10, 2024
autosing Public
Forked from streichgeorg/autosing

Python Updated Nov 27, 2024
WaveFM Public
Forked from luotianze666/WaveFM

WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching

Python Updated Oct 29, 2024
F5-TTS Public
Forked from SWivid/F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python MIT License Updated Oct 15, 2024
codec-bpe Public
Forked from AbrahamSanders/codec-bpe

Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs

Python MIT License Updated Sep 21, 2024
seed-vc Public
Forked from Plachtaa/seed-vc

zero-shot voice conversion with in context learning

Python MIT License Updated Sep 3, 2024
beat_this Public
Forked from CPJKU/beat_this

Accurate and general beat tracker

Python MIT License Updated Aug 6, 2024
Prompt-Singer Public
Forked from cyanbx/Prompt-Singer

Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24).

Python MIT License Updated Jun 21, 2024
DDPM-Midi2Performance-Model Public
Forked from FlyToYourMooN/DDPM-Midi2Performance-Model

Music generation

Python Updated May 2, 2024
snac Public
Forked from hubertsiuzdak/snac

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

Python MIT License Updated Apr 9, 2024
MahaTTS Public
Forked from dubverse-ai/MahaTTS

Python Apache License 2.0 Updated Mar 27, 2024
VoiceCraft Public
Forked from jasonppy/VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Python Other Updated Mar 25, 2024
DTTNet-Pytorch Public
Forked from junyuchen-cjy/DTTNet-Pytorch

An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation

Python Apache License 2.0 Updated Mar 19, 2024
LODGE Public
Forked from li-ronghui/LODGE

The code for the CVPR 2024 paper Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives

Python Updated Mar 19, 2024
FineDance Public
Forked from li-ronghui/FineDance

FineDance: A Fine-grained Choreography Dataset for 3D Full Body Dance Generation. (ICCV2023)

Python Other Updated Mar 18, 2024
MQTTS Public
Forked from b04901014/MQTTS

Python MIT License Updated Mar 7, 2024
PAM Public
Forked from soham97/PAM

PAM is a no-reference audio quality metric for audio generation tasks

Python MIT License Updated Mar 1, 2024
audioseal Public
Forked from facebookresearch/audioseal

Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector

Python MIT License Updated Feb 21, 2024
languagecodec Public
Forked from jishengpeng/Languagecodec

Official code repository of Language-Codec

Python MIT License Updated Feb 20, 2024
rule-guided-music Public
Forked from yjhuangcd/rule-guided-music

Python Updated Feb 19, 2024
airgen Public
Forked from Kikyo-16/airgen

Python MIT License Updated Feb 16, 2024
pinyin-to-ipa Public
Forked from stefantaubert/pinyin-to-ipa

Command-line interface and Python library to transcribe pinyin to IPA. The tones are attached to the vowel of the syllable.

Python MIT License Updated Feb 9, 2024
