-
diffrhythm2 Public
Forked from xiaomi-research/diffrhythm2Python Apache License 2.0 UpdatedOct 27, 2025 -
-
-
ACE-Step Public
Forked from ace-step/ACE-StepACE-Step: A Step Towards Music Generation Foundation Model
Python Apache License 2.0 UpdatedMay 8, 2025 -
MusicInfuser Public
Forked from SusungHong/MusicInfuserPython Apache License 2.0 UpdatedMar 19, 2025 -
-
free-svc Public
Forked from freds0/free-svc[ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion
Python MIT License UpdatedJan 7, 2025 -
SALMONN Public
Forked from bytedance/SALMONNSALMONN: Speech Audio Language Music Open Neural Network
Python Apache License 2.0 UpdatedDec 12, 2024 -
flow_matching Public
Forked from facebookresearch/flow_matchingA PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
Python Other UpdatedDec 10, 2024 -
-
WaveFM Public
Forked from luotianze666/WaveFMWaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching
Python UpdatedOct 29, 2024 -
F5-TTS Public
Forked from SWivid/F5-TTSOfficial code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Python MIT License UpdatedOct 15, 2024 -
codec-bpe Public
Forked from AbrahamSanders/codec-bpeImplementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs
Python MIT License UpdatedSep 21, 2024 -
seed-vc Public
Forked from Plachtaa/seed-vczero-shot voice conversion with in context learning
Python MIT License UpdatedSep 3, 2024 -
beat_this Public
Forked from CPJKU/beat_thisAccurate and general beat tracker
Python MIT License UpdatedAug 6, 2024 -
Prompt-Singer Public
Forked from cyanbx/Prompt-SingerImplementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24).
Python MIT License UpdatedJun 21, 2024 -
DDPM-Midi2Performance-Model Public
Forked from FlyToYourMooN/DDPM-Midi2Performance-ModelMusic generation
Python UpdatedMay 2, 2024 -
snac Public
Forked from hubertsiuzdak/snacMulti-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate
Python MIT License UpdatedApr 9, 2024 -
-
VoiceCraft Public
Forked from jasonppy/VoiceCraftZero-Shot Speech Editing and Text-to-Speech in the Wild
Python Other UpdatedMar 25, 2024 -
DTTNet-Pytorch Public
Forked from junyuchen-cjy/DTTNet-PytorchAn official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation
Python Apache License 2.0 UpdatedMar 19, 2024 -
LODGE Public
Forked from li-ronghui/LODGEThe code the CVPR2024 paper Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives
Python UpdatedMar 19, 2024 -
FineDance Public
Forked from li-ronghui/FineDanceFineDance: A Fine-grained Choreography Dataset for 3D Full Body Dance Generation. (ICCV2023)
Python Other UpdatedMar 18, 2024 -
-
PAM Public
Forked from soham97/PAMPAM is a no-reference audio quality metric for audio generation tasks
Python MIT License UpdatedMar 1, 2024 -
audioseal Public
Forked from facebookresearch/audiosealLocalized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector
Python MIT License UpdatedFeb 21, 2024 -
languagecodec Public
Forked from jishengpeng/LanguagecodecOfficial code repository of Language-Codec
Python MIT License UpdatedFeb 20, 2024 -
-
-
pinyin-to-ipa Public
Forked from stefantaubert/pinyin-to-ipaCommand-line interface and Python library to transcribe pinyin to IPA. The tones are attached to the vowel of the syllable.
Python MIT License UpdatedFeb 9, 2024