-
timedomAIn
- Beijing
- seanweichat
-
Sana Public
Forked from NVlabs/Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Python Apache License 2.0 Updated Mar 25, 2025
-
weight_selection_example Public
This project demonstrates how to use weight selection to derive a smaller model from an existing large model. The smaller model can then be fine-tuned on downstream tasks.
Jupyter Notebook Updated Oct 30, 2024
-
audiolm-pytorch Public
Forked from lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in PyTorch
Python MIT License Updated Jan 11, 2024
AudioLDM2 Public
Forked from haoheliu/AudioLDM2
Text-to-Audio/Music Generation
Python Other Updated Jan 9, 2024
AudioLDM-training-finetuning Public
Forked from haoheliu/AudioLDM-training-finetuning
AudioLDM training, finetuning, evaluation and inference.
Python MIT License Updated Nov 26, 2023
pretty-midi Public
Forked from craffel/pretty-midi
Utility functions for handling MIDI data in a nice/intuitive way.
Jupyter Notebook MIT License Updated Nov 1, 2023
audiocraft Public
Forked from facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Python MIT License Updated Aug 4, 2023
fairseq Public
Forked from facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Python MIT License Updated Jul 26, 2023
bark Public
Forked from suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Jupyter Notebook MIT License Updated Jul 19, 2023
Barkify Public
Forked from anyvoiceai/Barkify
Barkify: an unofficial training implementation of Bark TTS by suno-ai
Python Updated May 31, 2023
-
praat Public
Forked from praat/praat.github.io
Praat: Doing Phonetics By Computer
C Updated Nov 21, 2022
-
Idea-Box Public
Forked from AgoraIO-Community/Idea-Box
HTML GNU General Public License v3.0 Updated Jul 29, 2022
-
cnnpss Public
A Chinese version of A Neural Parametric Singing Synthesizer
-
torch_npss Public
PyTorch implementation of Neural Parametric Singing Synthesizer (singing voice synthesis)
-
PeakLimiter Public
Forked from tcarpent/PeakLimiter
PeakLimiter
-
Python-Wrapper-for-World-Vocoder Public
Forked from JeremyCCHsu/Python-Wrapper-for-World-Vocoder
A Python wrapper for the high-quality vocoder "World"
Python MIT License Updated May 26, 2021
-
textgrid Public
Forked from kylebgorman/textgrid
A Python module for interacting with Praat TextGrid files. Also includes a class for reading HTK .mlf files into Praat
-
AudioKit Public
Forked from AudioKit/AudioKit
Swift audio synthesis, processing, & analysis platform for iOS, macOS and tvOS
-
LSA_pytorch Public
PyTorch implementation of Lead Sheet Generation and Arrangement
-
README Public
Forked from guodongxiaren/README
A guide to README file syntax, that is, an introduction to GitHub Flavored Markdown syntax
-
waveglow Public
Forked from NVIDIA/waveglow
A Flow-based Generative Network for Speech Synthesis