-
audiogym Public
Forked from cocktailpeanut/fluxgym
Dead simple Stable Audio training UI with LOW VRAM support
-
ComfyUI Public
Forked from Comfy-Org/ComfyUI
The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.
-
audiocraft_plus Public
Forked from GrandaddyShmax/audiocraft_plus
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
-
open_flamingo Public
Forked from mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
Python MIT License Updated Oct 23, 2023
-
so-vits-svc-fork Public
Forked from voicepaw/so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
Python Other Updated Apr 28, 2023
-
auraloss Public
Forked from csteinmetz1/auraloss
Collection of audio-focused loss functions in PyTorch
Python Apache License 2.0 Updated Apr 21, 2023
-
CLAP Public
Forked from LAION-AI/CLAP
Contrastive Language-Audio Pretraining
-
audio-local-transformers Public
Experimental implementations of local attention-based audio transformers and autoencoders
-
ClipCap Public
Forked from TheoCoombes/ClipCap
Using pretrained encoder and language models to generate captions from multimedia inputs.
Python Updated Mar 11, 2023
-
audio-diffusion-pytorch-trainer Public
Forked from archinetai/audio-diffusion-pytorch-trainer
Trainer for audio-diffusion-pytorch
Python MIT License Updated Oct 21, 2022
-
audio-data-pytorch Public
Forked from archinetai/audio-data-pytorch
A collection of useful audio datasets and transforms for PyTorch.
Python MIT License Updated Sep 4, 2022
-
RAVE Public
Forked from acids-ircam/RAVE
Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder
-
k-diffusion Public
Forked from crowsonkb/k-diffusion
Karras et al. (2022) diffusion models for PyTorch
Python MIT License Updated Jul 28, 2022
-
imagen-pytorch Public
Forked from lucidrains/imagen-pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
Python MIT License Updated Jun 8, 2022
-
unconditional-diff-STFT Public
Forked from SynthAether/unconditional-diff-STFT
Unconditional music synthesis using a diffusion model in the STFT domain
Jupyter Notebook MIT License Updated May 31, 2022
-
FastDiff Public
Forked from Rongjiehuang/FastDiff
PyTorch Implementation of FastDiff (IJCAI'22)
Python Updated May 20, 2022
-
s3prl Public
Forked from s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
Python Apache License 2.0 Updated May 14, 2022
-
byol-a-2 Public
Forked from nttcslab/byol-a
BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation
Python Other Updated May 7, 2022
-
SoundStream Public
Forked from wesbz/SoundStream
This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf
Python Updated Apr 21, 2022
-
tagbox Public
Forked from ethman/tagbox
Steer OpenAI's Jukebox with Music Taggers
Jupyter Notebook Updated Apr 21, 2022
-
v-diffusion-pytorch Public
Forked from crowsonkb/v-diffusion-pytorch
v objective diffusion inference code for PyTorch.
Python MIT License Updated Mar 26, 2022
-
nicotine-plus Public
Forked from nicotine-plus/nicotine-plus
Graphical client for the Soulseek peer-to-peer network
Python GNU General Public License v3.0 Updated Mar 25, 2022
-
steerable-nafx Public
Forked from csteinmetz1/steerable-nafx
Steerable discovery of neural audio effects
Jupyter Notebook Apache License 2.0 Updated Mar 2, 2022
-
guided-diffusion Public
Forked from crowsonkb/guided-diffusion
Python MIT License Updated Mar 1, 2022
-
glide-text2im Public
Forked from crowsonkb/glide-text2im
GLIDE: a diffusion-based text-conditional image synthesis model
Python MIT License Updated Mar 1, 2022
-
CLIP Public
Forked from openai/CLIP
Contrastive Language-Image Pretraining
Jupyter Notebook MIT License Updated Feb 18, 2022