Lists (11)
Sort Name ascending (A-Z)
Stars
๐ Text-Prompted Generative Audio Model
PyTorch code and models for the DINOv2 self-supervised learning method.
QLoRA: Efficient Finetuning of Quantized LLMs
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
MARS5 speech model (TTS) from CAMB.AI
JonathanFly / bark
Forked from suno-ai/bark๐ BARK INFINITY GUI CMD ๐ถ Powered Up Bark Text-prompted Generative Audio Model
(ICCV'21) Official code of "Dressing in Order: Recurrent Person Image Generation for Pose Transfer, Virtual Try-on and Outfit Editing." (No longer actively maintained)
Whisper-Flamingo [Interspeech 2024] and mWhisper-Flamingo [IEEE SPL 2025] for Audio-Visual Speech Recognition and Translation
Pytorch implementation of the paper, Neural re-rendering of humans from a single image.