- Berlin, Germany
- https://andywer.com
- @andywritescode
Stars
A latent text-to-image diffusion model
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
QLoRA: Efficient Finetuning of Quantized LLMs
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Face Image Motion Model (Photo-2-Video) based on the "first-order-model" repository.
Audio-driven facial animation generator that uses a BiLSTM to transcribe speech, with a web interface that displays the avatar and its animation