Skip to content
View andywer's full-sized avatar

Highlights

  • Pro

Organizations

@AKSW @webpack-blocks @shuttersh

Block or report andywer

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
10 stars written in Jupyter Notebook
Clear filter

A latent text-to-image diffusion model

Jupyter Notebook 72,962 10,610 Updated Jun 18, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 23,236 2,619 Updated Mar 3, 2026

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,899 873 Updated Jun 10, 2024

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 9,864 1,053 Updated Apr 16, 2026
Jupyter Notebook 9,657 679 Updated Oct 16, 2025

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 8,476 797 Updated Mar 15, 2025
Jupyter Notebook 644 81 Updated Nov 10, 2025

Face Image Motion Model (Photo-2-Video) based on "first-order-model" repository.

Jupyter Notebook 545 89 Updated Aug 23, 2022

Prompt engineering, automated.

Jupyter Notebook 354 24 Updated Apr 22, 2025

Audio-driven facial animation generator with BiLSTM used for transcribing the speech and web interface displaying the avatar and the animation

Jupyter Notebook 36 16 Updated Jul 14, 2022