Skip to content
View andywer's full-sized avatar

Highlights

  • Pro

Organizations

@AKSW @webpack-blocks @shuttersh

Block or report andywer

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
10 stars written in Jupyter Notebook
Clear filter

A latent text-to-image diffusion model

Jupyter Notebook 71,770 10,518 Updated Jun 18, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 22,633 2,481 Updated Mar 13, 2025

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,736 865 Updated Jun 10, 2024
Jupyter Notebook 9,606 678 Updated Oct 16, 2025

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 8,647 966 Updated Oct 23, 2025

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 8,431 798 Updated Mar 15, 2025
Jupyter Notebook 635 83 Updated Nov 7, 2025

Face Image Motion Model (Photo-2-Video) based on "first-order-model" repository.

Jupyter Notebook 548 90 Updated Aug 23, 2022

Prompt engineering, automated.

Jupyter Notebook 347 26 Updated Apr 22, 2025

Audio-driven facial animation generator with BiLSTM used for transcribing the speech and web interface displaying the avatar and the animation

Jupyter Notebook 35 16 Updated Jul 14, 2022