Skip to content
View WrongProtocol's full-sized avatar

Block or report WrongProtocol

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Music repair method to convert lossy MP3 compressed music to lossless music.

Python 331 30 Updated Aug 12, 2025
Jupyter Notebook 374 56 Updated Nov 2, 2025

Main reference implementation for NLWeb, implemented in Python.

Python 6,109 682 Updated Dec 18, 2025
Python 73 10 Updated Oct 16, 2025

Realtime AI Voice Converter for NVIDIA GPUs

Python 133 5 Updated Nov 5, 2025

API documentation for Paymo

JavaScript 81 29 Updated Jul 20, 2023

ACE-Step: A Step Towards Music Generation Foundation Model

Python 3,477 418 Updated Jun 27, 2025

Robust Speech Recognition via Large-Scale Weak Supervision

Python 92,177 11,547 Updated Dec 15, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 43,943 5,857 Updated Aug 16, 2024

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 20,316 3,312 Updated Dec 20, 2025

Audio Plugin for Audio to MIDI transcription using deep learning.

C++ 2,309 144 Updated Jan 16, 2025

⏩ Ship faster with Continuous AI. Open-source CLI that can be used in TUI mode as a coding agent or Headless mode to run background agents

TypeScript 30,429 3,928 Updated Dec 20, 2025

Realtime DDSP Neural Synthesizer and Effect

C++ 790 85 Updated Aug 7, 2023

Gradio UI for YuE

Python 84 17 Updated Apr 5, 2025
HTML 2 Updated Jul 16, 2025

A zero-config VS Code database extension with affordances to aid development and debugging.

JavaScript 1,208 38 Updated Nov 26, 2025

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python 12,697 2,753 Updated Jun 22, 2025

Taming Stable Diffusion for Lip Sync!

Python 5,266 847 Updated Jun 20, 2025

[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Python 2,017 237 Updated Nov 30, 2025

Code for FLAVR: A fast and efficient frame interpolation technique.

Python 513 76 Updated May 7, 2024

FILM: Frame Interpolation for Large Motion, In ECCV 2022.

Python 3,092 315 Updated Aug 10, 2024

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 41,016 3,201 Updated Dec 19, 2025

Nodes related to video workflows

Python 1,396 252 Updated Dec 17, 2025

A custom node set for Video Frame Interpolation in ComfyUI.

Python 916 86 Updated Apr 30, 2025

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 53,332 5,839 Updated Dec 19, 2025

The official GitHub page for the survey paper "Foundation Models for Music: A Survey".

221 9 Updated Sep 4, 2024

Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion

Python 2,141 248 Updated Nov 27, 2025

InvokeAI API library

Python 1 Updated Feb 10, 2025

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, …

TypeScript 2,825 294 Updated Nov 23, 2025
Next