Skip to content
View WrongProtocol's full-sized avatar

Block or report WrongProtocol

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A transformers-based song transcriber using Ace-Step's Qwen2.5-Omni-based song transcription model, acestep-transcriber

Jupyter Notebook 4 Updated Feb 7, 2026

The most powerful local music generation model that outperforms most commercial alternatives

Python 4,668 485 Updated Feb 8, 2026

VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

Python 5,780 686 Updated Jan 24, 2026

Music repair method to convert lossy MP3 compressed music to lossless music.

Python 349 34 Updated Aug 12, 2025
Jupyter Notebook 386 55 Updated Nov 2, 2025

Main reference implementation for NLWeb, implemented in Python.

Python 6,140 693 Updated Feb 5, 2026
Python 94 13 Updated Oct 16, 2025

Realtime AI Voice Converter for NVIDIA GPUs

Python 165 7 Updated Nov 5, 2025

API documentation for Paymo

JavaScript 81 29 Updated Jul 20, 2023

ACE-Step: A Step Towards Music Generation Foundation Model

Python 3,940 493 Updated Jan 28, 2026

Robust Speech Recognition via Large-Scale Weak Supervision

Python 94,312 11,730 Updated Dec 15, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 44,481 5,956 Updated Aug 16, 2024

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 21,496 3,712 Updated Feb 6, 2026

Audio Plugin for Audio to MIDI transcription using deep learning.

C++ 2,412 157 Updated Jan 16, 2025

⏩ Ship faster with Continuous AI. Open-source CLI that can be used in Headless mode to run async cloud agents or TUI mode as an in sync coding agent

TypeScript 31,285 4,131 Updated Feb 8, 2026

Realtime DDSP Neural Synthesizer and Effect

C++ 797 89 Updated Aug 7, 2023

Gradio UI for YuE

Python 89 18 Updated Apr 5, 2025
HTML 2 Updated Jul 16, 2025

A zero-config VS Code database extension with affordances to aid development and debugging.

JavaScript 1,292 41 Updated Feb 5, 2026

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python 12,823 2,779 Updated Jun 22, 2025

Taming Stable Diffusion for Lip Sync!

Python 5,412 876 Updated Jun 20, 2025

[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Python 2,086 245 Updated Feb 6, 2026

Code for FLAVR: A fast and efficient frame interpolation technique.

Python 514 76 Updated May 7, 2024

FILM: Frame Interpolation for Large Motion, In ECCV 2022.

Python 3,106 314 Updated Aug 10, 2024

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 41,601 3,254 Updated Feb 2, 2026

Nodes related to video workflows

Python 1,492 274 Updated Jan 24, 2026

A custom node set for Video Frame Interpolation in ComfyUI.

Python 960 96 Updated Apr 30, 2025

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 54,817 5,995 Updated Dec 30, 2025

The official GitHub page for the survey paper "Foundation Models for Music: A Survey".

220 10 Updated Sep 4, 2024
Next