Skip to content
View WrongProtocol's full-sized avatar

Block or report WrongProtocol

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
67 results for source starred repositories
Clear filter

Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.

Python 5,114 773 Updated Mar 11, 2026

The most powerful training scripts for ACE-Step 1.5 including a Command Line Interface, a Terminal Wizard and a Graphical User Interface.

Python 87 14 Updated Mar 22, 2026

openDAW is a next-generation web-based Digital Audio Workstation (DAW)

TypeScript 1,382 102 Updated Mar 24, 2026

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 19,572 2,411 Updated Mar 16, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 334,023 65,129 Updated Mar 24, 2026

The most powerful local music generation model that outperforms most commercial alternatives, supporting Mac, AMD, Intel, and CUDA devices.

Python 8,173 928 Updated Mar 24, 2026

VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

Python 6,174 746 Updated Mar 13, 2026

Music repair method to convert lossy MP3 compressed music to lossless music.

Python 363 34 Updated Aug 12, 2025
Jupyter Notebook 453 60 Updated Nov 2, 2025

Main reference implementation for NLWeb, implemented in Python.

Python 6,168 693 Updated Mar 16, 2026
Python 111 16 Updated Oct 16, 2025

Realtime AI Voice Converter for NVIDIA GPUs

Python 186 15 Updated Nov 5, 2025

API documentation for Paymo

JavaScript 81 29 Updated Jul 20, 2023

ACE-Step: A Step Towards Music Generation Foundation Model

Python 4,234 533 Updated Feb 15, 2026

Robust Speech Recognition via Large-Scale Weak Supervision

Python 96,534 11,919 Updated Dec 15, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 44,917 6,011 Updated Aug 16, 2024

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 22,630 4,062 Updated Mar 24, 2026

Audio Plugin for Audio to MIDI transcription using deep learning.

C++ 2,496 161 Updated Jan 16, 2025

⏩ Source-controlled AI checks, enforceable in CI. Powered by the open-source Continue CLI

TypeScript 32,036 4,297 Updated Mar 24, 2026

Gradio UI for YuE

Python 90 18 Updated Apr 5, 2025
HTML 2 Updated Jul 16, 2025

A zero-config VS Code database extension with affordances to aid development and debugging.

TypeScript 1,351 42 Updated Mar 4, 2026

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python 12,897 2,795 Updated Jun 22, 2025

Taming Stable Diffusion for Lip Sync!

Python 5,522 902 Updated Jun 20, 2025

[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Python 2,130 246 Updated Feb 23, 2026

Code for FLAVR: A fast and efficient frame interpolation technique.

Python 515 76 Updated May 7, 2024

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 42,143 3,344 Updated Mar 23, 2026

Nodes related to video workflows

Python 1,554 296 Updated Mar 17, 2026
Next