Skip to content
View v3ucn's full-sized avatar

Block or report v3ucn

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Quantized text-audio foundation model from Boson AI

Python 42 9 Updated Aug 13, 2025
Python 302 40 Updated Jul 22, 2025

开源的LstmSync数字人泛化模型,只做最好的泛化模型!

Python 162 18 Updated Jun 1, 2026

Bella is best

JavaScript 6,418 983 Updated Feb 5, 2026

SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline

Python 314 32 Updated Jan 19, 2026
Jupyter Notebook 11 3 Updated Feb 20, 2025

This package loads the espeak-ng shared library so it will be available for other libraries.

Python 10 1 Updated Nov 1, 2025

Live2D Library for Python (C++ impl): Supports model loading, lip-sync, basic face rigging, and precise click test.

C++ 543 54 Updated May 22, 2026

Taming Stable Diffusion for Lip Sync!

Python 5,772 947 Updated Jun 20, 2025

Quick Mflux on ComfyUI

Python 131 25 Updated Mar 9, 2025

An implementation of MeloTTS by onnxruntime

Python 29 6 Updated Oct 27, 2024

A passive recording project allows you to have complete control over your data. Automatically take screenshots of all your screens, index them, and save them locally.

Python 1,370 59 Updated Jun 11, 2026

gradio WebUI for AdvancedLivePortrait

Python 525 51 Updated Mar 13, 2025

洛曦 数字人视频播放器,带HTTP API,使用gradio api对接Easy-Wav2Lip、Sadtalker、GeneFacePlusPlus、MuseTalk,也可以用于播放本地视频

Python 176 36 Updated Oct 20, 2024

The fastest digital human algorithm, now on your desktop.

Python 580 74 Updated Sep 29, 2025

GOT-OCR的GUI版本,提供OCR、导出PDF、批处理等功能,但不提供训练功能

Python 180 19 Updated Nov 11, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 14,754 2,154 Updated May 18, 2026

Powerful Free DeepL API, No Token Required

Go 8,555 639 Updated Jun 7, 2026

[ICLR 2025] Official implementation of Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration

Python 747 43 Updated Feb 5, 2025

在DH_live项目基础上修改,添加webui界面

JavaScript 77 17 Updated Apr 25, 2025

Next-Token Prediction is All You Need

Python 2,417 99 Updated Jan 12, 2026

桌面Live2D,提供自定义聊天接口,支持更换模型,自定义动作语音+文本

Python 206 20 Updated Jan 9, 2025
Python 203 20 Updated Sep 24, 2024

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 10,399 968 Updated May 16, 2026

Industry leading face manipulation platform

Python 28,797 4,695 Updated Jun 15, 2026

zero-shot voice conversion & singing voice conversion, with real-time support

Python 3,811 495 Updated Apr 20, 2025

An easy-to-use web framework. Supports both WSGI and ASGI modes. Gevent or asyncio, this is the question.

Python 298 23 Updated Apr 1, 2026

GPT-SoVITS-V2模型,合并了官方的一些PR,包含但不限于:参考音频自动填充,字幕同步,SillyTavern酒馆接入等功能

Python 205 26 Updated Jan 15, 2025

一个基于Flask实现的RWKV_Role_Playing项目的API。

Python 32 3 Updated Jun 26, 2024
Next