Skip to content
View zhuyingSeu's full-sized avatar

Organizations

@minivision-ai

Block or report zhuyingSeu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

OpenClaw Desktop Assistant MVP - Electron-based AI voice assistant with Live2D character animations, real-time speech recognition, and text-to-speech

JavaScript 410 55 Updated Feb 13, 2026

VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning

Python 18,908 2,248 Updated May 11, 2026
Python 207 40 Updated May 8, 2026

Added vLLM support to IndexTTS for faster inference.

Python 1,147 159 Updated Apr 13, 2026

Deprecated, the Web Neural Network Polyfill project has been moved to https://github.com/webmachinelearning/webnn-polyfill

Python 161 40 Updated Apr 14, 2023

Production First and Production Ready End-to-End Keyword Spotting Toolkit

Python 713 142 Updated May 15, 2026

GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning

Python 1,000 127 Updated Apr 10, 2026

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Python 10,910 1,013 Updated Mar 22, 2026

[ACM MM 2025] Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis

Python 785 142 Updated Nov 12, 2025

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 9,049 771 Updated Mar 26, 2026

Real time interactive streaming digital human

Python 7,652 1,209 Updated May 14, 2026

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Python 137,243 19,582 Updated May 15, 2026

[CVPR 2025] MatAnyone: Stable Video Matting with Consistent Memory Propagation

Python 1,572 112 Updated Mar 4, 2026

[CVPR 2024] Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution

Python 1,457 83 Updated Sep 27, 2024

[ICCV 2025] STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

Python 1,483 87 Updated Jul 2, 2025

Singing Voice Conversion via diffusion model

Jupyter Notebook 2,717 817 Updated Apr 18, 2026

SoftVC VITS Singing Voice Conversion

Python 28,063 5,057 Updated Nov 11, 2023

[ICLR2025] DisPose: Disentangling Pose Guidance for Controllable Human Image Animation

Python 378 32 Updated Nov 20, 2025

High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

Python 2,587 233 Updated Nov 18, 2025

A Trimap-Free Portrait Matting Solution in Real Time [AAAI 2022]

Python 4,314 677 Updated May 6, 2024

Real-Time High-Resolution Background Matting

Python 7,166 962 Updated Jun 19, 2024

[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 4,572 538 Updated Feb 23, 2026

Bring portraits to life!

Python 18,348 1,911 Updated Mar 2, 2026

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 12,728 1,293 Updated Nov 4, 2025

JoyHallo: Digital human model for Mandarin

Python 521 51 Updated Sep 21, 2025

[ICLR 2025] Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation

Python 3,697 535 Updated Feb 27, 2025

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 8,645 1,118 Updated Sep 14, 2024

High-Fidelity Lip-Syncing with Wav2Lip and Real-ESRGAN

Python 507 104 Updated Mar 27, 2024

[ICLR 2025 Oral] TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation

Python 1,150 149 Updated Aug 24, 2025

一个超轻量级、可以在移动端实时运行的数字人模型

Python 2,495 365 Updated Apr 23, 2026
Next