Skip to content
View gayshub's full-sized avatar
🌴
On vacation
🌴
On vacation

Block or report gayshub

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

ICCV 2025 ACTalker: an end-to-end video diffusion framework for talking head synthesis that supports both single and multi-signal control (e.g., audio, expression).

Python 410 49 Updated Aug 20, 2025

Have a natural, spoken conversation with AI!

Python 3,229 350 Updated Jul 11, 2025

[ICCV2025] LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds

Python 2,418 188 Updated Jul 15, 2025

实时语音交互数字人,支持端到端语音方案(GLM-4-Voice - THG)和级联方案(ASR-LLM-TTS-THG)。可自定义形象与音色,无须训练,支持音色克隆,首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and …

Python 1,107 144 Updated Mar 21, 2025

一个超轻量级、可以在移动端实时运行的数字人模型

Python 2,252 321 Updated Sep 18, 2025

[CVPR 2025] Efficient Video Face Enhancement with Enhanced Spatial-Temporal Consistency

Python 51 2 Updated Jul 30, 2025

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Python 17,544 3,655 Updated Oct 9, 2024

[CVPR 2024] Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution

Python 1,362 79 Updated Sep 27, 2024

A machine learning-based video super resolution and frame interpolation framework. Est. Hack the Valley II, 2018.

C++ 14,778 1,314 Updated Aug 3, 2025

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 32,769 4,088 Updated Aug 6, 2024

[ECCV 2024] Enhancing Perceptual Quality in Video Super-Resolution through Temporally-Consistent Detail Synthesis using Diffusion Models

Python 160 6 Updated Feb 7, 2025

[CVPR 2024 Highlight] Enhancing Video Super-Resolution via Implicit Resampling-based Alignment.

Python 216 16 Updated Dec 22, 2024

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

Python 37,120 6,211 Updated Jul 26, 2024

[ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

Python 1,566 123 Updated Aug 20, 2025

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…

C++ 13,217 2,077 Updated Oct 9, 2025

[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 4,285 503 Updated Aug 11, 2025

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 5,992 626 Updated Aug 10, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 42,932 5,680 Updated Aug 16, 2024

zero-shot voice conversion & singing voice conversion, with real-time support

Python 3,294 386 Updated Apr 20, 2025

DICE-Talk is a diffusion-based emotional talking head generation method that can generate vivid and diverse emotions for speaking portraits.

Python 263 18 Updated Aug 7, 2025

Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"

Python 3,067 261 Updated Jun 27, 2025

Powerful & Easy-to-Use Video Face Swapping and Editing Software

Python 1,290 208 Updated Mar 11, 2025

Taming Stable Diffusion for Lip Sync!

Python 4,984 812 Updated Jun 20, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 27,362 2,697 Updated Apr 30, 2025

unofficial implementation of Comfyui magic clothing

Python 584 43 Updated Sep 4, 2024

Installer & Activited Microsoft Office For MacOS

5,117 619 Updated Sep 17, 2025

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 13,063 1,394 Updated Oct 1, 2025

一种基于Emotion2Vec的批量音频情感自动标注脚本

Python 449 26 Updated Mar 7, 2025

A cli tool for split vocal timbre.

Python 253 15 Updated Feb 17, 2025

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

Python 4,624 774 Updated Mar 19, 2025
Next