Skip to content
View svitass's full-sized avatar

Block or report svitass

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis

Python 130 4 Updated Nov 19, 2025

Taming Stable Diffusion for Lip Sync!

Python 5,275 851 Updated Jun 20, 2025

repo collection for NVIDIA Audio2Face-3D models and tools

142 14 Updated Sep 24, 2025

Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"

Python 1,166 105 Updated Dec 18, 2025

This repository provides a comprehensive sample project showcasing the integration of Meta's Avatars with the Meta XR Interaction SDK in Unity. It serves as a practical guide for developers, demons…

ShaderLab 23 9 Updated Dec 3, 2025

NVIDIA ACE samples, workflows, and resources

HCL 294 71 Updated Jun 26, 2025

A service to convert audio to facial blendshapes for lipsyncing and facial performances.

Python 195 30 Updated Jun 17, 2025

可对接fay数字人的ue5工程

617 119 Updated Dec 18, 2024

fay是一个帮助数字人(2.5d、3d、移动、pc、网页)或大语言模型(openai兼容、deepseek)连通业务系统的agent框架。

Python 12,232 2,206 Updated Dec 17, 2025

Towards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny conditional de…

Python 1,134 95 Updated Dec 23, 2025

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 33,637 4,205 Updated Aug 6, 2024

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 6,388 743 Updated Dec 21, 2025

GMTalker 由光明实验室媒体智能团队打造的3d数字人。系统集成了语音识别、语音合成、自然语言理解、嘴型动画驱动。支持windows、Linux、安卓快速部署。

Python 1,064 40 Updated Dec 22, 2025

A fast AI Video Generator for the GPU Poor. Supports Wan 2.1/2.2, Qwen Image, Hunyuan Video, LTX Video and Flux.

Python 3,500 492 Updated Dec 23, 2025

We present StableAvatar, the first end-to-end video diffusion transformer, which synthesizes infinite-length high-quality audio-driven avatar videos without any post-processing, conditioned on a re…

Python 1,165 99 Updated Dec 8, 2025

​​Unlimited-length talking video generation​​ that supports image-to-video and video-to-video generation

Python 3,998 668 Updated Dec 18, 2025

HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation

Python 1,195 106 Updated Oct 15, 2025

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…

C++ 13,757 2,145 Updated Dec 23, 2025

[AAAI 2026] EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation

Python 682 70 Updated Nov 24, 2025

HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning

Python 1,037 180 Updated Dec 23, 2025

Kaleido: Open-sourced multi-subject reference video generation model, enabling controllable, high-fidelity video synthesis from multiple image references.

Python 90 7 Updated Dec 23, 2025

一个超轻量级、可以在移动端实时运行的数字人模型

Python 2,361 340 Updated Sep 18, 2025

rtmp streaming from opencv with ffmpeg / avcodec using C++ or Python

C++ 204 47 Updated Feb 11, 2023

python库,实现推送实时rtmp音视频流

C++ 136 37 Updated Apr 17, 2024

实时交互数字人,可自定义形象与音色,支持音色克隆,对话延迟低至3s。Real-time voice interactive digital human, customizable appearance and voice, supporting voice cloning, with initial package delay as low as 3s.

Python 1,175 152 Updated Dec 18, 2025

Digital Human Resource: 2D/3D/4D Human Modeling, Avatar Generation & Animation, Clothed People Digitalization, Virtual Try-On, and Others.

1,867 167 Updated Dec 16, 2025

A curated list of awesome research papers, projects, code, dataset, workshops etc. related to virtual try-on.

2,982 351 Updated Nov 10, 2025
Next