Skip to content
View chunyu-li's full-sized avatar
  • ByteDance
  • Beijing, China

Block or report chunyu-li

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Towards Scalable Pre-training of Visual Tokenizers for Generation

Python 331 7 Updated Dec 16, 2025

Krea Realtime 14B. An open-source realtime AI video model.

Python 428 24 Updated Nov 13, 2025

Hierarchical Reasoning Model Official Release

Python 12,169 1,779 Updated Sep 9, 2025

Latest Advances on System-2 Reasoning

Python 1,299 73 Updated Jun 8, 2025
Python 1,038 63 Updated Nov 20, 2025

Witness the aha moment of VLM with less than $3.

Python 4,011 289 Updated May 19, 2025

📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

767 41 Updated Oct 10, 2025

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Python 822 25 Updated Dec 23, 2025

MMaDA - Open-Sourced Multimodal Large Diffusion Language Models

Python 1,538 78 Updated Nov 16, 2025

[TMLR 2025🔥] A survey for the autoregressive models in vision.

772 22 Updated Nov 8, 2025

This is a repo to track the latest autoregressive visual generation papers.

420 5 Updated Jun 25, 2025

Lets make video diffusion practical!

Python 16,383 1,596 Updated Oct 16, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 13,818 2,034 Updated Dec 21, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,974 2,221 Updated Dec 15, 2025

[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting

Python 6,430 761 Updated Feb 19, 2025

[ICLR 2025] Animate-X: Universal Character Image Animation with Enhanced Motion Representation

Python 378 38 Updated Sep 17, 2025

talking-face video editing

Python 411 57 Updated Feb 27, 2025

Real time interactive streaming digital human

Python 6,909 1,068 Updated Nov 23, 2025

Taming Stable Diffusion for Lip Sync!

Python 5,277 851 Updated Jun 20, 2025

Bring portraits to life!

Python 17,498 1,813 Updated Nov 16, 2025

GLM-4-Voice | 端到端中英语音对话模型

Python 3,104 269 Updated Dec 5, 2024

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python 9,565 2,083 Updated Apr 16, 2024

Python client for Baidu Yun (Personal Cloud Storage) 百度云/百度网盘Python客户端

Python 8,458 1,428 Updated Apr 2, 2025

2024 up-to-date list of DATASETS, CODEBASES and PAPERS on Multi-Task Learning (MTL), from Machine Learning perspective.

808 63 Updated Oct 8, 2025

one-click face swap

Python 30,427 6,917 Updated Aug 19, 2024

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 8,630 1,121 Updated Sep 14, 2024

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Python 5,096 698 Updated Sep 26, 2025

The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."

Python 1,102 192 Updated Sep 25, 2023

A Zsh theme

Shell 52,054 2,380 Updated Apr 29, 2025

Download and preprocess voxceleb datasets.

Python 40 9 Updated Jun 18, 2025
Next