-
Ant Group
- Hangzhou, China
-
06:44
(UTC +08:00) - https://zengyh1900.github.io/
- @zengyh1900
Highlights
- Pro
Lists (7)
Sort Name ascending (A-Z)
Starred repositories
Krea Realtime 14B. An open-source realtime AI video model.
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
"ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"
Official Implementations for Paper - HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives
[Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset
Ring attention implementation with flash attention
SOTAMak1r / Infinite-Forcing
Forked from guandeh17/Self-ForcingInfinite-Forcing: Towards Infinite-Long Video Generation
A sparse attention kernel supporting mix sparse patterns
A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training
Dream to Control: Learning Behaviors by Latent Imagination
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
Code for FastVGGT: Training-Free Acceleration of Visual Geometry Transformer
Unlimited-length talking video generation that supports image-to-video and video-to-video generation
Ongoing research training transformer models at scale
Virtual Community: An Open World for Humans, Robots, and Society
VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.
Make self forcing endless. Add cache purging. Add prompt controllability.
Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization
DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support
[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
An educational resource to help anyone learn deep reinforcement learning.
Official repository for "RLVR-World: Training World Models with Reinforcement Learning" (NeurIPS 2025), https://arxiv.org/abs/2505.13934
Official implementation of Continuous 3D Perception Model with Persistent State
Official implementation of CharacterShot: Controllable and Consistent 4D Character Animation
[Preprint] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.