- Hong Kong
- adamdad.github.io
- @yxy2168
Starred repositories
[Preprint] ViFeEdit: A Video-Free Tuner of Your Video Diffusion Transformer
Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation"
Self-Purification Mitigates Backdoors in Multimodal Diffusion Language Models
A Curated List of Awesome Video World Models with AR Diffusion: Covering Algorithms, Applications, and Infrastructure, Aimed at Serving as a Comprehensive Resource for Researchers, Practitioners, a…
My Python scripts to make high-quality figures for publications in top AI conferences and journals.
Official repository for the paper "Light Forcing: Accelerating Autoregressive Video Diffusion via Sparse Attention"
PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation
Pocket Flow: 100-line LLM framework. Let Agents build Agents!
PhysX-Anything: Simulation-Ready Physical 3D Assets from Single Image (CVPR 2026)
MAI-UI: Real-World Centric Foundation GUI Agents ranging from 2B to 235B
The official repository for "Rongsheng Wang's Arxiv Template"
😎 Awesome lists about Video Generation Models
A Foundation Model for Generalist Gaming Agents
WeDLM: The fastest diffusion language model with standard causal attention and native KV cache compatibility, delivering real speedups over vLLM-optimized baselines.
The official code of Refinement Provenance Inference: Detecting LLM-Refined Training Prompts from Model Behavior
LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding
A character-level language diffusion model trained on Tiny Shakespeare
SpotEdit: Selective Region Editing in Diffusion Transformers
Long-range camera-conditioned scene generation from one single image.
Towards Scalable Pre-training of Visual Tokenizers for Generation
HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency