ifzhang

Follow

🐶

Focusing

Yifu Zhang ifzhang

🐶

Focusing

Follow

844 followers · 127 following

Achievements

Achievements

Organizations

Stars

hustvl / MoDA

An hardware-aware Efficient Implementation for "Mixture-of-Depths Attention".

Python 162 4 Updated Apr 15, 2026

ZhuLinsen / daily_stock_analysis

LLM驱动的 A/H/美股智能分析器：多数据源行情 + 实时新闻 + LLM决策仪表盘 + 多渠道推送，零成本定时运行，纯白嫖. LLM-powered stock analysis system for A/H/US markets.

Python 30,263 30,957 Updated Apr 17, 2026

FoundationVision / Alive

[Tech Report] Alive: A Unified Audio-Video Generation Model

504 36 Updated Mar 31, 2026

MeiGen-AI / Infinite-World

Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory

Python 153 5 Updated Feb 9, 2026

Tencent-Hunyuan / HunyuanVideo-1.5

HunyuanVideo-1.5: A leading lightweight video generation model

Python 4,382 219 Updated Apr 10, 2026

FoundationVision / InfinityStar

[NeurIPS 2025 Oral]Infinity⭐️: Uniﬁed Spacetime AutoRegressive Modeling for Visual Generation

Python 751 27 Updated Apr 16, 2026

meituan-longcat / LongCat-Video

Python 2,264 342 Updated Apr 15, 2026

character-ai / Ovi

Python 1,690 194 Updated Nov 15, 2025

Dorniwang / UniVerse-1-code

The official UniVerse-1 code.

Python 123 10 Updated Oct 13, 2025

ByteDance-Seed / SeedVR

Repo for SeedVR2 (ICLR2026) & SeedVR (CVPR2025 Highlight)

Python 1,143 66 Updated Jan 27, 2026

baaivision / MTVCraft

MTVCraft: An Open Veo3-style Audio-Video Generation Demo

Python 100 12 Updated Oct 8, 2025

FoundationVision / Waver

Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.

926 114 Updated Aug 27, 2025

TencentARC / ARC-Hunyuan-Video-7B

Structured Video Comprehension of Real-World Shorts

Python 237 7 Updated Sep 21, 2025

InternRobotics / InternNav

InternRobotics' open platform for building generalized navigation foundation models.

Jupyter Notebook 805 111 Updated Mar 10, 2026

bebebe666 / OptimalSteps

Official PyTorch Implementation of "Optimal Stepsize for Diffusion Sampling".

Python 200 12 Updated Apr 13, 2025

SandAI-org / MAGI-1

MAGI-1: Autoregressive Video Generation at Scale

Python 3,675 236 Updated Jun 17, 2025

OpenGVLab / InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 2,244 145 Updated Mar 25, 2026

hustvl / LightningDiT

[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 1,442 57 Updated Dec 16, 2025

xiaominli1020 / ReNeg

ReNeg: Learning Negative Embedding with Reward Guidance

Python 35 Updated Dec 22, 2025

FoundationVision / Liquid

(Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators

Python 641 35 Updated Nov 10, 2025

zju3dv / street_crafter

[CVPR 2025] StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models

Python 306 27 Updated Jan 27, 2026

sihyun-yu / REPA

[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 1,608 87 Updated Mar 16, 2025

CompVis / discrete-interpolants

The official implementation of "[MASK] is All You Need"

Jupyter Notebook 126 6 Updated Jul 23, 2025

facebookresearch / flow_matching

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 4,342 333 Updated Jan 5, 2026

FoundationVision / Infinity

[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 1,559 93 Updated Apr 16, 2026

Tencent-Hunyuan / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,974 1,228 Updated Nov 21, 2025

zju3dv / street_gaussians

[ECCV 2024] Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting

Python 1,316 100 Updated Jul 4, 2025

hustvl / DiffusionDrive

[CVPR 2025 Highlight] Truncated Diffusion Model for Real-Time End-to-End Autonomous Driving

Python 1,358 129 Updated Dec 8, 2025

NVlabs / DoRA

[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation

Python 960 63 Updated Mar 24, 2026

hustvl / Senna

Bridging Large Vision-Language Models and End-to-End Autonomous Driving

Python 545 44 Updated Mar 15, 2026