hywang2002

🎯

Focusing

Haoyu Wang hywang2002

🎯

Focusing

Bachelor in Harbin Institute of Technology, PhD student in Peking University.

45 followers · 56 following

Peking University

Achievements

Lists (26)

Sort

Stars

cft0808 / edict

🏛️ 三省六部制 · OpenClaw Multi-Agent Orchestration System — 9 specialized AI agents with real-time dashboard, model config, and full audit trails

Python 14,490 1,496 Updated Apr 5, 2026

Leey21 / awesome-ai-research-writing

Elevate your AI research writing, no more tedious polishing ✨

15,822 1,261 Updated Mar 25, 2026

TianxingChen / Embodied-AI-Guide

[Lumina具身智能社区] 具身智能技术指南 Embodied-AI-Guide

12,824 822 Updated Mar 12, 2026

WJ-CV / VGGDrive

[CVPR 2026] VGGDrive: Empowering Vision-Language Models with Cross-View Geometric Grounding for Autonomous Driving

Python 83 5 Updated Mar 10, 2026

xiaobai1217 / Awesome-Video-Datasets

Video datasets

1,629 116 Updated Mar 8, 2023

XiaomiRobotics / Xiaomi-Robotics-0

Python 404 45 Updated Feb 26, 2026

BienLuky / PTQ4ARVG

[ICLR 2026] The official implementation of "PTQ4ARVG: Post-Training Quantization for AutoRegressive Visual Generation Models"

Python 10 Updated Feb 3, 2026

metric-anything / metric-anything

Python 309 17 Updated Feb 13, 2026

Robbyant / lingbot-world

Advancing Open-source World Models

Python 3,321 275 Updated Apr 2, 2026

NVlabs / FastGen

NVIDIA FastGen: Fast Generation from Diffusion Models

Python 663 47 Updated Mar 19, 2026

NVlabs / CTG

Python 131 17 Updated Mar 1, 2024

NVlabs / cosmos-policy

Cosmos Policy

Python 687 54 Updated Jan 23, 2026

UtkarshMishra04 / CDGS_imgvideo

Compositional Diffusion with Guided search for Long-Horizon Planning

Python 5 Updated Dec 18, 2025

xbyym / StableWorld

StableWorld: Towards Stable and Consistent Long Interactive Video Generation

Python 92 1 Updated Mar 18, 2026

TencentARC / DSR_Suite

Jupyter Notebook 69 7 Updated Apr 1, 2026

IGL-HKUST / CoMoVi

Official repository of paper "CoMoVi: Co-Generation of 3D Human Motions and Realistic Videos"

80 Updated Jan 16, 2026

caiyuanhao1998 / Open-PhyGDPO

PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation

57 Updated Jan 5, 2026

MarkTechStation / VideoCode

Jupyter Notebook 3,013 762 Updated Aug 13, 2025

Kevin-thu / StoryMem

Official code for StoryMem: Multi-shot Long Video Storytelling with Memory

Python 709 69 Updated Jan 22, 2026

starVLA / starVLA

StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing

Python 1,538 186 Updated Apr 6, 2026

Lightricks / LTX-2

Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.

Python 5,575 848 Updated Apr 2, 2026

yifanlu0227 / DiffSynth-Studio-InfiniCube

Video generation Stage of InfiniCube, implemented in DiffSynth-Studio

Python 7 Updated Nov 17, 2025

cdb342 / OccStudio

A unified framework for 3D Occupancy Prediction

Python 38 3 Updated Jan 15, 2026

bytedance / Video-As-Prompt

[ICLR 2026] Official repo for paper "Video-As-Prompt: Unified Semantic Control for Video Generation"

Python 407 27 Updated Feb 8, 2026

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,650 2,757 Updated Aug 12, 2024

PointsCoder / GPT-Driver

Learning to Drive with GPT

Python 298 23 Updated Feb 1, 2024

ltp1995 / GPVL

[AAAI 2025] Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous Driving

Python 52 2 Updated Mar 4, 2026

OpenDriveLab / ELM

[ECCV 2024] Embodied Understanding of Driving Scenarios

Python 209 14 Updated Jul 2, 2025

LLaVA-VL / LLaVA-NeXT

Python 4,627 457 Updated Sep 14, 2025

OpenSenseNova / SenseNova-SI

[CVPR2026] Scaling Spatial Intelligence with Multimodal Foundation Models

Python 194 11 Updated Apr 1, 2026

Haoyu Wang hywang2002

Lists (26)

3D/4D Generation

3DV

Accelerating

AD

Agent

AR

Awesome

AWS

Datasets

DiT based

Flow and tracking

Human Generation

Image Generation

LLM

Physical model

PixDiffusion

Prediction

RL

Spatial Reasoning

T2M

Token-prediction based gen

Tools

UniUndGen

Video Generation

VLA

VTON

Stars