Skip to content
View hywang2002's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Peking University

Block or report hywang2002

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🏛️ 三省六部制 · OpenClaw Multi-Agent Orchestration System — 9 specialized AI agents with real-time dashboard, model config, and full audit trails

Python 14,222 1,476 Updated Apr 4, 2026

Elevate your AI research writing, no more tedious polishing ✨

15,656 1,245 Updated Mar 25, 2026

[Lumina具身智能社区] 具身智能技术指南 Embodied-AI-Guide

12,800 820 Updated Mar 12, 2026

[CVPR 2026] VGGDrive: Empowering Vision-Language Models with Cross-View Geometric Grounding for Autonomous Driving

Python 82 5 Updated Mar 10, 2026

[ICLR 2026] The official implementation of "PTQ4ARVG: Post-Training Quantization for AutoRegressive Visual Generation Models"

Python 10 Updated Feb 3, 2026

Advancing Open-source World Models

Python 3,316 273 Updated Apr 2, 2026

NVIDIA FastGen: Fast Generation from Diffusion Models

Python 663 47 Updated Mar 19, 2026
Python 131 17 Updated Mar 1, 2024

Cosmos Policy

Python 684 54 Updated Jan 23, 2026

Compositional Diffusion with Guided search for Long-Horizon Planning

Python 5 Updated Dec 18, 2025

StableWorld: Towards Stable and Consistent Long Interactive Video Generation

Python 92 1 Updated Mar 18, 2026
Jupyter Notebook 68 7 Updated Apr 1, 2026

Official repository of paper "CoMoVi: Co-Generation of 3D Human Motions and Realistic Videos"

80 Updated Jan 16, 2026

PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation

57 Updated Jan 5, 2026
Jupyter Notebook 2,987 753 Updated Aug 13, 2025

Official code for StoryMem: Multi-shot Long Video Storytelling with Memory

Python 709 69 Updated Jan 22, 2026

StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing

Python 1,531 185 Updated Mar 30, 2026

Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.

Python 5,556 840 Updated Apr 2, 2026

Video generation Stage of InfiniCube, implemented in DiffSynth-Studio

Python 7 Updated Nov 17, 2025

A unified framework for 3D Occupancy Prediction

Python 38 3 Updated Jan 15, 2026

[ICLR 2026] Official repo for paper "Video-As-Prompt: Unified Semantic Control for Video Generation"

Python 407 27 Updated Feb 8, 2026

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,649 2,758 Updated Aug 12, 2024

Learning to Drive with GPT

Python 298 23 Updated Feb 1, 2024

[AAAI 2025] Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous Driving

Python 52 2 Updated Mar 4, 2026

[ECCV 2024] Embodied Understanding of Driving Scenarios

Python 209 14 Updated Jul 2, 2025
Python 4,627 457 Updated Sep 14, 2025

[CVPR2026] Scaling Spatial Intelligence with Multimodal Foundation Models

Python 193 11 Updated Apr 1, 2026
Next