-
Tsinghua University
- Beijing
- https://hongzhebi.github.io/
- https://scholar.google.com/citations?hl=zh-CN&user=2-LOJF4AAAAJ
Lists (5)
Sort Name ascending (A-Z)
Starred repositories
[CVPR 2026] Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching
SenseNova-U series: Native Unified Paradigm with NEO-unify from the First Principles
The first continuous diffusion language model that rivals discrete counterparts on standard language modeling benchmarks like LM1B and OpenWebText.
HY-Embodied: Embodied Foundation Models for Real-World Agents
AI agents running research on single-GPU nanochat training automatically
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
Memory-Dependent Manipulation Benchmark based on RoboTwin
A cross-platform desktop All-in-One assistant for Claude Code, Codex, OpenCode, OpenClaw, Gemini CLI & Hermes Agent. Only official website: ccswitch.io
A Curated List of Awesome Video World Models with AR Diffusion: Covering Algorithms, Applications, and Infrastructure, Aimed at Serving as a Comprehensive Resource for Researchers, Practitioners, a…
Awesome Unified Multimodal Models
NextFlow🚀: Unified Sequential Modeling Activates Multimodal Understanding and Generation
Official repo for vidar and vidarc: video foundation model for robotics.
Official code of Motus: A Unified Latent Action World Model
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
RoboChallenge Inference example code
StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing
[ICLR 2026] Trace Anything: Representing Any Video in 4D via Trajectory Fields
A unified inference and post-training framework for accelerated video generation.
VLAC: A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning
Reference PyTorch implementation and models for DINOv3
A pipeline parallel training script for diffusion models.
Enjoy the magic of Diffusion models!
A Survey of Efficient Attention Methods: Hardware-efficient, Sparse, Compact, and Linear Attention
A supervised learning trained reward head for ACT
(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)