Skip to content
View zxrzju's full-sized avatar
  • Zhejiang University
  • Hangzhou, China

Organizations

@APRIL-ZJU

Block or report zxrzju

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A Comprehensive Survey of Interactive Video World Models

179 11 Updated Jun 16, 2026

The one and only one gfwlist here

25,388 3,998 Updated Jun 14, 2026

AI-IO: An Aerodynamics-Inspired Real-Time Inertial Odometry for Quadrotors (ICRA 2026)

Python 63 6 Updated Jun 8, 2026

An open source library designed to provide community examples of Joint Embedding Predictive Architectures (JEPAs). It contains code and examples for learning representations from images, video, and…

Python 702 75 Updated Jun 11, 2026

[ICML 2026] Official Code for Rectified LpJEPA: Joint-Embedding Predictive Architectures with Sparse and Maximum-Entropy Representations

Python 76 9 Updated Feb 15, 2026

Official implementation of our paper "CNN-JEPA: Self-Supervised Pretraining Convolutional Neural Networks Using Joint Embedding Predictive Architecture"

Jupyter Notebook 36 7 Updated Jul 28, 2025

Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals

Python 2,276 193 Updated Apr 19, 2026

[AAAI 2026] SparseWorld: A Flexible, Adaptive, and Efficient 4D Occupancy World Model Powered by Sparse and Dynamic Queries

Python 58 7 Updated Jan 14, 2026

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 3,963 400 Updated Feb 27, 2025

GaussianAD: Gaussian-Centric End-to-End Autonomous Driving

Python 126 5 Updated Apr 12, 2026

Efficient vision foundation models for high-resolution generation and perception.

Python 3,322 250 Updated Sep 5, 2025

Code of "OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free Environments".

Python 362 19 Updated Jun 5, 2025

(ICCV 2025) GaussianOcc: Fully Self-supervised and Efficient 3D Occupancy Estimation with Gaussian Splatting

Python 377 21 Updated Feb 17, 2026

[ICCV 2025] Detect Anything 3D in the Wild

Python 276 17 Updated Dec 14, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 10,691 878 Updated Jun 15, 2026

从零手搓Flow Matching(Rectified Flow)

Python 627 34 Updated Dec 10, 2025
4 Updated Jul 23, 2025

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 4,545 359 Updated Jan 5, 2026

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 13,370 1,490 Updated May 19, 2026

Evo-0: Vision-Language-Action Model with Implicit Spatial Understanding.

55 2 Updated Nov 24, 2025
Python 420 34 Updated Oct 29, 2025

[CVPR 2025] Gaussian World Model for Streaming 3D Occupancy Prediction

Python 157 8 Updated Dec 4, 2025

[ICCV 2025] TeRA: Rethinking Text-guided Realistic 3D Avatar Generation

19 Updated Sep 13, 2025

[ICCV 2025] Can Generative Geospatial Diffusion Models Excel as Discriminative Geospatial Foundation Models?

Python 29 2 Updated Sep 16, 2025

[ECCV2024] SQD-MapNet: Stream Query Denoising for Vectorized HD-Map Construction

Python 22 3 Updated Oct 6, 2024

HE-Drive: Human-Like End-to-End Driving with Vision Language Models

Python 253 16 Updated Aug 17, 2025

This is the official project repository for "FASTopo: Fast-Slow Lane Segment Topology Reasoning with Latent World Models"

10 3 Updated Aug 1, 2025
Next