Skip to content
View XianfengWu01's full-sized avatar

Block or report XianfengWu01

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 626 37 Updated May 18, 2026
Python 26 1 Updated May 14, 2026

SenseNova-U series: Native Unified Paradigm with NEO-unify from the First Principles

Python 2,082 135 Updated May 18, 2026

LLaDA2.0-Uni: Understanding and Generation the World.

Python 744 48 Updated May 13, 2026

Job Talks @ Mines

Shell 1 Updated Apr 10, 2026

DVD: Deterministic Video Depth Estimation with Generative Priors

Python 297 22 Updated Apr 7, 2026

Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals

Python 1,995 164 Updated Apr 19, 2026

scDFM: Distributional Flow Matching for Robust Single-Cell Perturbation Prediction (ICLR 2026)

Python 30 5 Updated Apr 22, 2026
Python 41 3 Updated Feb 4, 2026

[ICML 2026] LatentMorph: Morphing Latent Reasoning into Image Generation

Python 43 Updated May 5, 2026
Python 198 1 Updated Feb 27, 2026

A UX system for full scale deployment of a llm driven video editing ysstem

JavaScript 1 Updated Jan 25, 2026

[ICLR26 Oral] RealPDEBench: A Benchmark for Complex Physical Systems with Paired Real-World and Simulated Data

Python 87 10 Updated May 14, 2026
Python 53 1 Updated Dec 10, 2025

Official implementation of “4D LangVGGT: 4D Language-Visual Geometry Grounded Transformer”

Python 87 2 Updated Mar 25, 2026

[CVPR 2026] TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models

Python 65 1 Updated Feb 21, 2026

BuildArena, where LLM agents design, build, and test rockets, cars, and bridges in a physics simulator given a goal-directed sentence.

Python 91 3 Updated May 4, 2026

[ICML 2026] ScalingAR: Scaling Confidence for Autoregressive Image Generation

Python 20 1 Updated May 5, 2026

Explore how to get a VQ-VAE models efficiently!

Python 70 5 Updated Jul 24, 2025

An Efficient Text-to-Image Generation Pretrain Pipeline

Python 131 8 Updated Apr 18, 2025

[🚀 ICLR 2026 Oral] NextStep-1: SOTA Autogressive Image Generation with Continuous Tokens. A research project developed by the StepFun’s Multimodal Intelligence team.

Python 684 25 Updated Feb 27, 2026

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 10,419 842 Updated Mar 30, 2026

TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation

Python 236 6 Updated Aug 18, 2025

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 3,347 265 Updated Sep 12, 2025

[ICCV 2025] FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models

Python 37 Updated Apr 2, 2026

A curated list of awesome papers for reconstructing 4D spatial intelligence from video. (arXiv 2507.21045)

490 25 Updated May 4, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 15,798 1,946 Updated Mar 17, 2026

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 13,128 1,462 Updated May 16, 2026

[ICLR 2026] Streaming 4D Visual Geometry Transformer

Python 912 46 Updated Oct 27, 2025
Next