Skip to content
View liuziwei7's full-sized avatar

Block or report liuziwei7

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code of "Show the Signal, Hide the Noise: Spectral Forcing for Pixel-Space Diffusion"

Python 18 Updated Jun 17, 2026

SpatialBench: Is Your Spatial Foundation Model an All-Round Player?

Python 106 Updated May 28, 2026

PhysX-Omni: Unified Simulation-Ready Physical 3D Generation for Rigid, Deformable, and Articulated Objects

Jupyter Notebook 253 11 Updated Jun 11, 2026

🌐 Vision-Language-Action Models for Autonomous Driving: Past, Present, and Future

HTML 418 37 Updated Apr 12, 2026

🎥 [Awesome] Egocentric / First-Person Video Datasets 📚 Papers, Benchmarks & Resources for Ego Vision

149 4 Updated Jun 15, 2026

[CVPR 2026] Scaling Spatial Intelligence with Multimodal Foundation Models

Python 272 13 Updated May 14, 2026

[CVPR 2026 Highlight] U4D: Uncertainty-Aware 4D World Modeling from LiDAR Sequences

Python 25 1 Updated Dec 20, 2025

[Roadmap] Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

TeX 119 5 Updated Jun 9, 2026

Modular SenseNova skills for building AI-powered office assistants and productivity workflows

Python 4,567 310 Updated Jun 15, 2026

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

TypeScript 262 8 Updated Jun 16, 2026

SenseNova-U series: Native Unified Paradigm with NEO-unify from the First Principles

Python 3,234 282 Updated Jun 15, 2026

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Python 365 3 Updated May 24, 2026

Your behavior is the signal. Not your words. — Behavioral intelligence for AI agents, built into your MacBook notch.

8 Updated Apr 7, 2026

FileGram: Grounding Agent Personalization in File-System Behavioral Traces

Python 64 6 Updated Apr 12, 2026

[ICLR 2026] 🦅 FALCON: an effective vision-language-action model injects rich 3D spatial tokens into the action head, enabling robust spatial understanding and SOTA performance across diverse manipu…

Python 28 Updated May 26, 2026

A simple video streaming baseline that outperforms SOTAs.

Python 143 8 Updated May 1, 2026

A benchmark for evaluating contextual agents on realistic multimodal personal-computer environments with profiling and factual-retention tasks.

Python 28 1 Updated Apr 2, 2026
Python 9 Updated May 9, 2026

Implementation for Bridging Semantic and Kinematic Conditions with Diffusion-based Discrete Motion Tokenizer.

36 4 Updated Mar 20, 2026

The official implementation of “MonoArt: Progressive Structural Reasoning for Monocular Articulated 3D Reconstruction”

64 1 Updated Mar 20, 2026

Official Implementation of "Kinema4D: Kinematic4D World Modeling for Spatiotemporal Embodied Simulation"

Python 73 4 Updated May 21, 2026

An inference-time, plug-and-play method for temporal control in multi-event generation

JavaScript 170 16 Updated Apr 26, 2026
Python 98 3 Updated Jun 11, 2026

Toy-scale unified multimodal model experiments — encoder-free understanding & generation with Mixture-of-Transformers on MLX/Apple Silicon

Python 44 1 Updated Mar 8, 2026

[ICML 2026 Oral] Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence

322 6 Updated May 25, 2026

[ArXiv 26] The official repository of "ArtHOI: Articulated Human-Object Interaction Synthesis by 4D Reconstruction from Video Priors".

Python 38 Updated Mar 5, 2026

Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

Python 371 20 Updated May 13, 2026

Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition

Python 40 Updated Mar 3, 2026
Next