Skip to content
View cxxgtxy's full-sized avatar

Organizations

@AMAP-ML

Block or report cxxgtxy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 77 2 Updated Jun 12, 2026

RISE: Reliable Improvement in Self-Evolving Vision-Language Models

Python 13 Updated Jun 12, 2026

[ICML 2026] The official implementation of paper "Generation Enhances Understanding in Unified Multimodal Models via Multi-Representation Generation"

Python 77 3 Updated May 25, 2026

TransitLM: A Large-Scale Dataset and Benchmark for Map-Free Transit Route Generation

Python 124 4 Updated May 30, 2026

[SIGGRAPH 2026] MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation

Python 104 1 Updated May 19, 2026

This is the official repository of "LLaTiSA: Towards Difficulty-Stratified Time Series Reasoning from Visual Perception to Semantics".

Python 78 2 Updated Apr 24, 2026

[2026 CVPR]Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation

Python 110 2 Updated Apr 15, 2026

[CVPR 2026] Elucidating the SNR-t Bias of Diffusion Probabilistic Models

Python 122 3 Updated Apr 20, 2026

[ICLR2026] Video-STAR: Reinforcing Open-Vocabulary Action Recognition with Tools

Python 205 18 Updated Apr 17, 2026

DreamX-World: A General-Purpose Interactive World Model

Python 257 11 Updated Jun 11, 2026

Let Skills Evolve Collectively with Agentic Evolver

Python 1,893 181 Updated Jun 2, 2026

StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing

Python 2,820 350 Updated Jun 12, 2026

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 6,421 756 Updated Mar 23, 2025

[ICLR2026] AutoDrive-R2: Incentivizing Reasoning and Self-Reflection Capacity for VLA Model in Autonomous Driving

Python 205 19 Updated May 20, 2026

[CVPR 2026] Semantic Context Matters: Improving Conditioning for Autoregressive Models

Python 2 Updated Mar 23, 2026

A comprehensive benchmark specifically designed to evaluate the interactive response capabilities of world models in 4D settings.

106 1 Updated Mar 24, 2026

[ICLR 2026] FASA: FREQUENCY-AWARE SPARSE ATTENTION

19 Updated Mar 1, 2026

PyTorch re-implementation for MeanFlow

Python 125 3 Updated Jul 17, 2025

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 1,260 70 Updated Nov 9, 2025

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 3,396 274 Updated Sep 12, 2025
HTML 202 8 Updated Mar 11, 2026

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 1,365 82 Updated Aug 7, 2025

A Curated List of Awesome Video World Models with AR Diffusion: Covering Algorithms, Applications, and Infrastructure, Aimed at Serving as a Comprehensive Resource for Researchers, Practitioners, a…

TeX 599 17 Updated Jun 4, 2026

[KDD 2026 Oral] MobilityBench: A Scalable Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios

Python 152 9 Updated Jun 10, 2026

IntTravel: A Real-World Dataset and Generative Framework for Integrated Multi-Task Travel Recommendation

Python 57 Updated Feb 18, 2026

Code2World: A GUI World Model via Renderable Code Generation

Python 320 18 Updated Feb 12, 2026
Python 61 Updated Feb 9, 2026

Official repository for “PixelGen: Improving Pixel Diffusion with Perceptual Loss”

Python 261 11 Updated May 12, 2026

[ICLR2026] Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models

Python 130 Updated Jan 30, 2026

[ICLR 2026] Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation

Python 128 1 Updated May 17, 2026
Next