Starred repositories of fengjiasun

HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds

Python 1,236 88 Updated Apr 16, 2026

The most powerful local music generation model that outperforms almost all commercial alternatives, supporting Mac, AMD, Intel, and CUDA devices.

Python 9,414 1,106 Updated Apr 17, 2026

Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025), UltraViCo (ICLR 2026), and UltraImage

Python 803 75 Updated Mar 8, 2026

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Python 120 6 Updated Mar 31, 2026

[ICLR 26 Oral] Stable Video Infinity: Infinite-Length Video Generation with Error Recycling

Python 2,352 195 Updated Jan 19, 2026

Official Implementation of "MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives"

Python 196 7 Updated Dec 29, 2025

[ICLR 2026] LongLive: Real-time Interactive Long Video Generation

Python 1,157 107 Updated Feb 26, 2026

Official implementation of "Repurposing Geometric Foundation Models for Multi-view Diffusion"

Python 181 7 Updated Apr 1, 2026

(TPAMI 2026) Learning Continuous Wasserstein Barycenter Space for Generalized All-in-One Image Restoration

Python 106 4 Updated Apr 9, 2026

[CVPR 2025] VideoWorld is a simple generative model that learns purely from unlabeled videos—much like how babies learn by observing their environment.

Python 763 40 Updated Feb 25, 2026

To pioneer training long-context multi-modal transformer models

Python 73 10 Updated Aug 8, 2025

ViPE: Video Pose Engine for Geometric 3D Perception

Python 1,859 147 Updated Apr 15, 2026

Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition

Python 706 75 Updated Nov 28, 2025

Code and website for Self-Flow: Self-Supervised Flow Matching for Scalable Multi-Modal Synthesis

Python 440 17 Updated Mar 15, 2026

DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation

Python 248 15 Updated Mar 25, 2026

Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory

Python 2,158 231 Updated Mar 30, 2026

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…

1,527 50 Updated Apr 17, 2026

A list of works on video generation towards world model

459 10 Updated Mar 21, 2026

Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"

Python 1,269 72 Updated Jan 5, 2026

HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency

Python 1,450 129 Updated Apr 15, 2026

Consistent Autoregressive Video Generation with Long Context

81 2 Updated Feb 6, 2026

Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation"

Python 570 31 Updated Apr 18, 2026

Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.

Python 1,091 143 Updated Apr 17, 2026

Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals

Python 1,702 130 Updated Apr 19, 2026

Public repository for Agent Skills

Python 120,401 13,954 Updated Apr 16, 2026

Your own professional personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 188 19 Updated Apr 18, 2026

Advancing Open-source World Models

Python 3,470 293 Updated Apr 10, 2026

A Pragmatic VLA Foundation Model

Python 1,073 91 Updated Mar 12, 2026

[ICLR 2026] TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching

Jupyter Notebook 847 79 Updated Jan 28, 2026

Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepowe…

TeX 7,079 549 Updated Apr 13, 2026