Starred repositories

HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds

Python 1,671 131 Updated Apr 24, 2026

The most powerful local music generation model that outperforms almost all commercial alternatives, supporting Mac, AMD, Intel, and CUDA devices.

Python 9,703 1,149 Updated Apr 24, 2026

Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025), UltraViCo (ICLR 2026), and UltraImage

Python 807 75 Updated Mar 8, 2026

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Python 130 6 Updated Mar 31, 2026

[ICLR 26 Oral] Stable Video Infinity: Infinite-Length Video Generation with Error Recycling

Python 2,376 200 Updated Jan 19, 2026

Official Implementation of "MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives"

Python 200 7 Updated Dec 29, 2025

[ICLR 2026] LongLive: Real-time Interactive Long Video Generation

Python 1,183 112 Updated Feb 26, 2026

Official implementation of "Repurposing Geometric Foundation Models for Multi-view Diffusion"

Python 189 7 Updated Apr 1, 2026

(TPAMI 2026) Learning Continuous Wasserstein Barycenter Space for Generalized All-in-One Image Restoration

Python 109 4 Updated Apr 23, 2026

[CVPR 2025] VideoWorld is a simple generative model that learns purely from unlabeled videos—much like how babies learn by observing their environment.

Python 769 40 Updated Feb 25, 2026

To pioneer training long-context multi-modal transformer models

Python 73 10 Updated Aug 8, 2025

ViPE: Video Pose Engine for Geometric 3D Perception

Python 1,880 147 Updated Apr 15, 2026

Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition

Python 710 75 Updated Nov 28, 2025

Code and website for Self-Flow: Self-Supervised Flow Matching for Scalable Multi-Modal Synthesis

Python 450 15 Updated Mar 15, 2026

DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation

Python 253 15 Updated Mar 25, 2026

Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory

Python 2,171 234 Updated Mar 30, 2026

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…

1,580 51 Updated Apr 27, 2026

A list of works on video generation towards world models

465 10 Updated Mar 21, 2026

Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"

Python 1,278 73 Updated Jan 5, 2026

HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency

Python 1,475 132 Updated Apr 15, 2026

Consistent Autoregressive Video Generation with Long Context

81 2 Updated Feb 6, 2026

Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation"

Python 582 36 Updated Apr 25, 2026

Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.

Python 1,121 148 Updated Apr 17, 2026

Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals

Python 1,774 139 Updated Apr 19, 2026

Public repository for Agent Skills

Python 124,924 14,634 Updated Apr 23, 2026

Your own professional personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 188 19 Updated Apr 21, 2026

Advancing Open-source World Models

Python 3,608 309 Updated Apr 10, 2026

A Pragmatic VLA Foundation Model

Python 1,106 98 Updated Mar 12, 2026

[ICLR 2026] TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching

Jupyter Notebook 851 79 Updated Jan 28, 2026

Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepowe…

TeX 7,478 576 Updated Apr 13, 2026