Skip to content
View fengjiasun's full-sized avatar

Block or report fengjiasun

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025) , UltraViCo (ICLR 2026) and UltraImage

Python 802 75 Updated Mar 8, 2026

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Python 116 6 Updated Mar 31, 2026

[ICLR 26 Oral] Stable Video Infinity: Infinite-Length Video Generation with Error Recycling

Python 2,323 193 Updated Jan 19, 2026

Official Implementation of "MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives"

Python 196 7 Updated Dec 29, 2025

[ICLR 2026] LongLive: Real-time Interactive Long Video Generation

Python 1,150 106 Updated Feb 26, 2026

Official implementation of "Repurposing Geometric Foundation Models for Multi-view Diffusion"

Python 177 7 Updated Apr 1, 2026

(TPAMI 2026) Learning Continuous Wasserstein Barycenter Space for Generalized All-in-One Image Restoration

Python 95 3 Updated Apr 9, 2026

[CVPR 2025] VideoWorld is a simple generative model that learns purely from unlabeled videos—much like how babies learn by observing their environment.

Python 761 41 Updated Feb 25, 2026

To pioneer training long-context multi-modal transformer models

Python 74 10 Updated Aug 8, 2025

ViPE: Video Pose Engine for Geometric 3D Perception

Python 1,841 145 Updated Jan 1, 2026

Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition

Python 705 75 Updated Nov 28, 2025

Code and website for Self-Flow: Self-Supervised Flow Matching for Scalable Multi-Modal Synthesis

Python 437 17 Updated Mar 15, 2026

DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation

Python 243 15 Updated Mar 25, 2026

Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory

Python 2,101 226 Updated Mar 30, 2026

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…

1,488 49 Updated Apr 13, 2026

A list of works on video generation towards world model

457 10 Updated Mar 21, 2026

Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"

Python 1,271 71 Updated Jan 5, 2026

HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency

Python 1,401 123 Updated Mar 24, 2026

Consistent Autoregressive Video Generation with Long Context

81 2 Updated Feb 6, 2026

Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation"

Python 553 31 Updated Apr 8, 2026

Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.

Python 1,071 133 Updated Apr 3, 2026

Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals

Python 1,655 128 Updated Mar 18, 2026

Public repository for Agent Skills

Python 116,294 13,347 Updated Apr 9, 2026

Your own professional personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 188 20 Updated Apr 9, 2026

Advancing Open-source World Models

Python 3,359 278 Updated Apr 10, 2026

A Pragmatic VLA Foundation Model

Python 1,037 89 Updated Mar 12, 2026

[ICLR 2026] TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching

Jupyter Notebook 847 78 Updated Jan 28, 2026

Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepowe…

TeX 6,701 522 Updated Apr 10, 2026

LLM驱动的 A/H/美股智能分析器:多数据源行情 + 实时新闻 + LLM决策仪表盘 + 多渠道推送,零成本定时运行,纯白嫖. LLM-powered stock analysis system for A/H/US markets.

Python 29,629 30,339 Updated Apr 12, 2026

Open-Source Frontier Voice AI

Python 39,160 4,531 Updated Apr 10, 2026
Next