Skip to content
View fengjiasun's full-sized avatar

Block or report fengjiasun

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Python 116 5 Updated Mar 31, 2026

[ICLR 26 Oral] Stable Video Infinity: Infinite-Length Video Generation with Error Recycling

Python 2,318 193 Updated Jan 19, 2026

Official Implementation of "MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives"

Python 196 7 Updated Dec 29, 2025

[ICLR 2026] LongLive: Real-time Interactive Long Video Generation

Python 1,147 106 Updated Feb 26, 2026

Official implementation of "Repurposing Geometric Foundation Models for Multi-view Diffusion"

Python 176 6 Updated Apr 1, 2026

(TPAMI 2026) Learning Continuous Wasserstein Barycenter Space for Generalized All-in-One Image Restoration

Python 92 3 Updated Apr 9, 2026

[CVPR 2025] VideoWorld is a simple generative model that learns purely from unlabeled videos—much like how babies learn by observing their environment.

Python 760 41 Updated Feb 25, 2026

To pioneer training long-context multi-modal transformer models

Python 74 10 Updated Aug 8, 2025

ViPE: Video Pose Engine for Geometric 3D Perception

Python 1,837 146 Updated Jan 1, 2026

Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition

Python 705 75 Updated Nov 28, 2025

Code and website for Self-Flow: Self-Supervised Flow Matching for Scalable Multi-Modal Synthesis

Python 435 17 Updated Mar 15, 2026

DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation

Python 242 15 Updated Mar 25, 2026

Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory

Python 2,088 225 Updated Mar 30, 2026

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…

1,470 48 Updated Apr 11, 2026

A list of works on video generation towards world model

455 9 Updated Mar 21, 2026

Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"

Python 1,269 71 Updated Jan 5, 2026

HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency

Python 1,396 123 Updated Mar 24, 2026

Consistent Autoregressive Video Generation with Long Context

81 2 Updated Feb 6, 2026

Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation"

Python 551 30 Updated Apr 8, 2026

Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.

Python 1,065 132 Updated Apr 3, 2026

Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals

Python 1,639 127 Updated Mar 18, 2026

Public repository for Agent Skills

Python 114,851 13,130 Updated Apr 9, 2026

Your own professional personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 188 20 Updated Apr 9, 2026

Advancing Open-source World Models

Python 3,349 278 Updated Apr 10, 2026

A Pragmatic VLA Foundation Model

Python 1,035 89 Updated Mar 12, 2026

[ICLR 2026] TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching

Jupyter Notebook 847 78 Updated Jan 28, 2026

Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepowe…

TeX 6,562 515 Updated Apr 10, 2026

LLM驱动的 A/H/美股智能分析器:多数据源行情 + 实时新闻 + LLM决策仪表盘 + 多渠道推送,零成本定时运行,纯白嫖. LLM-powered stock analysis system for A/H/US markets.

Python 29,238 29,926 Updated Apr 10, 2026

Open-Source Frontier Voice AI

Python 38,477 4,440 Updated Apr 10, 2026

Code for "Diffusion Model Alignment Using Direct Preference Optimization"

Python 677 49 Updated Nov 10, 2025
Next