Skip to content
View wufan-cse's full-sized avatar
☺️
☺️

Block or report wufan-cse

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 1,784 105 Updated Nov 4, 2025

Depth Anything 3

Python 3,692 321 Updated Dec 12, 2025

Builder and index for PyTorch packages

Python 305 38 Updated Dec 7, 2025

VITRA: Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos

Python 218 9 Updated Dec 23, 2025

A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.

1,552 66 Updated Dec 22, 2025

A Video Tokenizer Evaluation Dataset

Python 145 10 Updated Jan 13, 2025

This repository contains the code of the paper "IC-World: In-Context Generation for Shared World Modeling".

Python 86 Updated Dec 18, 2025

Official repository for LTX-Video

Python 8,931 838 Updated Oct 25, 2025

A toolbox for spectral compressive imaging reconstruction including MST (CVPR 2022), CST (ECCV 2022), DAUHST (NeurIPS 2022), BiSCI (NeurIPS 2023), HDNet (CVPR 2022), MST++ (CVPRW 2022), etc.

Python 1,113 87 Updated Oct 10, 2025

The repository for CVPR 2022 Paper "Neural 3D Video Synthesis"

337 12 Updated Mar 26, 2022

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Python 5,010 400 Updated Jul 10, 2024

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Python 2,239 179 Updated Mar 6, 2025

[ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy

Python 869 39 Updated Sep 26, 2025

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 1,384 96 Updated Dec 11, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 32,214 6,633 Updated Dec 25, 2025

Code of π^3: Permutation-Equivariant Visual Geometry Learning

Python 1,493 78 Updated Dec 20, 2025

Light Video Generation Inference Framework

Python 1,410 93 Updated Dec 25, 2025

An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation

Python 1,374 67 Updated Oct 16, 2025

A unified inference and post-training framework for accelerated video generation.

Python 2,862 228 Updated Dec 24, 2025

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 1,137 55 Updated Mar 5, 2025

Open3D: A Modern Library for 3D Data Processing

C++ 13,136 2,502 Updated Nov 23, 2025

4-steps distilled version of Wan2.2-TI2V-5B

Python 126 9 Updated Sep 12, 2025

Unofficial extension implementation of Self-Forcing to support I2V && 14B training.

Python 301 20 Updated Sep 29, 2025

The most widely used, high performance Minecraft server that aims to fix gameplay and mechanics inconsistencies

Java 11,787 3,178 Updated Dec 23, 2025

Simulating Large-Scale Multi-Agent Interactions with Limited Multimodal Senses and Physical Needs

Python 103 24 Updated Sep 30, 2025

An Open-Ended Embodied Agent with Large Language Models

JavaScript 5 4 Updated Dec 29, 2024

An Open-Ended Embodied Agent with Large Language Models

JavaScript 6,550 623 Updated Apr 3, 2024

Minecraft AI with LLMs+Mineflayer

JavaScript 4,537 627 Updated Dec 14, 2025

MineRL Competition for Sample Efficient Reinforcement Learning - Python Package

Java 893 167 Updated Jan 22, 2025

Building Open-Ended Embodied Agents with Internet-Scale Knowledge

Java 2,104 187 Updated Mar 18, 2024
Next