Skip to content
View caiyuanhao1998's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report caiyuanhao1998

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A simple pip-installable Python tool to generate your own HTML citation world map from your Google Scholar ID.

Python 632 57 Updated Oct 14, 2025

Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction (ICCV 2025)

Python 645 32 Updated Nov 24, 2025

official code repo of CVPR 2025 paper PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation

Python 55 4 Updated Jul 31, 2025

[CVPR'25] A vision question answering (VQA) benchmark for 6D spatial reasoning.

Python 16 2 Updated Jun 17, 2025

EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory

Python 52 Updated Oct 18, 2025

This is the official PyTorch implementation of the paper "TransFG: A Transformer Architecture for Fine-grained Recognition" (Ju He, Jie-Neng Chen, Shuai Liu, Adam Kortylewski, Cheng Yang, Yutong Ba…

Python 415 90 Updated Oct 4, 2022

Generative World Explorer

Python 161 8 Updated Jun 14, 2025

Official repo for paper "EditVerse: Unifying Image and Video Editing and Generation with In-Context Learning"

Python 109 3 Updated Oct 9, 2025
6 Updated Oct 7, 2025

[ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation

Python 138 6 Updated Oct 25, 2024
HTML 167 9 Updated Oct 27, 2025

An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation

Python 1,288 65 Updated Oct 16, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,315 1,947 Updated Nov 1, 2025

Enjoy the magic of Diffusion models!

Python 10,829 1,015 Updated Nov 27, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 12,157 1,388 Updated Nov 14, 2025

[NeurIPS 2025] LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS

Python 149 9 Updated Oct 17, 2025

[ICCV 2025] Official implementation of X2-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction

Python 41 1 Updated Oct 27, 2025

[NeurIPS 2025] Completeness-Aware Reconstruction Enhancement

Python 29 1 Updated Oct 18, 2025

OmniVCus: Feedforward Subject-driven Video Customization with Multimodal Control Conditions (NeurIPS 2025)

83 6 Updated Sep 19, 2025

[3DV 2026] VideoLifter: Lifting Videos to 3D with Fast Hierarchical Stereo Alignment

Python 139 3 Updated Jan 21, 2025

Official Implementation of X-Filed. Code coming soon.

23 1 Updated Oct 20, 2025

A toolbox for feedforward sparse-view CT reconstruction

17 Updated Mar 9, 2025

SAH-SCI: Self-Supervised Adapter for Efficient Hyperspectral Snapshot Compressive Imaging

Python 15 Updated Oct 24, 2024

official NeRFLiX implementation

Python 105 10 Updated Jul 18, 2023

A curated list of recent diffusion models for video generation, editing, and various other applications.

5,229 322 Updated Nov 27, 2025

Official implementation of “LucidFusion: Reconstructing 3D Gaussians with Arbitrary Unposed Images”

Python 72 4 Updated Mar 21, 2025

A curated list of instruction-prompted visual translation papers

Python 8 Updated Feb 14, 2024

[ECCV22] Unbiased Multi-Modality Guidance for Image Inpainting

Python 33 1 Updated Aug 7, 2022

Deficiency-Aware Masked Transformer for Video Inpainting

54 1 Updated Dec 11, 2023
Next