This is the official PyTorch implementation of the paper "TransFG: A Transformer Architecture for Fine-grained Recognition" (Ju He, Jie-Neng Chen, Shuai Liu, Adam Kortylewski, Cheng Yang, Yutong Ba…

Python 415 90 Updated Oct 4, 2022

GenEx-world / genex

Generative World Explorer

Python 161 8 Updated Jun 14, 2025

Tiezheng11 / Vision-Language-Vision

Python 63 6 Updated Jul 11, 2025

adobe-research / EditVerse

Official repo for paper "EditVerse: Unifying Image and Video Editing and Generation with In-Context Learning"

Python 109 3 Updated Oct 9, 2025

YiAi03 / FMU

6 Updated Oct 7, 2025

OpenGVLab / PhyGenBench

[ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation

Python 138 6 Updated Oct 25, 2024

Kai-46 / minFM

HTML 167 9 Updated Oct 27, 2025

XueZeyue / DanceGRPO

An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation

Python 1,288 65 Updated Oct 16, 2025

openai / gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,315 1,947 Updated Nov 1, 2025

modelscope / DiffSynth-Studio

Enjoy the magic of Diffusion models!

Python 10,829 1,015 Updated Nov 27, 2025

Wan-Video / Wan2.2

Wan: Open and Advanced Large-Scale Video Generative Models

Python 12,157 1,388 Updated Nov 14, 2025

ZhaoYujie2002 / LangSplatV2

[NeurIPS 2025] LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS

Python 149 9 Updated Oct 17, 2025

yuyouxixi / x2-gaussian

[ICCV 2025] Official implementation of X2-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction

Python 41 1 Updated Oct 27, 2025

MrGiovanni / CARE

[NeurIPS 2025] Completeness-Aware Reconstruction Enhancement

Python 29 1 Updated Oct 18, 2025

caiyuanhao1998 / Open-OmniVCus

OmniVCus: Feedforward Subject-driven Video Customization with Multimodal Control Conditions (NeurIPS 2025)

83 6 Updated Sep 19, 2025

VITA-Group / VideoLifter

[3DV 2026] VideoLifter: Lifting Videos to 3D with Fast Hierarchical Stereo Alignment

Python 139 3 Updated Jan 21, 2025

Brack-Wang / X-Field

Official Implementation of X-Filed. Code coming soon.

23 1 Updated Oct 20, 2025

caiyuanhao1998 / X-LRM

A toolbox for feedforward sparse-view CT reconstruction

17 Updated Mar 9, 2025

lyuxi / SAH-SCI

SAH-SCI: Self-Supervised Adapter for Efficient Hyperspectral Snapshot Compressive Imaging

Python 15 Updated Oct 24, 2024

redrock303 / NeRFLiX_CVPR2023

official NeRFLiX implementation

Python 105 10 Updated Jul 18, 2023

showlab / Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, and various other applications.

5,229 322 Updated Nov 27, 2025

EnVision-Research / LucidFusion

Official implementation of “LucidFusion: Reconstructing 3D Gaussians with Arbitrary Unposed Images”

Python 72 4 Updated Mar 21, 2025

yeates / awesome-instruction-prompted-vision

A curated list of instruction-prompted visual translation papers

Python 8 Updated Feb 14, 2024

yeates / MMT

[ECCV22] Unbiased Multi-Modality Guidance for Image Inpainting

Python 33 1 Updated Aug 7, 2022

yeates / DMT

Deficiency-Aware Masked Transformer for Video Inpainting

54 1 Updated Dec 11, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Yuanhao Cai caiyuanhao1998

Block or report caiyuanhao1998

Stars

ChenLiu-1996 / CitationMap

caiyuanhao1998 / Open-DiffusionGS

pittisl / PhyT2V

XingruiWang / Spatial457

JiahaoPlus / EvoWorld

TACJu / TransFG