[NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ MoE ckpt released! Only 4GB VRAM is enough to run!

Python 2,014 112 Updated Oct 29, 2025

hkchengrex / XMem

[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model

Python 1,914 204 Updated Nov 15, 2024

hkchengrex / Tracking-Anything-with-DEVA

[ICCV 2023] Tracking Anything with Decoupled Video Segmentation

Python 1,444 137 Updated Apr 26, 2025

pix2pixzero / pix2pix-zero

Zero-shot Image-to-Image Translation [SIGGRAPH 2023]

Python 1,133 82 Updated Oct 16, 2024

rhymes-ai / Allegro

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.

Python 1,104 71 Updated Feb 7, 2025

MichalGeyer / plug-and-play

Official PyTorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)

Python 989 64 Updated Jun 19, 2023

hkchengrex / Cutie

[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation

Python 973 88 Updated Nov 8, 2024

Shilin-LU / TF-ICON

[ICCV 2023] "TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition" (Official Implementation)

Python 821 104 Updated Mar 6, 2025

kvablack / ddpo-pytorch

DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support

Python 700 60 Updated Mar 22, 2024

hkchengrex / STCN

[NeurIPS 2021] Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation

Python 561 71 Updated Mar 15, 2024

yuangan / EAT_code

Official code for ICCV 2023 paper: "Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation".

Python 292 30 Updated May 30, 2025

lingorX / HieraSeg

CVPR2022 - Deep Hierarchical Semantic Segmentation - A structured, pixel-wise description of visual scenes in terms of the class hierarchy.

Python 252 26 Updated Apr 24, 2023

benkyoujouzu / stable-diffusion-webui-visualize-cross-attention-extension

Python 114 7 Updated Sep 18, 2023

wxh1996 / LANA-VLN

Repository of our CVPR2023 paper "Lana: A Language-Capable Navigator for Instruction Following and Generation"

Python 92 20 Updated Apr 27, 2023

sjtuplayer / few-shot-diffusion

[ICCV 2023] Phasic Content Fusing Diffusion Model with Directional Distribution Consistency for Few-Shot Model Adaption

Python 63 3 Updated Dec 7, 2023

xljh0520 / JOTR

Official implementation of “JOTR: 3D Joint Contrastive Learning with Transformers for Occluded Human Mesh Recovery”

Python 37 1 Updated Aug 21, 2023

yamy-cheng / DMAOT-VOTS2023

DMAOT ranked 1st in the VOTS 2023 challenge.

Python 16 3 Updated Dec 21, 2023

VamosC / CoLearning-meet-StitchUp

[TIP 2023] Co-Learning Meets Stitch-Up for Noisy Multi-label Visual Recognition.

Python 13 Updated Aug 19, 2023

yamy-cheng / MSAOT-VOT2022

MS-AOT: Winner of VOT-STs2022 and VOT-RTs2022 (real-time)

Python 8 Updated Dec 25, 2023

xljh0520 / note

pytorch_note

Python 4 1 Updated Sep 26, 2022
