An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…

Jupyter Notebook 3,067 354 Updated Apr 25, 2024

River-Zhang / ICEdit

[NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ MoE ckpt released! Only 4GB VRAM is enough to run!

Python 2,014 112 Updated Oct 29, 2025

hkchengrex / XMem

[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model

Python 1,914 204 Updated Nov 15, 2024

hkchengrex / Tracking-Anything-with-DEVA

[ICCV 2023] Tracking Anything with Decoupled Video Segmentation

Python 1,444 137 Updated Apr 26, 2025

pix2pixzero / pix2pix-zero

Zero-shot Image-to-Image Translation [SIGGRAPH 2023]

Python 1,133 82 Updated Oct 16, 2024

rhymes-ai / Allegro

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.

Python 1,104 71 Updated Feb 7, 2025

MichalGeyer / plug-and-play

Official Pytorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)

Python 989 64 Updated Jun 19, 2023

hkchengrex / Cutie

[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation

Python 973 88 Updated Nov 8, 2024

Shilin-LU / TF-ICON

[ICCV 2023] "TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition" (Official Implementation)

Python 821 104 Updated Mar 6, 2025

yuval-alaluf / Attend-and-Excite

Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)

Jupyter Notebook 755 62 Updated Jan 26, 2024

kvablack / ddpo-pytorch

DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support

Python 700 60 Updated Mar 22, 2024

fky2015 / resume-ng

A LaTeX resume template designed for optimal information density and aesthetic appeal.

TeX 590 61 Updated Jun 26, 2024

zyxElsa / InST

Official implementation of the paper “Inversion-Based Style Transfer with Diffusion Models” (CVPR 2023)

Jupyter Notebook 582 56 Updated Jun 18, 2024

hkchengrex / STCN

[NeurIPS 2021] Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation

Python 561 71 Updated Mar 15, 2024

wl-zhao / VPD

[ICCV 2023] VPD is a framework that leverages the high-level and low-level knowledge of a pre-trained text-to-image diffusion model to downstream visual perception tasks.

Jupyter Notebook 528 32 Updated Dec 21, 2023