Skip to content
View yamy-cheng's full-sized avatar
🌴
On vacation
🌴
On vacation

Block or report yamy-cheng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
23 stars written in Python
Clear filter

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,240 1,120 Updated Aug 27, 2025

Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

Python 11,030 1,801 Updated Oct 29, 2025

[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 3,112 309 Updated Dec 21, 2024

[NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ MoE ckpt released! Only 4GB VRAM is enough to run!

Python 2,014 112 Updated Oct 29, 2025

[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model

Python 1,914 204 Updated Nov 15, 2024

[ICCV 2023] Tracking Anything with Decoupled Video Segmentation

Python 1,444 137 Updated Apr 26, 2025

Zero-shot Image-to-Image Translation [SIGGRAPH 2023]

Python 1,133 82 Updated Oct 16, 2024

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.

Python 1,104 71 Updated Feb 7, 2025

Official Pytorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)

Python 989 64 Updated Jun 19, 2023

[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation

Python 973 88 Updated Nov 8, 2024

[ICCV 2023] "TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition" (Official Implementation)

Python 821 104 Updated Mar 6, 2025

DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support

Python 700 60 Updated Mar 22, 2024

[NeurIPS 2021] Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation

Python 561 71 Updated Mar 15, 2024

Official code for ICCV 2023 paper: "Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation".

Python 292 30 Updated May 30, 2025

CVPR2022 - Deep Hierarchical Semantic Segmentation - A structured, pixel-wise description of visual scenes in terms of the class hierarchy.

Python 252 26 Updated Apr 24, 2023

Repository of our CVPR2023 paper "Lana: A Language-Capable Navigator for Instruction Following and Generation"

Python 92 20 Updated Apr 27, 2023

[ICCV 2023] Phasic Content Fusing Diffusion Model with Directional Distribution Consistency for Few-Shot Model Adaption

Python 63 3 Updated Dec 7, 2023

Official implementation of “JOTR: 3D Joint Contrastive Learning with Transformers for Occluded Human Mesh Recovery“

Python 37 1 Updated Aug 21, 2023

DMAOT ranked 1st in the VOTS 2023 challenge.

Python 16 3 Updated Dec 21, 2023

[TIP 2023] Co-Learning Meets Stitch-Up for Noisy Multi-label Visual Recognition.

Python 13 Updated Aug 19, 2023

MS-AOT: Winner of VOT-STs2022 and VOT-RTs2022 (real-time)

Python 8 Updated Dec 25, 2023

pytorch_note

Python 4 1 Updated Sep 26, 2022