Skip to content
View yamy-cheng's full-sized avatar
🌴
On vacation
🌴
On vacation

Block or report yamy-cheng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
32 results for source starred repositories
Clear filter

[NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ MoE ckpt released! Only 4GB VRAM is enough to run!

Python 2,014 112 Updated Oct 29, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,240 1,119 Updated Aug 27, 2025

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.

Python 1,104 71 Updated Feb 7, 2025

[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 3,112 309 Updated Dec 21, 2024

DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support

Python 700 60 Updated Mar 22, 2024

[NeurIPS 2021] Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation

Python 561 71 Updated Mar 15, 2024

[ICCV 2023] Tracking Anything with Decoupled Video Segmentation

Python 1,444 137 Updated Apr 26, 2025

[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model

Python 1,914 204 Updated Nov 15, 2024

[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation

Python 973 88 Updated Nov 8, 2024

MS-AOT: Winner of VOT-STs2022 and VOT-RTs2022 (real-time)

Python 8 Updated Dec 25, 2023

DMAOT ranked 1st in the VOTS 2023 challenge.

Python 16 3 Updated Dec 21, 2023

Zhejiang University Graduation Thesis LaTeX Template

TeX 3,229 684 Updated Sep 8, 2025

A list of video object segmentation (VOS) papers

297 28 Updated Oct 22, 2025

[TIP 2023] Co-Learning Meets Stitch-Up for Noisy Multi-label Visual Recognition.

Python 13 Updated Aug 19, 2023

[ICCV 2023] Phasic Content Fusing Diffusion Model with Directional Distribution Consistency for Few-Shot Model Adaption

Python 63 3 Updated Dec 7, 2023

Official implementation of the paper “Inversion-Based Style Transfer with Diffusion Models” (CVPR 2023)

Jupyter Notebook 582 56 Updated Jun 18, 2024

Official code for ICCV 2023 paper: "Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation".

Python 292 30 Updated May 30, 2025

[ICCV 2023] "TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition" (Official Implementation)

Python 821 104 Updated Mar 6, 2025

Official implementation of “JOTR: 3D Joint Contrastive Learning with Transformers for Occluded Human Mesh Recovery“

Python 37 1 Updated Aug 21, 2023

LeetCode 101:力扣刷题指南

9,788 1,246 Updated Dec 8, 2024

Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

Python 11,030 1,801 Updated Oct 29, 2025

[ICCV 2023] VPD is a framework that leverages the high-level and low-level knowledge of a pre-trained text-to-image diffusion model to downstream visual perception tasks.

Jupyter Notebook 528 32 Updated Dec 21, 2023

A LaTeX resume template designed for optimal information density and aesthetic appeal.

TeX 590 61 Updated Jun 26, 2024

Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)

Jupyter Notebook 755 62 Updated Jan 26, 2024

Official Pytorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)

Python 989 64 Updated Jun 19, 2023

Zero-shot Image-to-Image Translation [SIGGRAPH 2023]

Python 1,133 82 Updated Oct 16, 2024

A curated list of papers, code and resources pertaining to few-shot image generation.

373 46 Updated Jun 3, 2023

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…

Jupyter Notebook 3,067 354 Updated Apr 25, 2024

Repository of our CVPR2023 paper "Lana: A Language-Capable Navigator for Instruction Following and Generation"

Python 92 20 Updated Apr 27, 2023
Next