Skip to content
View yamy-cheng's full-sized avatar
🌴
On vacation
🌴
On vacation

Block or report yamy-cheng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,240 1,120 Updated Aug 27, 2025

Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

Python 11,030 1,801 Updated Oct 29, 2025

LeetCode 101:力扣刷题指南

9,788 1,246 Updated Dec 8, 2024

Zhejiang University Graduation Thesis LaTeX Template

TeX 3,229 684 Updated Sep 8, 2025

[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 3,112 309 Updated Dec 21, 2024

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…

Jupyter Notebook 3,067 354 Updated Apr 25, 2024

[NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ MoE ckpt released! Only 4GB VRAM is enough to run!

Python 2,014 112 Updated Oct 29, 2025

[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model

Python 1,914 204 Updated Nov 15, 2024

[ICCV 2023] Tracking Anything with Decoupled Video Segmentation

Python 1,444 137 Updated Apr 26, 2025

Zero-shot Image-to-Image Translation [SIGGRAPH 2023]

Python 1,133 82 Updated Oct 16, 2024

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.

Python 1,104 71 Updated Feb 7, 2025

Official Pytorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)

Python 989 64 Updated Jun 19, 2023

[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation

Python 973 88 Updated Nov 8, 2024

[ICCV 2023] "TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition" (Official Implementation)

Python 821 104 Updated Mar 6, 2025

Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)

Jupyter Notebook 755 62 Updated Jan 26, 2024

DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support

Python 700 60 Updated Mar 22, 2024

A LaTeX resume template designed for optimal information density and aesthetic appeal.

TeX 590 61 Updated Jun 26, 2024

Official implementation of the paper “Inversion-Based Style Transfer with Diffusion Models” (CVPR 2023)

Jupyter Notebook 582 56 Updated Jun 18, 2024

[NeurIPS 2021] Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation

Python 561 71 Updated Mar 15, 2024

[ICCV 2023] VPD is a framework that leverages the high-level and low-level knowledge of a pre-trained text-to-image diffusion model to downstream visual perception tasks.

Jupyter Notebook 528 32 Updated Dec 21, 2023

A curated list of papers, code and resources pertaining to few-shot image generation.

373 46 Updated Jun 3, 2023

A list of video object segmentation (VOS) papers

297 28 Updated Oct 22, 2025

Official code for ICCV 2023 paper: "Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation".

Python 292 30 Updated May 30, 2025

CVPR2022 - Deep Hierarchical Semantic Segmentation - A structured, pixel-wise description of visual scenes in terms of the class hierarchy.

Python 252 26 Updated Apr 24, 2023

Repository of our CVPR2023 paper "Lana: A Language-Capable Navigator for Instruction Following and Generation"

Python 92 20 Updated Apr 27, 2023

[ICCV 2023] Phasic Content Fusing Diffusion Model with Directional Distribution Consistency for Few-Shot Model Adaption

Python 63 3 Updated Dec 7, 2023

Official implementation of “JOTR: 3D Joint Contrastive Learning with Transformers for Occluded Human Mesh Recovery“

Python 37 1 Updated Aug 21, 2023

DMAOT ranked 1st in the VOTS 2023 challenge.

Python 16 3 Updated Dec 21, 2023

[TIP 2023] Co-Learning Meets Stitch-Up for Noisy Multi-label Visual Recognition.

Python 13 Updated Aug 19, 2023
Next