CatVTON Public
Forked from Zheng-Chong/CatVTON
CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters in total), 2) Parameter-Efficient Training (49.57M trainable parameters) and 3) Simpl…
Python Other Updated Oct 21, 2024
MagicTailor Public
Forked from Correr-Zhou/MagicTailor
Official implementation of the paper "MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models".
Apache License 2.0 Updated Oct 18, 2024
ctrlora Public
Forked from xyfJASON/ctrlora
Codebase for "CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation"
Python Apache License 2.0 Updated Oct 18, 2024
MotionClone Public
Forked from LPengYang/MotionClone
Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation
Python Updated Oct 16, 2024
Show-o Public
Forked from showlab/Show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
Python Apache License 2.0 Updated Oct 14, 2024
Monkey Public
Forked from Yuliang-Liu/Monkey
【CVPR 2024】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
Python MIT License Updated Oct 14, 2024
IterComp Public
Forked from YangLing0818/IterComp
IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation
Python MIT License Updated Oct 12, 2024
SageAttention Public
Forked from thu-ml/SageAttention
Quantized Attention that achieves speedups of 2.1x and 2.7x compared to FlashAttention2 and xformers, respectively, without losing end-to-end metrics across various models.
Python BSD 3-Clause "New" or "Revised" License Updated Oct 9, 2024
ControlNeXt Public
Forked from dvlab-research/ControlNeXt
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
Python Apache License 2.0 Updated Oct 9, 2024
TextHarmony Public
Forked from bytedance/TextHarmony
The official code for NeurIPS 2024 paper: Harmonizing Visual Text Comprehension and Generation
Apache License 2.0 Updated Oct 9, 2024
ControlNetPlus Public
Forked from xinsir6/ControlNetPlus
ControlNet++: All-in-one ControlNet for image generations and editing!
Python Apache License 2.0 Updated Sep 30, 2024
MagicTime Public
Forked from PKU-YuanGroup/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Python Apache License 2.0 Updated Sep 30, 2024
Open-Sora Public
Forked from hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Python Apache License 2.0 Updated Sep 29, 2024
OpenDiT Public
Forked from NUS-HPC-AI-Lab/VideoSys
OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference
Python Apache License 2.0 Updated Sep 29, 2024
HivisionIDPhotos Public
Forked from Zeyi-Lin/HivisionIDPhotos
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photo tool. A lightweight algorithm for creating AI ID photos.
Python Apache License 2.0 Updated Sep 28, 2024
Latte Public
Forked from Vchitect/Latte
Latte: Latent Diffusion Transformer for Video Generation.
Python Apache License 2.0 Updated Sep 28, 2024
flux Public
Forked from black-forest-labs/flux
Official inference repo for FLUX.1 models
Python Apache License 2.0 Updated Sep 24, 2024
InstantDrag Public
Forked from SNU-VGILab/InstantDrag
InstantDrag: Improving Interactivity in Drag-based Image Editing
Python Other Updated Sep 20, 2024
AnyV2V Public
Forked from TIGER-AI-Lab/AnyV2V
A Plug-and-Play Framework For Any Video-to-Video Editing Tasks
Jupyter Notebook MIT License Updated Sep 19, 2024
champ Public
Forked from fudan-generative-vision/champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Python MIT License Updated Sep 18, 2024
GOT-OCR2.0- Public
Forked from Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Python Updated Sep 14, 2024
Cinemo Public
Forked from maxin-cn/Cinemo
Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models
Python Apache License 2.0 Updated Sep 13, 2024
FollowYourEmoji Public
Forked from mayuelala/FollowYourEmoji
[Siggraph Asia 2024] Follow-Your-Emoji: This repo is the official implementation of "Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation"
Python Updated Sep 11, 2024
MOFA-Video Public
Forked from MyNiuuu/MOFA-Video
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
Python Other Updated Sep 11, 2024
ComfyUI-layerdiffuse Public
Forked from huchenlei/ComfyUI-layerdiffuse
Layer Diffuse custom nodes
Python Apache License 2.0 Updated Aug 27, 2024
manga-image-translator Public
Forked from zyddnys/manga-image-translator
Translate manga/image: one-click translation of text in all kinds of images. https://cotrans.touhou.ai/
Python GNU General Public License v3.0 Updated Aug 24, 2024
EchoMimic Public
Forked from antgroup/echomimic
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Python Apache License 2.0 Updated Aug 21, 2024
VEnhancer Public
Forked from Vchitect/VEnhancer
Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation
Python Updated Aug 20, 2024
Lumina-mGPT Public
Forked from Alpha-VLLM/Lumina-mGPT
Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"
Python Updated Aug 16, 2024