Stars
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The …
FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, Comfy…
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
Let's make video diffusion practical!
Ostensibly619 / IDM-VTON
Forked from yisol/IDM-VTON
[ECCV2024] IDM-VTON: Improving Diffusion Models for Authentic Virtual Try-on in the Wild
Tiled Diffusion, MultiDiffusion, Mixture of Diffusers, and optimized VAE
Ostensibly619 / MagicQuill
Forked from ant-research/MagicQuill
Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
Ostensibly619 / flux
Forked from black-forest-labs/flux
Official inference repo for FLUX.1 models
Fast and Simple Face Swap Extension Node for ComfyUI (SFW)
Bring portraits to life!
Custom nodes pack for ComfyUI. This node pack helps to conveniently enhance images through Detector, Detailer, Upscaler, Pipe, and more.
Ostensibly619 / MMAudio
Forked from hkchengrex/MMAudio
[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
Ostensibly619 / OmniGen
Forked from VectorSpaceLab/OmniGen
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
Ostensibly619 / open_clip
Forked from mlfoundations/open_clip
An open source implementation of CLIP.
Ostensibly619 / CogVideo
Forked from zai-org/CogVideo
Text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Ostensibly619 / facefusion
Forked from facefusion/facefusion
Industry leading face manipulation platform
Ostensibly619 / FramePack
Forked from lllyasviel/FramePack
Let's make video diffusion practical!
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
GPT4V-level open-source multi-modal model based on Llama3-8B
Stable Diffusion web UI