Stars
Ostensibly619 / LivePortrait
Forked from KlingTeam/LivePortraitBring portraits to life!
Ostensibly619 / MMAudio
Forked from hkchengrex/MMAudio[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
Ostensibly619 / FramePack
Forked from lllyasviel/FramePackLets make video diffusion practical!
Lets make video diffusion practical!
Ostensibly619 / facefusion
Forked from facefusion/facefusionIndustry leading face manipulation platform
Ostensibly619 / Fooocus
Forked from lllyasviel/FooocusFocus on prompting and generating
Ostensibly619 / LLaVA
Forked from haotian-liu/LLaVA[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Ostensibly619 / flux
Forked from black-forest-labs/fluxOfficial inference repo for FLUX.1 models
FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, Comfy…
Ostensibly619 / aura-sr
Forked from fal-ai/aura-srAuraSR: GAN-based Super-Resolution for real-world
Ostensibly619 / CogVideo
Forked from zai-org/CogVideotext and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Tiled Diffusion, MultiDiffusion, Mixture of Diffusers, and optimized VAE
Ostensibly619 / open_clip
Forked from mlfoundations/open_clipAn open source implementation of CLIP.
Stable Diffusion web UI
Ostensibly619 / OmniGen
Forked from VectorSpaceLab/OmniGenOmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The …
Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
GPT4V-level open-source multi-modal model based on Llama3-8B
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Custom nodes pack for ComfyUI This custom node helps to conveniently enhance images through Detector, Detailer, Upscaler, Pipe, and more.