Starred repositories
CSGO: Content-Style Composition in Text-to-Image Generation 🔥
Official inference repo for FLUX.1 models
GenEval: An object-focused framework for evaluating text-to-image alignment
APISR: Anime Production Inspired Real-World Anime Super-Resolution (CVPR 2024)
Prevents the artifact that tends to occur with XL models
[WIP] Layer Diffusion for WebUI (via Forge)
Transparent Image Layer Diffusion using Latent Transparency
An intuitive GUI for GLIGEN that uses ComfyUI in the backend
Official Code for Stable Cascade
[ECCV 2024] The official code of paper "Open-Vocabulary SAM".
Unofficial implementation of InstantID for ComfyUI
Thin Custom Node wrapper for InstantID in ComfyUI.
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
Instance segmentation for cartoon/anime characters and some visual techniques building around it.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Unofficial implementation of PhotoMaker for ComfyUI
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…
Character Animation (AnimateAnyone, Face Reenactment)
model merge extention for stable diffusion web ui
Instant voice cloning by MIT and MyShell. Audio foundation model.