Starred repositories
😘 Makes you "love" GitHub by fixing broken images and slow loading when accessing it. (No installation required)
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Enjoy the magic of Diffusion models!
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
Command-line program to download image galleries and collections from several image hosting sites
An open source implementation of CLIP.
FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, Comfy…
Very customizable imageboard/booru downloader with powerful filenaming features.
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
An extensible, replaceable Python algorithmic backtesting and trading framework supporting multiple securities
🗂️ A file list/WebDAV program that supports multiple storages, powered by Gin and Solidjs.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…
👁️ 🖼️ 🔥 PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...
APISR: Anime Production Inspired Real-World Anime Super-Resolution (CVPR 2024)
Stable Diffusion web UI
Model merge extension for Stable Diffusion web UI
State-of-the-art 2D and 3D Face Analysis Project
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
This node was designed to help AI image creators to generate prompts for human portraits.
Just 1 minute of voice data is enough to train a good TTS model! (few-shot voice cloning)
SD-Trainer: LoRA & DreamBooth training scripts and GUI built on kohya-ss's trainer, for diffusion models.
[NeurIPS 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation
[CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"