Starred repositories
Enjoy the magic of Diffusion models!
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Official PyTorch implementation of StyleGAN3
A extendable, replaceable Python algorithmic backtest && trading framework supporting multiple securities
SD-Trainer. LoRA & Dreambooth training scripts & GUI use kohya-ss's trainer, for diffusion model.
[Open Source]. The improved version of AnimeGAN. Landscape photos/videos to anime
Single Image to 3D using Cross-Domain Diffusion for 3D Generation
📷 EasyPhoto | Your Smart AI Photo Generator.
LSTM built using Keras Python package to predict time series steps and sequences. Includes sin wave and stock market data
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
Official implementations for paper: Anydoor: zero-shot object-level image customization
[WIP] Layer Diffusion for WebUI (via Forge)
Character Animation (AnimateAnyone, Face Reenactment)
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Improved AnimateDiff for ComfyUI and Advanced Sampling Support
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Resources and Implementations of Generative Adversarial Nets: GAN, DCGAN, WGAN, CGAN, InfoGAN
Unofficial Implementation of Animate Anyone
Image to prompt with BLIP and CLIP
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...
Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…
Download images from Google, Bing, Baidu. 谷歌、百度、必应图片下载.