Stars
A set of nodes to edit videos using the Hunyuan Video model
Bring portraits to life in real time! ONNX/TensorRT support! Real-time portrait animation!
SkyReels V1: The first and most advanced open-source human-centric video foundation model
A general fine-tuning kit geared toward image/video/audio diffusion models.
Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"
The ultimate training toolkit for finetuning diffusion models
Scalable and memory-optimized training of diffusion models
Official code for "Seeing Faces in Things: A Model and Dataset for Pareidolia" ECCV 2024
Official PyTorch implementation for the paper Generalizable Face Landmarking Guided by Conditional Face Warping (CVPR 2024).
Official implementation for the SIGGRAPH Asia 2024 paper SPARK: Self-supervised Personalized Real-time Monocular Face Capture
NumPy & PyTorch implementation of three image deformation algorithms using moving least squares. http://dl.acm.org/citation.cfm?doid=1179352.1141920
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Simple OAuth Component for Streamlit App
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
Lip and hair color editor using face parsing maps.
Fine-Grained Subject-Specific Attribute Expression Control in T2I Models
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
As little as 1 minute of voice data can be used to train a good TTS model! (few-shot voice cloning)
PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
OneTrainer is a one-stop solution for all your Stable Diffusion training needs.
The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.