Skip to content
View nttstar's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report nttstar

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Spark-TTS Inference Code

Python 10,823 1,157 Updated Apr 9, 2025

A set of nodes to edit videos using the Hunyuan Video model

Python 497 31 Updated Feb 21, 2025

Bring portraits to life in Real Time!onnx/tensorrt support!实时肖像驱动!

Python 1,025 104 Updated Jun 29, 2025

SkyReels V1: The first and most advanced open-source human-centric video foundation model

Python 2,598 295 Updated Mar 10, 2025
Python 6,805 1,150 Updated Nov 3, 2025

A general fine-tuning kit geared toward image/video/audio diffusion models.

Python 2,667 264 Updated Dec 19, 2025

Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"

Python 890 43 Updated Dec 18, 2025

The ultimate training toolkit for finetuning diffusion models

Python 8,371 978 Updated Dec 20, 2025

Scalable and memory-optimized training of diffusion models

Python 1,312 144 Updated Jun 4, 2025
Python 1,561 200 Updated Dec 21, 2025

Official code for "Seeing Faces in Things: A Model and Dataset for Pareidolia" ECCV 2024

Jupyter Notebook 23 Updated Sep 25, 2024

Official PyTorch implementation for the paper Generalizable Face Landmarking Guided by Conditional Face Warping (CVPR 2024).

Python 31 Updated Nov 21, 2024

Official implementation for the SIGGRAPH Asia 2024 paper SPARK: Self-supervised Personalized Real-time Monocular Face Capture

Python 385 20 Updated Jun 30, 2025

Numpy & PyTorch implementation of three algorithms of image deformation using moving least squares. http://dl.acm.org/citation.cfm?doid=1179352.1141920

Python 364 70 Updated Dec 19, 2023

Bring portraits to life!

Python 17,484 1,816 Updated Nov 16, 2025

DUSt3R: Geometric 3D Vision Made Easy

Python 6,831 716 Updated Sep 24, 2025

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 8,629 1,121 Updated Sep 14, 2024

Simple OAuth Component for Streamlit App

Python 214 35 Updated Oct 21, 2025

Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis

Python 1,535 149 Updated Jul 29, 2024

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,562 551 Updated Nov 10, 2025

Lip and hair color editor using face parsing maps.

Python 536 152 Updated Aug 4, 2021

Fine-Grained Subject-Specific Attribute Expression Control in T2I Models

Jupyter Notebook 134 13 Updated Feb 27, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 43,946 5,857 Updated Aug 16, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 53,343 5,839 Updated Dec 19, 2025

PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)

Jupyter Notebook 529 31 Updated Sep 8, 2025

SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.

Python 5,383 467 Updated May 12, 2025

OneTrainer is a one-stop solution for all your stable diffusion training needs.

Python 2,651 257 Updated Dec 21, 2025

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.

C++ 4,268 406 Updated Dec 19, 2025
Next