Stars
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Python tool for converting files and office documents to Markdown.
Official inference repo for FLUX.1 models
The ultimate training toolkit for finetuning diffusion models
Unlock the fullest potential of your device
The #1 open-source voice interface for desktop, mobile, and ESP32 chips.
Dead simple FLUX LoRA training UI with LOW VRAM support
Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion.
[CVPR 2025] MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation
The repo for "Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator"
From comfyui workflow to web app, in seconds
This NVIDIA RAG blueprint serves as a reference solution for a foundational Retrieval Augmented Generation (RAG) pipeline.
The official implementation of "GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation". (CVPR 2025)
KeyForge3D is an app that turns a photo of a key into a 3D-printable STL file. Ideal for locksmiths and hobbyists, it analyzes the key's bitting pattern using image processing and generates an accu…
[CVPR 2025 Highlight] Generative Photography: Scene-Consistent Camera Control for Realistic Text-to-Image Synthesis
Prodigy and Schedule-Free, together at last.