Starred repositories
Stable Diffusion web UI
The most powerful and modular diffusion model GUI, API and backend with a graph/nodes interface.
Official Code for DragGAN (SIGGRAPH 2023)
A modular graph-based Retrieval-Augmented Generation (RAG) system
Open-Sora: Democratizing Efficient Video Production for All
Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
Open standard for machine learning interoperability
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
Official implementation of AnimateDiff.
High-Resolution 3D Human Digitization from A Single Image.
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Depth-Aware Video Frame Interpolation (CVPR 2019)
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
TripoSR: Fast 3D Object Reconstruction from a Single Image
Keras model to generate HTML code from hand-drawn website mockups. Applies an image captioning architecture to hand-drawn source images.
State-of-the-art deep-learning-based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
RESTler is the first stateful REST API fuzzing tool for automatically testing cloud services through their REST APIs and finding security and reliability bugs in these services.
Komodo Edit is a fast and free multi-language code editor. Written in JS, Python, C++ and based on the Mozilla platform.
Ray tracing and hybrid rasterization of Gaussian particles
pix2pix3D: Generating 3D Objects from 2D User Inputs
Segment-Anything + 3D. Let's lift anything to 3D.
TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models
Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.