Stars
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Models and examples built with TensorFlow
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
No fortress, purely open ground. OpenManus is Coming.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
PyTorch Tutorial for Deep Learning Researchers
Open-Sora: Democratizing Efficient Video Production for All
A generative world for general-purpose robotics & embodied AI learning.
Generative Models by Stability AI
Official inference repo for FLUX.1 models
Image-to-Image Translation in PyTorch
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
Fast and memory-efficient exact attention
Rembg is a tool to remove images background
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
A Deep Learning based project for colorizing and restoring old images (and video!)
WebUI extension for ControlNet
Datasets, Transforms and Models specific to Computer Vision
PyTorch implementations of Generative Adversarial Networks.
Wan: Open and Advanced Large-Scale Video Generative Models
StyleGAN - Official TensorFlow Implementation
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Generate 3D objects conditioned on text or images
Enjoy the magic of Diffusion models!
Official implementation of AnimateDiff.