Stars
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
(CVPR 2025) Adversarial Diffusion Compression for Real-World Image Super-Resolution [PyTorch]
Inference server benchmarking tool
A peer-to-peer database that spans devices. For apps and agents that work everywhere.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Using Low-rank adaptation to quickly fine-tune diffusion models.
Python bindings for FFmpeg - with complex filtering support
EVA Series: Visual Representation Fantasies from BAAI
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Stable Diffusion web UI
Rembg is a tool to remove images background
This is the repo for our new project Highly Accurate Dichotomous Image Segmentation
Machine Learning and Computer Vision Engineer - Technical Interview Questions
Simplest working implementation of Stylegan2, state of the art generative adversarial network, in Pytorch. Enabling everyone to experience disentanglement
Simple image captioning model
This is the official repo of Panoptic SegFormer [CVPR'22]
Datasets, Transforms and Models specific to Computer Vision
TRACER: Extreme Attention Guided Salient Object Tracing Network (AAAI 2022) implementation in PyTorch
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."