Universidade de São Paulo
- https://www.linkedin.com/in/andre-oliveira-francani/
Stars
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
A toolkit for developing and comparing reinforcement learning algorithms.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
OpenMMLab Detection Toolbox and Benchmark
PyTorch Tutorial for Deep Learning Researchers
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
End-to-End Object Detection with Transformers
A resource for learning about machine learning & deep learning
Implementation of Imagen, Google's Text-to-Image Neural Network, in PyTorch
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
A data augmentations library for audio, image, text, and video.
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
LightGlue: Local Feature Matching at Light Speed (ICCV 2023)
Python package for the evaluation of odometry and SLAM
SuperGlue: Learning Feature Matching with Graph Neural Networks (CVPR 2020, Oral)
User-friendly, commercial-grade software for processing aerial imagery.
A deep learning library for video understanding research.
The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…
pySLAM is a hybrid Python/C++ Visual SLAM pipeline supporting monocular, stereo, and RGB-D cameras. It provides a broad set of modern local and global feature extractors, multiple loop-closure stra…