-
Carnegie Mellon University
- Pittsburgh, PA
- ivanw@andrew.cmu.edu
Stars
Scenic: A Jax Library for Computer Vision Research and Beyond
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Segment Anything combined with CLIP
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Experiment on combining CLIP with SAM to do open-vocabulary image segmentation.
[CVPR 2022 Oral] Official repository for "MAXIM: Multi-Axis MLP for Image Processing". SOTA for denoising, deblurring, deraining, dehazing, and enhancement.
[ECCV 2022] Official repository for "MaxViT: Multi-Axis Vision Transformer". SOTA foundation models for classification, detection, segmentation, image quality, and generative modeling...
Deploy some tools for training, testing, and visualization
Models and examples built with TensorFlow