Highlights
- Pro
Stars
[CVPR 2023] OneFormer: One Transformer to Rule Universal Image Segmentation
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023
Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022
[NIVT Workshop @ ICCV 2023] SeMask: Semantically Masked Transformers for Semantic Segmentation
[CVPR 2022] VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-Resolution
[ECCV 2022] "SinNeRF: Training Neural Radiance Fields on Complex Scenes from a Single Image", Dejia Xu, Yifan Jiang, Peihao Wang, Zhiwen Fan, Humphrey Shi, Zhangyang Wang
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
A curated list of awesome self-supervised methods
A one-stop repository for low-code easily-installable object detection pipelines.
[CVPR 2020 & 2021 & 2022 & 2023] Agriculture-Vision Dataset, Prize Challenge and Workshop: A joint effort with many great collaborators to bring Agriculture and Computer Vision/AI communities toget…
A curated list of resources for Learning with Noisy Labels
Script to remotely check GPU servers for free GPUs
TorchCV: A PyTorch-Based Framework for Deep Learning in Computer Vision
🤘 awesome-semantic-segmentation
Self-similarity Grouping: A Simple Unsupervised Cross Domain Adaptation Approach for Person Re-identification (ICCV 2019, Oral)
Revisiting RCNN: On Awakening the Classification Power of Faster RCNN (ECCV 2018)
AlignSeg: Feature-Aligned Segmentation Networks (TPAMI 2021)
Product Studio | Cornell Tech | Fall 2016 📰
This repo is the source code of Acemap-Paper X-Ray, which is a system to evaluate the look of your uploaded paper for a specific conference.
A deep learning model for detecting fire in video and camera streams
PyTorch original implementation of Cross-lingual Language Model Pretraining.
Real-time Action detection demo for the work Actor Conditioned Attention Maps. This repo includes a complete pipeline for person detection/tracking and analyzing their actions in real-time.
A parser for Google Scholar, written in Python
🦋A PyTorch implementation of BigGAN with pretrained weights and conversion scripts.
Book about interpretable machine learning
A collection of important graph embedding, classification and representation learning papers with implementations.