Stars
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
A beautiful, simple, clean, and responsive Jekyll theme for academics
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
Official implementation of Character Region Awareness for Text Detection (CRAFT)
A presenter console with multi-monitor support for PDF files.
CNN Image Retrieval in PyTorch: Training and evaluating CNNs for Image Retrieval in PyTorch
A PowerPoint add-in to insert LaTeX equations into PowerPoint presentations on Windows and Mac
Corruption and Perturbation Robustness (ICLR 2019)
This repository contains code for the paper "Decoupling Representation and Classifier for Long-Tailed Recognition", published at ICLR 2020
Curate, Annotate, and Manage Your Data in LightlyStudio.
End-to-end learning of deep visual representations for image retrieval
An efficient modular implementation of Associating Objects with Transformers for Video Object Segmentation in PyTorch
PyTorch Implementation for Deep Metric Learning Pipelines
[NeurIPS 2021] Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation
Repository of the paper "AnyUp: Universal Feature Upsampling".
The code for the CVPR2019 paper Bi-Directional Cascade Network for Perceptual Edge Detection
Official code for CVPR 2022 paper "Rethinking Visual Geo-localization for Large-Scale Applications"
Official Implementation of "CAT-Seg🐱: Cost Aggregation for Open-Vocabulary Semantic Segmentation"
ViewFormer: NeRF-free Neural Rendering from Few Images Using Transformers
ICCV 2023 Paper Global Features are All You Need for Image Retrieval and Reranking Official Repository
[ICML'24 Oral] "MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions"
Pytorch code of "Revisiting Local Descriptor based Image-to-Class Measure for Few-shot Learning", CVPR 2019.
TLDR is an unsupervised dimensionality reduction method that combines neighborhood embedding learning with the simplicity and effectiveness of recent self-supervised learning losses
Code for CVPR 2019 paper Label Propagation for Deep Semi-supervised Learning