ImageCLEF 2023
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and video, up to 5x faster than OpenAI CLIP and LLaVA
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
An open-source framework for training large multimodal models.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Repository with a solution for the ImageCLEF 2023 Challenge task Medical Visual Question Answering for GI (MEDVQA-GI).