-
Updated
May 1, 2024 - Python
vision-language-pretraining
Here are 40 public repositories matching this topic...
🌋 A flexible framework for training and configuring Vision-Language Models
-
Updated
Jul 6, 2025 - Python
This project generates behavioral descriptions from images by combining computer vision and natural language processing. It goes beyond basic scene descriptions to infer human behaviors, intentions, and social contexts.
-
Updated
May 5, 2025 - Python
[KDD 2024] Improving the Consistency in Cross-Lingual Cross-Modal Retrieval with 1-to-K Contrastive Learning
-
Updated
Jul 18, 2024 - Python
Official code for CVPR2025 "Seeing What Matters: Empowering CLIP with Patch Generation-to-Selection"
-
Updated
Mar 20, 2025 - Python
This repository contains the implementation of AlignVLM paper, which proposes a novel method for vision language alignment
-
Updated
May 23, 2025 - Python
[MICCAI‘25 Early Accept] MAKE: Multi-Aspect Knowledge-Enhanced Vision-Language Pretraining for Zero-shot Dermatological Assessment
-
Updated
Jul 10, 2025 - Python
MICCAI 2024 Oral: Vision-Language Open-Set Detectors for Bone Fenestration and Dehiscence Detection from Intraoral Images
-
Updated
Apr 1, 2025 - Python
Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning
-
Updated
Dec 24, 2024 - Python
Unofficial implementation for Sigmoid Loss for Language Image Pre-Training
-
Updated
Sep 26, 2023 - Python
VTC: Improving Video-Text Retrieval with User Comments
-
Updated
Aug 11, 2025 - Python
Evaluate robustness of adaptation methods on large vision-language models
-
Updated
Aug 23, 2023 - Shell
A codebase for flexible and efficient Image Text Representation Alignment
-
Updated
Jun 20, 2023 - Python
SVL-Adapter: Self-Supervised Adapter for Vision-Language Pretrained Models
-
Updated
Jan 11, 2024 - Python
[ICCV'25 Highlight] Derm1M: A Million‑Scale Vision‑Language Dataset Aligned with Clinical Ontology Knowledge for Dermatology
-
Updated
Jul 25, 2025 - Python
[Science Advances] Demographic Bias of Vision-Language Foundation Models in Medical Imaging
-
Updated
Mar 28, 2025 - Python
[NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training
-
Updated
Dec 5, 2023 - Python
Bias-to-Text: Debiasing Unknown Visual Biases through Language Interpretation
-
Updated
May 21, 2023 - Python
Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)
-
Updated
May 15, 2023 - Python
Easy wrapper for inserting LoRA layers in CLIP.
-
Updated
Jun 16, 2024 - Python
Improve this page
Add a description, image, and links to the vision-language-pretraining topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the vision-language-pretraining topic, visit your repo's landing page and select "manage topics."