image-descriptions

Here are 21 public repositories matching this topic...

meng1994412 / CBIR

Content-Based Image Retrieval System

information-retrieval database computer-vision image-descriptions keypoints-detector

Updated Dec 18, 2018
Python

google / imageinwords

Data release for the ImageInWords (IIW) paper.

evaluation dataset image-captioning dataset-generation image-to-text image-descriptions image-text human-annotation t2i i2t detailed-descriptions detailed-annotations

Updated Nov 17, 2024
JavaScript

dhruvik-patel / image-description

Image Description, a machine learning project that generates a description of an image based on the activities and objects detected in it.

python flask machine-learning image tensorflow image-processing cnn lstm image-descriptions tflite-models image-descriptor

Updated Apr 8, 2024
CSS

antonio-f / Moondream

Testing the Moondream tiny vision model

tutorial artificial-intelligence image-captioning language-models image-descriptions hands-on huggingface-transformers vision-models vision-transformers running-locally tiny-models

Updated May 12, 2024
Jupyter Notebook

Pavansomisetty21 / Image-Caption-Generation-using-LLMs-GEMINI-

Generates captions for user-supplied images using prompt engineering and generative AI.

Updated Aug 24, 2024
Jupyter Notebook

aviralchharia / Neural-Image-Captioning

This project uses a deep recurrent architecture: a CNN (VGG-16) pretrained on ImageNet extracts a 4096-dimensional image feature vector, and an LSTM generates a caption from these feature vectors.

natural-language-processing computer-vision cnn lstm neural-networks image-captioning show-and-tell vgg16 bleu-score image-descriptions flickr8k-dataset feature-vectors

Updated Sep 9, 2020
Jupyter Notebook

DevExpress-Examples / office-file-api-ai-implementation

Integrate AI capabilities into a DevExpress-powered Office File API Web API application.

ai accessibility web-api devexpress image-descriptions word-processing office-file-api spreadsheet-document-api

Updated Feb 16, 2026
C#

baaivision / DenseFusion

DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception

vlm image-descriptions visual-perception mllm multimodal-large-language-models vision-language-models

Updated Dec 6, 2024
Python

jornalistainclusivo / descreve-ai

Alt-text generator powered by artificial intelligence

accessibility a11y image-descriptions alt-text ai-tool

Updated Jan 31, 2026
JavaScript

alterism / mastodon-alt-text

Experimenting with a dataset of alt-text usage across mastodon.social clients.

data-science university accessibility a11y university-project datascience mastodon image-descriptions fediverse alt-text alttext mastodon-social image-description aiss-master

Updated Dec 18, 2024
HTML

ShivaliGoel / Paper-Explanations

Key pointers and exhaustive notes for various machine learning research papers

machine-learning deep-neural-networks neural-networks image-captioning show-and-tell deep-learning-papers research-paper deep-learning-tutorial research-paper-explanation karpathy deep-visual-semantic-alignments image-descriptions

Updated Mar 16, 2017

ElenaChes / discordjs-image-reader

An AI-powered accessibility Discord bot written in Node.js, using Discord.js, MongoDB and the Google Gemini API. The image-reader bot is designed to describe and transcribe images for the visually impaired, and features a manual image-reading command and an auto-reading system.

nodejs bot mongodb discord discordjs discord-bot gemini discord-js image-to-text image-descriptions gemini-api image-descriptor ai-powered discord-ai-bot

Updated Mar 3, 2026
JavaScript

mariliafernandez / hilbert-curves-descriptor

Final-year project for the Computer Engineering degree (UTFPR): an image descriptor based on Hilbert curves

computer-vision image-processing descriptors image-descriptions

Updated Aug 26, 2021
Jupyter Notebook

mrunmayimahajan12 / hoo-hacks-screen-reader

Smart Screen Reader: a screen-reader Chrome extension for visually impaired people. It enhances web accessibility by generating image descriptions using AI, summarizing entire pages, and allowing users to ask questions and automatically scroll to the relevant sections.

chrome-extension text-to-speech accessibility web-speech-api image-descriptions openai-api manifest-v3 conversational-assistant visually-impaired-people openai-vision ask-a-question voice-based-webpage-scrolling

Updated Mar 30, 2025
JavaScript

TatjanaChernenko / image_description_generation

Natural language generation from structured inputs. Focuses on generating natural language descriptions for images by exploring the relationship between textual descriptions and image attributes. Using an encoder-decoder architecture with LSTM cells, the system transforms normalized vector representations of attributes into a fixed-length vector.

data-science lstm image-captioning natural-language-generation lstm-neural-networks encoder-decoder image-descriptions

Updated Jan 7, 2024
Jupyter Notebook

tsdicloud / markitdown-arch-plugin

A #markitdown-plugin specialized for tech specs, for example enhanced Markdown with image descriptions and more

image-descriptions markitdown-plugin llm-multimodal

Updated Dec 2, 2025
Python

Slayer412 / docling-bedrock-plugin

Integrates AWS Bedrock's multimodal capabilities (Claude 3) into the Docling framework for generating image descriptions within document processing pipelines.

python image-descriptions document-processing-pipeline aws-bedrock docling