caption-generation

Here are 56 public repositories matching this topic...

Larasx1S / Generate-Context-Aware-Captions-from-Photos

🌍 Generate rich, context-aware captions from images by integrating location, events, and dates for more informative and meaningful descriptions.

training deep-neural-networks ai new context neural-networks context-aware captioning-images new-york-times captioning caption-generation caption-generator goodnews multi-gpu-training generative-ai blip2

Updated Feb 8, 2026
Python

InternLM / CapRL

Star

(ICLR 2026) An official implementation of "CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning"

image-captioning multi-modal caption-generation llm vision-language-model large-vision-language-models grpo rlvr

Updated Jan 26, 2026
Python

tharun-ship-it / image-to-text-generator

Star

🖼️BLIP-powered Image-to-Text Generator achieving 136.7 CIDEr score on TextCaps benchmark (142K captions, 28K images). 129M parameter model with batch processing (25 images), ZIP export with embedded captions & conditional captioning. Live Streamlit demo for instant AI-powered captioning.

machine-learning natural-language-processing computer-vision deep-learning transformers python3 pytorch image-captioning image-to-text blip multimodal-learning caption-generation huggingface streamlit vision-language-model textcaps

Updated Jan 7, 2026
Python

official-imvoiid / Joycaption

Star

Joycaption optimized for windows

windows flux aiml captions captioning-images batfile caption-generation

Updated Dec 23, 2025
Python

kaushal07wick / Captions

Star

Automatic Caption Generation for YT shorts and raw videos.

ai captioning-images captioning-videos caption-generation caption-generator

Updated Dec 14, 2025
Python

navaneet625 / RealTimeVQACaptioning

Star

A real-time image captioning and visual question answering (VQA) system. This project uses computer vision and NLP to generate descriptive captions for images and answer user questions about them.

Updated Nov 26, 2025
Python

Doga0 / Caption-Generator

Star

Caption generator using Vision Language Models and vLLM

dataset-generation caption-generation vision-language-model vllm

Updated Nov 17, 2025
Python

DragonDiffusionbyBoyo / qwen2vl-captioner-gui-BulkFolder

Star

Qwen Uncensored Image Captioner

image dataset-generation captioning-images caption-generation uncensored-captions dataset-captions

Updated Nov 5, 2025
Python

Ali-Shariati-Najafabadi / Generate-Context-Aware-Captions-from-Photos

Star

Context-Aware Image Captioning with BLIP-2

training photos deep-neural-networks ai new context neural-networks deeplearning context-aware captioning-images new-york-times captioning caption-generation caption-generator goodnews multi-gpu-training generative-ai blip2

Updated Oct 8, 2025
Python

BrijeshRakhasiya / Video-Caption-Generator

Star

Transcribe videos and generate captions using Whisper and FFmpeg with a Streamlit UI

ffmpeg transcription whisper captioning caption-generation streamlit

Updated Oct 3, 2025
Python

photoprism / photoprism-vision

Sponsor

Star

Computer Vision Playground ⚡️

ai computer-vision image-classification caption-generation photoprism nsfw-detection large-language-models

Updated Oct 1, 2025
Python

naqashafzal / AI-Content-Studio

Star

A 100% free & open-source AI Content Automation Tool that writes scripts, generates voiceovers, creates videos, and uploads them automatically — hands-free YouTube growth powered by AI.

ai caption-generation facebook-automation youtube-automation ai-content-generation free-video-generator

Updated Sep 18, 2025
Python

uninterruptedpowersupply3-NEW / Sigma-Captioner

Star

A some what optimized implementation of some light weight and popular models

git gui optimized vqa tagger dataset-generation clip blip visual-question-answering caption-generation huggingface-transformers llava moondream vision-language-models florence-2 smolvlm

Updated Sep 12, 2025
Python

BilalAhmadKhanKhattak / JustInCase

Star

JustInCase is a tool that generates .srt subtitles from any given video or audio file. It uses AI (Whisper model) to generate captions

subtitles caption-generation subtitles-generator bilred

Updated Aug 27, 2025
Python

tomash-dev / Image-Caption-Generator

Star

A neural network to generate captions for an image using CNN and RNN with BEAM Search.

Updated Jul 18, 2025
Python

Vinventive / live-captions-vr

Star

Accessibility-focused SteamVR Overlay improving communication between deaf, hard-of-hearing, and hearing users in VR. It is leveraging AI allowing users to see real-time speech transcription in their 3D space. DISCLAIMER: Voice recognition technology is prone to errors and project should not be used as a replacement for medical hearing aid.

ai vr captions subtitles hearing-loss vr-overlay live-captions caption-generation hearing-impaired hearing-aid whisper-v3-turbo

Updated May 22, 2025
Python

khushalimakani / image-captioning

Star

Captionify: Describing Images with AI An AI-powered image captioning system that uses CNNs and LSTMs to generate human-like captions for images. Trained on the Flickr8k dataset and evaluated with BLEU scores, it bridges computer vision and natural language processing for real-world applications like accessibility, social media, and e-commerce.

natural-language-processing deep-learning image-recognition caption-generation streamlit-webapp modeltraining