Skip to content
#

coco-dataset

Here are 170 public repositories matching this topic...

VisionDriveX is a multi-task autonomous driving perception system that performs traffic sign classification, stop-sign detection, and lane segmentation. Built with PyTorch and explainable AI (Grad-CAM), it delivers real-time, interpretable road understanding for safety-critical ADAS applications.

  • Updated Jan 28, 2026
  • Python

Image captioning project with a deep learning model. Given an image, it is processed by an encoder CNN and then passed to a decoder RNN. The decoder outputs a text caption of the image content. The work is based on the paper "Show and Tell" by Vinyals et al. Pytorch is used for the implementation and the COCO dataset for training.

  • Updated Oct 14, 2022
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the coco-dataset topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the coco-dataset topic, visit your repo's landing page and select "manage topics."

Learn more