Romeo-Gumayagay/AI-Glasses

💥 Guide Me Glasses (GMG) 💥

Bring the future to your eyes 👓

An AI-powered smart glasses system that assists visually impaired users in real time through computer vision, natural language processing, and a set of AI APIs.

🎯 Project Overview

Guide Me Glasses (GMG) is a comprehensive assistive technology project that enables blind and visually impaired users to:

  • Read Text: OCR (Optical Character Recognition) for reading text from images
  • Get Information: Access Wikipedia, news, weather, and time information via voice commands
  • Identify People: Face detection and recognition to identify known individuals
  • Describe Images: Generate semantic descriptions of pictures using image captioning
  • Recognize Objects: Barcode scanning, money detection, and hand sign recognition

πŸ—οΈ Prototype Architecture

architecture

πŸ”¨ Materials

matriels

📋 Features

Core APIs

  • Face API: Face detection and recognition with multiple algorithms
  • OCR API: Text extraction from images using Tesseract and CTPN
  • Speech API: Text-to-speech and speech-to-text conversion
  • Translate API: Multi-language translation support
  • Info API: Wikipedia, news, weather, and time information
  • Caption API: Image description generation
  • Barcode API: Barcode detection and reading
  • Money API: Currency recognition
  • Hand Sign API: Sign language recognition

🚀 Installation

Prerequisites

  • Python 3.5.5 or higher
  • Required dependencies (see requirement.txt)

Setup

Windows:

python setup_win.py

Linux:

python setup_linux.py

Manual Installation:

pip install -r requirement.txt

πŸ“ Usage

Command options:

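  • -t : task to run (for example face_detection, wiki, ocr)
  • -m : face detection mode (opencv_haar, dlib_hog, dlib_cnn, mtcnn, mobilenet_ssd)
  • -i : input object (image path, search query, or article number)
  • -l : output language (en / fr)
  • -country, -city : location for the weather task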
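
The options above could be parsed along these lines. This is a minimal sketch using the standard argparse module, not the actual parser in gmg.py: the `build_parser` name, the `dest` names, and the default language of `en` are assumptions made for illustration, and `-country` / `-city` are taken from the weather example further down.

```python
import argparse

# Hypothetical reconstruction of the gmg.py command line; the real parser may differ.
MODES = ["opencv_haar", "dlib_hog", "dlib_cnn", "mtcnn", "mobilenet_ssd"]
LANGUAGES = ["en", "fr"]


def build_parser():
    parser = argparse.ArgumentParser(prog="gmg.py")
    parser.add_argument("-t", dest="task", required=True,
                        help="task to run, e.g. face_detection, wiki, ocr")
    parser.add_argument("-m", dest="mode", choices=MODES,
                        help="face detection mode")
    parser.add_argument("-i", dest="input",
                        help="image path, search query, or article number")
    # Assumption: English is used when -l is omitted.
    parser.add_argument("-l", dest="language", choices=LANGUAGES, default="en",
                        help="output language")
    # Options that appear only in the weather example.
    parser.add_argument("-country", dest="country", help="country for the weather task")
    parser.add_argument("-city", dest="city", help="city for the weather task")
    return parser


# Parse an explicit argument list instead of sys.argv so the sketch runs anywhere.
args = build_parser().parse_args(
    ["-t", "face_detection", "-m", "opencv_haar", "-i", "image.png", "-l", "en"]
)
print(args.task, args.mode, args.input, args.language)
```

Restricting `-m` and `-l` with `choices` makes argparse reject unknown modes or languages before any task code runs.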

1) Face Detection

python gmg.py -t face_detection -m opencv_haar -i image.png -l en

2) Face Recognition

Recognize person:

python gmg.py -t face_recognition -i image.png -l en

Add person face to dataset:

python gmg.py -t add_face -i image.png -l fr

Initialize dataset with existing images:

python gmg.py -t face_init
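
As an illustration of how a recognized face might be matched against the dataset, here is a sketch of nearest-neighbour matching over face embeddings. It is an assumption about the general technique, not the project's confirmed method: the `identify` function, the `known_faces` dictionary, the toy 3-dimensional vectors, and the `0.6` threshold are all hypothetical.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def identify(embedding, known_faces, threshold=0.6):
    """Return the name of the closest known face, or None if none is close enough.

    known_faces maps a person's name to a reference embedding (hypothetical layout).
    """
    best_name, best_distance = None, float("inf")
    for name, reference in known_faces.items():
        distance = euclidean(embedding, reference)
        if distance < best_distance:
            best_name, best_distance = name, distance
    return best_name if best_distance <= threshold else None


# Toy dataset; real embeddings would come from a face recognition model.
known_faces = {"alice": [0.1, 0.2, 0.3], "bob": [0.9, 0.8, 0.7]}
print(identify([0.12, 0.21, 0.29], known_faces))  # close to alice's reference
print(identify([5.0, 5.0, 5.0], known_faces))     # far from everyone, so None
```

Under this scheme, adding a person amounts to storing one more name and embedding pair, and the threshold controls how readily an unknown face is reported as a known person.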

3) Wiki API

Get information about anything:

python gmg.py -t wiki -i obama -l fr

4) News API

Get the 10 latest articles from CNN:

python gmg.py -t news_latest -l en

Get specific article (1-10) from latest CNN articles:

python gmg.py -t news_article -i 3 -l en
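
Because `-i` takes a 1-based article number here, the lookup needs a range check and an index shift. The sketch below shows one way to do that; `select_article` and the placeholder headlines are hypothetical and do not come from the project.

```python
def select_article(articles, position):
    """Return the article at a 1-based position, such as the 3 in `-i 3`."""
    index = int(position)  # command-line values arrive as strings
    if not 1 <= index <= len(articles):
        raise ValueError(
            "article number must be between 1 and {}".format(len(articles))
        )
    return articles[index - 1]


# Placeholder data standing in for the 10 latest headlines.
latest = ["Headline {}".format(n) for n in range(1, 11)]
print(select_article(latest, "3"))
```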

5) Weather API

Get weather description for a specified city:

python gmg.py -t weather -country canada -city vancouver -l en

6) Time API

Get current date:

python gmg.py -t date -l en

Get current time:

python gmg.py -t time -l en
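
Since the answer is spoken in English or French, the date and time have to be phrased per language. The following is a sketch only: `spoken_date`, `spoken_time`, and the exact phrasing are assumptions, and month names are hard-coded so the result does not depend on the system locale.

```python
from datetime import datetime

MONTHS = {
    "en": ["January", "February", "March", "April", "May", "June", "July",
           "August", "September", "October", "November", "December"],
    "fr": ["janvier", "février", "mars", "avril", "mai", "juin", "juillet",
           "août", "septembre", "octobre", "novembre", "décembre"],
}


def spoken_date(moment, language="en"):
    """Format a date as a phrase suitable for text-to-speech."""
    month = MONTHS[language][moment.month - 1]
    if language == "fr":
        day = "1er" if moment.day == 1 else str(moment.day)
        return "{} {} {}".format(day, month, moment.year)
    return "{} {}, {}".format(month, moment.day, moment.year)


def spoken_time(moment, language="en"):
    """Format a time as 12-hour English or 24-hour French."""
    if language == "fr":
        return "{} heures {:02d}".format(moment.hour, moment.minute)
    hour12 = moment.hour % 12 or 12
    suffix = "AM" if moment.hour < 12 else "PM"
    return "{}:{:02d} {}".format(hour12, moment.minute, suffix)


now = datetime.now()
print(spoken_date(now, "en"), "/", spoken_date(now, "fr"))
print(spoken_time(now, "en"), "/", spoken_time(now, "fr"))
```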

7) OCR API

Read text from input image:

python gmg.py -t ocr -i image_path -l en

πŸ› οΈ Technologies Used

  • Computer Vision: OpenCV, dlib, MTCNN, MobileNet SSD
  • OCR: Tesseract, CTPN (Connectionist Text Proposal Network)
  • NLP: Natural Language Processing for text understanding
  • Speech: Text-to-Speech and Speech-to-Text engines
  • APIs: Wikipedia, News, Weather, Translation services
  • Machine Learning: Face recognition, object detection, image captioning

📚 Documentation Links

🎯 Use Cases

  • Accessibility: Assist visually impaired users in daily tasks
  • Navigation: Help identify objects, people, and text in the environment
  • Information Access: Provide real-time access to news, weather, and general knowledge
  • Communication: Enable interaction through voice commands and responses

🤝 Contributing

Contributions are welcome! This project aims to improve accessibility for visually impaired individuals.

📧 About This Project

This project is developed and maintained by Romeo Gumayagay as an innovative assistive technology solution.

For questions, suggestions, or contributions, feel free to reach out.

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.


GuideMeGlasses 👓 - Empowering visually impaired individuals through AI technology
