Bring the future to your eyes π
An innovative AI-powered smart glasses system designed to help visually impaired individuals by providing real-time assistance through computer vision, natural language processing, and various AI APIs.
Guide Me Glasses (GMG) is a comprehensive assistive technology project that enables blind and visually impaired users to:
- Read Text: OCR (Optical Character Recognition) for reading text from images
- Get Information: Access Wikipedia, news, weather, and time information via voice commands
- Identify People: Face detection and recognition to identify known individuals
- Describe Images: Generate semantic descriptions of pictures using image captioning
- Recognize Objects: Barcode scanning, money detection, and hand sign recognition
- Face API: Face detection and recognition with multiple algorithms
- OCR API: Text extraction from images using Tesseract and CTPN
- Speech API: Text-to-speech and speech-to-text conversion
- Translate API: Multi-language translation support
- Info API: Wikipedia, news, weather, and time information
- Caption API: Image description generation
- Barcode API: Barcode detection and reading
- Money API: Currency recognition
- Hand Sign API: Sign language recognition
- Python 3.5.5 or higher
- Required dependencies (see
requirement.txt)
Windows:
python setup_win.pyLinux:
python setup_linux.pyManual Installation:
pip install -r requirement.txtCommand options:
-t: task-m: mode (opencv_haar, dlib_hog, dlib_cnn, mtcnn, mobilenet_ssd)-i: input object (image path or query)-l: language (en / fr)
python gmg.py -t face_detection -m opencv_haar -i image.png -l enRecognize person:
python gmg.py -t face_recognition -i image.png -l enAdd person face to dataset:
python gmg.py -t add_face -i image.png -l frInitialize dataset with existing images:
python gmg.py -t face_initGet information about anything:
python gmg.py -t wiki -i obama -l frGet latest 10 articles from CNN:
python gmg.py -t news_latest -l enGet specific article (1-10) from latest CNN articles:
python gmg.py -t news_article -i 3 -l enGet weather description for a specified city:
python gmg.py -t weather -country canada -city vancouver -l enGet current date:
python gmg.py -t date -l enGet current time:
python gmg.py -t time -l enRead text from input image:
python gmg.py -t ocr -i image_path -l en- Computer Vision: OpenCV, dlib, MTCNN, MobileNet SSD
- OCR: Tesseract, CTPN (Connectionist Text Proposal Network)
- NLP: Natural Language Processing for text understanding
- Speech: Text-to-Speech and Speech-to-Text engines
- APIs: Wikipedia, News, Weather, Translation services
- Machine Learning: Face recognition, object detection, image captioning
- Free Computer Science Books
- Natural Language Processing (NLP)
- Image Caption
- Face Recognition Library (dlib)
- Face Detection on Raspberry Pi
- Open Source ChatBot Library (Rasa)
- Accessibility: Assist visually impaired users in daily tasks
- Navigation: Help identify objects, people, and text in the environment
- Information Access: Provide real-time access to news, weather, and general knowledge
- Communication: Enable interaction through voice commands and responses
Contributions are welcome! This project aims to improve accessibility for visually impaired individuals.
This project is developed and maintained by Romeo Gumayagay as an innovative assistive technology solution.
For questions, suggestions, or contributions, feel free to reach out:
- Email: rcgj.777@gmail.com
- GitHub: Romeo-Gumayagay
This project is licensed under the MIT License - see the LICENSE file for details.
GuideMeGlasses π - Empowering visually impaired individuals through AI technology