alighofrani95/pytextractor

 
 


pytextractor

Python OCR using Tesseract with the EAST OpenCV text detector

Uses the EAST OpenCV text detector together with pytesseract to extract text (the default) or numbers from images.
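
The detect-then-recognize flow described above can be sketched roughly as follows. This is not the project's own code: it swaps in OpenCV's high-level `cv2.dnn.TextDetectionModel_EAST` wrapper (available in recent OpenCV 4.x releases) instead of decoding the EAST output maps by hand. The names `pad_box` and `extract_text` are hypothetical, and reading the percentage option as "percent of the box size added on each side" is an assumption.

```python
def pad_box(x, y, w, h, percent, img_w, img_h):
    """Grow (or shrink, if percent < 0) a box by a percentage of its size.

    Assumption: the percentage is applied to each side of the box.
    Returns (x0, y0, x1, y1) clamped to the image bounds.
    """
    dx = int(round(w * percent / 100.0))
    dy = int(round(h * percent / 100.0))
    x0 = max(0, x - dx)
    y0 = max(0, y - dy)
    x1 = min(img_w, x + w + dx)
    y1 = min(img_h, y + h + dy)
    return x0, y0, x1, y1


def extract_text(image_path, east_path, confidence=0.5, size=(320, 320),
                 percent=2.0, numbers_only=False):
    """Detect text regions with EAST, then OCR each region with Tesseract."""
    # Imported here so pad_box stays usable without the OCR dependencies.
    import cv2
    import numpy as np
    import pytesseract

    image = cv2.imread(image_path)
    if image is None:
        raise FileNotFoundError(image_path)
    img_h, img_w = image.shape[:2]

    detector = cv2.dnn.TextDetectionModel_EAST(east_path)
    detector.setConfidenceThreshold(confidence)
    detector.setNMSThreshold(0.4)
    # EAST expects input sides that are multiples of 32 and ImageNet mean values.
    detector.setInputParams(scale=1.0, size=size,
                            mean=(123.68, 116.78, 103.94), swapRB=True)

    quads, _scores = detector.detect(image)

    # --psm 7 treats each crop as a single line of text.
    config = "--psm 7"
    if numbers_only:
        config += " -c tessedit_char_whitelist=0123456789"

    results = []
    for quad in quads:
        x, y, w, h = cv2.boundingRect(np.asarray(quad, dtype=np.int32))
        x0, y0, x1, y1 = pad_box(x, y, w, h, percent, img_w, img_h)
        crop = image[y0:y1, x0:x1]
        if crop.size == 0:
            continue  # skip degenerate boxes
        crop_rgb = cv2.cvtColor(crop, cv2.COLOR_BGR2RGB)
        text = pytesseract.image_to_string(crop_rgb, config=config).strip()
        results.append(((x0, y0, x1, y1), text))
    return results
```

Calling `extract_text("sign.png", "east_model.pb", numbers_only=True)` (both file names are placeholders) would return a list of `(box, text)` pairs, provided OpenCV, NumPy, pytesseract, and a Tesseract install are available.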

usage: text_detection.py [-h] [-east EAST] [-c CONFIDENCE] [-w WIDTH]
                         [-e HEIGHT] [-d] [-n] [-p PERCENTAGE]
                         images [images ...]

Text/Number extractor from image

positional arguments:
  images                path(s) to input image(s)

optional arguments:
  -h, --help            show this help message and exit
  -east EAST, --east EAST
                        path to input EAST text detector 
  -c CONFIDENCE, --confidence CONFIDENCE
                        minimum probability required to inspect a region[0.5]
  -w WIDTH, --width WIDTH
                        resized image width (should be multiple of 32)[320]
  -e HEIGHT, --height HEIGHT
                        resized image height (should be multiple of 32)[320]
  -d, --display         Display bounding boxes
  -n, --numbers         Detect only numbers
  -p PERCENTAGE, --percentage PERCENTAGE
                        Expand/shrink detected bound box[2.0]
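
For readers who want to reuse the same command-line interface, the help text above can be reproduced with a small `argparse` setup. This is a sketch reconstructed from the usage output, not the project's source: the argument types are assumptions, the defaults are taken from the bracketed values in the help text, and `snap_to_multiple_of_32` is a hypothetical helper illustrating the "multiple of 32" hint.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Rebuild the options shown in the usage text (types are assumed)."""
    parser = argparse.ArgumentParser(
        prog="text_detection.py",
        description="Text/Number extractor from image",
    )
    parser.add_argument("images", nargs="+", help="path(s) to input image(s)")
    parser.add_argument("-east", "--east",
                        help="path to input EAST text detector")
    parser.add_argument("-c", "--confidence", type=float, default=0.5,
                        help="minimum probability required to inspect a region[0.5]")
    parser.add_argument("-w", "--width", type=int, default=320,
                        help="resized image width (should be multiple of 32)[320]")
    parser.add_argument("-e", "--height", type=int, default=320,
                        help="resized image height (should be multiple of 32)[320]")
    parser.add_argument("-d", "--display", action="store_true",
                        help="Display bounding boxes")
    parser.add_argument("-n", "--numbers", action="store_true",
                        help="Detect only numbers")
    parser.add_argument("-p", "--percentage", type=float, default=2.0,
                        help="Expand/shrink detected bound box[2.0]")
    return parser


def snap_to_multiple_of_32(value: int) -> int:
    """Round down to the nearest multiple of 32, never going below 32."""
    return max(32, (value // 32) * 32)


if __name__ == "__main__":
    args = build_parser().parse_args()
    width = snap_to_multiple_of_32(args.width)
    height = snap_to_multiple_of_32(args.height)
    print(f"{len(args.images)} image(s), resize to {width}x{height}, "
          f"numbers only: {args.numbers}")
```

Rounding down rather than rejecting the value is a design choice made here for illustration; the original script may instead expect the caller to pass a valid size.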
