A command-line application to extract text from images, PDFs, and audio files using Apple's Vision and Speech APIs.