Face recognition with deep neural networks
Open Source OCR Engine
Port of OpenAI's Whisper model in C/C++
State-of-the-art 2D and 3D Face Analysis Project
Robust Speech Recognition via Large-Scale Weak Supervision
Awesome multilingual OCR toolkits based on PaddlePaddle
Contexts Optical Compression
Build your own AI friend
OCRmyPDF adds an OCR text layer to scanned PDF files
Captcha solver extension for humans
A Lightweight Face Recognition and Facial Attribute Analysis
Speech-to-text, text-to-speech, and speaker recognition
OpenVINO™ Toolkit repository
A GUI Agent app based on UI-TARS to control your computer using AI
Interactive video and image annotation tool for computer vision
Speech recognition module for Python
A pure JavaScript Multilingual OCR
High-performance neural network inference framework for mobile
A free, open source, and extensible speech-to-text application
Qwen3-Coder is the code version of Qwen3
Open-Source Python3 tool for recognizing layouts, tables, and math
Open Source Computer Vision Library
ZBar is an open source software suite for reading bar codes
The cross-platform open-source app built for handwriting
Image polygonal annotation with Python