Face recognition with deep neural networks
Open Source OCR Engine
Port of OpenAI's Whisper model in C/C++
State-of-the-art 2D and 3D Face Analysis Project
Robust Speech Recognition via Large-Scale Weak Supervision
Awesome multilingual OCR toolkits based on PaddlePaddle
Contexts Optical Compression
Build your own AI friend
OCRmyPDF adds an OCR text layer to scanned PDF files
A Lightweight Face Recognition and Facial Attribute Analysis
Captcha solver extension for humans
Speech-to-text, text-to-speech, and speaker recognition
OpenVINO™ Toolkit repository
A GUI Agent app based on UI-TARS to control your computer using AI
Interactive video and image annotation tool for computer vision
Speech recognition module for Python
A pure Javascript Multilingual OCR
High-performance neural network inference framework for mobile
A free, open source, and extensible speech-to-text application
Qwen3-Coder is the code version of Qwen3
Open-Source Python3 tool for recognizing layouts, tables, and math
Open Source Computer Vision Library
The cross-platform open-source app built for handwriting
ZBar is an open source software suite for reading bar codes
Image polygonal annotation with Python