State-of-the-art 2D and 3D Face Analysis Project
Robust Speech Recognition via Large-Scale Weak Supervision
Awesome multilingual OCR toolkits based on PaddlePaddle
Contexts Optical Compression
A Lightweight Face Recognition and Facial Attribute Analysis
OCRmyPDF adds an OCR text layer to scanned PDF files
Qwen3-Coder is the code version of Qwen3
Speech recognition module for Python
Image polygonal annotation with Python
Open-Source Python3 tool for recognizing layouts, tables, and math
Models for the spaCy Natural Language Processing (NLP) library
Industrial-strength Natural Language Processing (NLP)
A framework to enable multimodal models to operate a computer
An open and fair framework for everyone to build AI agents
Formula recognition based on LaTeX-OCR and ONNXRuntime
A high-quality tool for convert PDF to Markdown and JSON
CLI tool to extract (meta)data from PDF and manipulate PDF files
Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition
Crowdsourcing platform for full text transcription and tagging
Multilingual Automatic Speech Recognition with word-level timestamps
Training data (data labeling, annotation, workflow) for all data types
Toolkit for conversational AI
A PyTorch-based Speech Toolkit
Han Language Processing
Replace OpenAI GPT with another LLM in your app