AV
Beautiful visualizations of how language differs among document types.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
A Python Perceptual Image Hashing Module
Lightweight Python library for adding real-time multi-object tracking to any detector.
A deep learning library for video understanding research.
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
A series of convenience functions to make basic image processing operations such as translation, rotation, resizing, skeletonization, and displaying Matplotlib images easier with OpenCV and Python.
Code examples of Image Processing on Python, using OpenCV and other libraries
A Unified Toolkit for Deep Learning Based Document Image Analysis
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
OCR-D compliant toolset for optical layout recognition on historical german-language documents published in Brazil
This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified and returned. Tables are retrieved formatted as a CSV.
Fixes mojibake and other glitches in Unicode text, after the fact.
Library to scrape and clean web pages to create massive datasets.
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, …
A foundational library for Semantic Hypergraphs
👷♂️ A simple package for extracting useful features from character objects 👷♀️
[ICCV 2023] Tracking Anything with Decoupled Video Segmentation
Cross-platform, customizable ML solutions for live and streaming media.
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
A High-performance cross-platform Video Processing Python framework powerpacked with unique trailblazing features 🔥
🎥 Python and OpenCV-based scene cut/transition detection program & library.
📝 An awesome Data Science repository to learn and apply for real world problems.
Machine Learning for Cyber Security
A python library built to empower developers to build applications and systems with self-contained Computer Vision capabilities
An open-source computer vision framework to build and deploy apps in minutes
An editing tool that uses AI to transcribe, understand content and search for anything in your footage, integrated with ChatGPT and other AI models