Stars
teddyCloud is an open source server replacement for the Boxine Cloud
A feature-rich command-line audio/video downloader
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and test)
OpenParliamentTV-Tools for parsing parliamentary data
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
Highly accurate text detection (OCR) JavaScript/TypeScript library that runs on Node.js, Browser, React Native and C++. Based on PaddleOCR and ONNX Runtime
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
ocr-docker is a small, Flask-powered web app that extracts text from images and PDF documents using OCR
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from Hugging Face.
Label Studio is a multi-type data labeling and annotation tool with standardized output format
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Telegram bot for self-hosted local inference of Stable Diffusion, text-to-speech and large language models, such as Llama 3
Create a visual search engine using TensorFlow Serving, Elasticsearch, Vue.js and nginx.
DeepFaceLab is the leading software for creating deepfakes.
Visualization toolbox for Sound Event Detection
The official gpt4free repository | various collection of powerful language models | opus 4.6 gpt 5.3 kimi 2.5 deepseek v3.2 gemini 3
Semantic Image Similarity Search in Elasticsearch
Fusion of feature pyramids for nucleus segmentation and cell segmentation
How do you copy text from images? The answer is TextSnatcher! Perform OCR operations in seconds on the Linux desktop.
Virtual Video Device for Background Replacement with Deep Semantic Segmentation
This repository provides starter code for using TensorBoard via TensorFlow to visualize embeddings
One line of code for data quality profiling and exploratory data analysis of Pandas and Spark DataFrames.