Lists (11)
Sort Name ascending (A-Z)
Stars
MaxMind's GeoIP2 GeoLite2 Country, City, and ASN databases
ClickHouse® is a real-time analytics database management system
High performance, self-hosted, newsletter and mailing list manager with a modern dashboard. Single binary app.
OCR, layout analysis, reading order, table recognition in 90+ languages
georgmangold / console
Forked from minio/object-browserConsole is a Admin UI for MinIO® Object Storage Server 🖥️
Streamlink is a CLI utility which pipes video streams from various services into a video player
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
Whisper-Flamingo [Interspeech 2024] and mWhisper-Flamingo [IEEE SPL 2025] for Audio-Visual Speech Recognition and Translation
A software update framework for macOS
Massive open Japanese speech corpus
Py2/py3 script that can download macOS components direct from Apple
A Python module for getting the GPU status from NVIDA GPUs using nvidia-smi programmically in Python
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
This is a python API which allows you to get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles and it does not require an API key nor a headles…
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
Model for MDX23 music separation contest
Fair database benchmarks framework and datasets
Meta's "No Language Left Behind" models served as web app and REST API
Noise supression using deep filtering
Vocal Remover using Deep Neural Networks
Text to speech alignment using CTC forced alignment
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Command-line program to download videos from YouTube.com and other video sites
Experience macOS just like before
GPUd automates monitoring, diagnostics, and issue identification for GPUs