Lists (32)
ASR
Audio Datasets
Audio Embeddings
CV
DeepFake
Denoisers
Diarization
DiffSinger
Dropbox
Fast Python
Image generation
Image enhance
INFERENCE
LLM, AI-AGENTS
ML
Multiprocessing
Music classification
Music enhance
Music generation
Music Loop
Music tagging
Photo animation
Scraping
Silence detection
Study
SVC
SVS
Tests
Text summarization
Time Series
Time stretch
Vocal detection
Starred repositories
The most intuitive desktop API client. Organize and execute REST, GraphQL, WebSockets, Server Sent Events, and gRPC 🦬
🚀 The fast, Pythonic way to build MCP servers and clients.
Korvus is a search SDK that unifies the entire RAG pipeline in a single database query. Built on top of Postgres with bindings for Python, JavaScript, Rust and C.
Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and search embeddings and metadata.
Python binding to Modest and Lexbor engines. Fast HTML5 parser with CSS selectors for Python.
The All in One Framework to Build Undefeatable Scrapers
This repository contains a curated collection of 300+ case studies from over 80 companies, detailing practical applications and insights into machine learning (ML) system design. The contents are o…
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
Infinity is a high-throughput, low-latency serving engine for text embeddings, reranking models, CLIP, CLAP, and ColPali
A Copy of FreeFileSync Source Code. This repository is just a mirror of the FreeFileSync source code. Please do not send pull requests. Submit issues to the official forum (https://freefilesync.org…
A Python toolkit/library for reality-centric machine/deep learning & data mining on partially-observed time series, with 50+ SOTA neural network models for scientific analysis tasks (imputation, cl…
Source for remoteintech.company — a community-maintained directory of remote-friendly tech companies
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
[NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation
Collection of Data Science pet projects
Python Implementation of MUVERA (Multi-Vector Retrieval via Fixed Dimensional Encodings)
Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-to-Speech
GenAI Processors is a lightweight Python library that enables efficient, parallel content processing.
A Python library and service for automatic speech recognition and transcription in Russian and English
This open-source curriculum introduces the fundamentals of Model Context Protocol (MCP) through real-world, cross-language examples in .NET, Java, TypeScript, JavaScript, Rust and Python. Designed …
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input
Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".
[IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer
[ACL 2025 Main] UniCodec: a unified audio codec with a single codebook to support multi-domain audio data, including speech, music, and sound
A lightweight Python package for Automatic Speech Recognition using ONNX models
🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets
A list of free LLM inference resources accessible via API.
Learn Agentic AI using Dapr Agentic Cloud Ascent (DACA) Design Pattern and Agent-Native Cloud Technologies: OpenAI Agents SDK, Memory, MCP, A2A, Knowledge Graphs, Dapr, Rancher Desktop, and Kuberne…