Stars
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…
Professional Antigravity Account Manager & Switcher. One-click seamless account switching for Antigravity Tools. Built with Tauri v2 + React (Rust).专业的 Antigravity 账号管理与切换工具。为 Antigravity 提供一键无缝账号切…
Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.
OpenAI ChatGPT, GPT-5, GPT-Image-1, Whisper API clients for Go
Baresip is a modular SIP User-Agent with audio and video support
High-quality speech synthesis with LoRA fine-tuning on index-tts, enhancing prosody and naturalness for single and multi-speaker voices.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Text-audio foundation model from Boson AI
Noise supression using deep filtering
Voice Activity Detector (VAD) : low-latency, high-performance and lightweight
Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)
A python package to analyze and compare voices with deep learning
Receipts for creating AI Applications with APIs from DashScope (and friends)!
Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.
Production-ready platform for agentic workflow development.
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
Retrieval and Retrieval-augmented LLMs
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具,支持API调用,在线批量解析及下载。
👾 Fast and simple video download library and CLI tool written in Go