Skip to content
View alphonz's full-sized avatar

Block or report alphonz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 11,264 1,274 Updated Apr 1, 2026

Professional Antigravity Account Manager & Switcher. One-click seamless account switching for Antigravity Tools. Built with Tauri v2 + React (Rust).专业的 Antigravity 账号管理与切换工具。为 Antigravity 提供一键无缝账号切…

Rust 27,613 3,004 Updated Mar 25, 2026

Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.

Python 703 172 Updated Jul 28, 2023

OpenAI ChatGPT, GPT-5, GPT-Image-1, Whisper API clients for Go

Go 10,604 1,686 Updated Oct 21, 2025

Baresip is a modular SIP User-Agent with audio and video support

C 2,050 503 Updated Mar 24, 2026

A PBX written by rust

Rust 518 75 Updated Mar 31, 2026

High-quality speech synthesis with LoRA fine-tuning on index-tts, enhancing prosody and naturalness for single and multi-speaker voices.

Python 299 25 Updated Mar 12, 2026

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 9,453 1,043 Updated Mar 30, 2026

Text-audio foundation model from Boson AI

Python 7,999 613 Updated Jan 18, 2026

Pure Go implementation of the WebRTC API

Go 16,174 1,833 Updated Mar 25, 2026

General Speech Restoration

Python 1,311 158 Updated Feb 17, 2025

Noise supression using deep filtering

Python 4,009 434 Updated Oct 17, 2024

Voice Activity Detector (VAD) : low-latency, high-performance and lightweight

C 2,056 162 Updated Feb 2, 2026

Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)

Jupyter Notebook 1,830 267 Updated Aug 19, 2025

A python package to analyze and compare voices with deep learning

Python 3,241 477 Updated Oct 12, 2023

Receipts for creating AI Applications with APIs from DashScope (and friends)!

Jupyter Notebook 73 21 Updated Sep 26, 2024

GNU/Linux 更换系统软件源脚本及 Docker 安装与换源脚本

Shell 7,203 671 Updated Mar 7, 2026

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.

3,561 243 Updated Jan 26, 2026

Production-ready platform for agentic workflow development.

TypeScript 135,374 21,090 Updated Apr 1, 2026

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 4,012 329 Updated Aug 14, 2025

Retrieval and Retrieval-augmented LLMs

Python 11,486 850 Updated Apr 1, 2026

a voice activity detection module for freeswitch.

C 23 23 Updated May 13, 2024

a freeswitch mod

C 21 20 Updated Jul 30, 2019

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 20,350 2,317 Updated Mar 16, 2026

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 15,865 1,528 Updated Mar 4, 2026

SOTA Open Source TTS

Python 28,994 2,435 Updated Mar 30, 2026

使用Docker Stack搭建Milvus向量数据库集群

Python 39 4 Updated Sep 22, 2023

🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具,支持API调用,在线批量解析及下载。

Python 16,884 2,489 Updated Oct 12, 2025

👾 Fast and simple video download library and CLI tool written in Go

Go 31,000 3,245 Updated Mar 29, 2026

Bilibili Downloader. 一个命令行式哔哩哔哩下载器.

C# 13,624 1,583 Updated Jan 10, 2026
Next