alphonz

long88 alphonz

Stars

k2-fsa / sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 11,264 1,274 Updated Apr 1, 2026

lbjlaq / Antigravity-Manager

Professional Antigravity Account Manager & Switcher. One-click seamless account switching for Antigravity Tools. Built with Tauri v2 + React (Rust).专业的 Antigravity 账号管理与切换工具。为 Antigravity 提供一键无缝账号切…

Rust 27,613 3,004 Updated Mar 25, 2026

breizhn / DTLN

Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.

Python 703 172 Updated Jul 28, 2023

sashabaranov / go-openai

OpenAI ChatGPT, GPT-5, GPT-Image-1, Whisper API clients for Go

Go 10,604 1,686 Updated Oct 21, 2025

baresip / baresip

Baresip is a modular SIP User-Agent with audio and video support

C 2,050 503 Updated Mar 24, 2026

restsend / rustpbx

A PBX written by rust

Rust 518 75 Updated Mar 31, 2026

asr-pub / index-tts-lora

High-quality speech synthesis with LoRA fine-tuning on index-tts, enhancing prosody and naturalness for single and multi-speaker voices.

Python 299 25 Updated Mar 12, 2026

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 9,453 1,043 Updated Mar 30, 2026

boson-ai / higgs-audio

Text-audio foundation model from Boson AI

Python 7,999 613 Updated Jan 18, 2026

pion / webrtc

Pure Go implementation of the WebRTC API

Go 16,174 1,833 Updated Mar 25, 2026

haoheliu / voicefixer

General Speech Restoration

Python 1,311 158 Updated Feb 17, 2025

Rikorose / DeepFilterNet

Noise supression using deep filtering

Python 4,009 434 Updated Oct 17, 2024

TEN-framework / ten-vad

Voice Activity Detector (VAD) : low-latency, high-performance and lightweight

C 2,056 162 Updated Feb 2, 2026

timsainb / noisereduce

Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)

Jupyter Notebook 1,830 267 Updated Aug 19, 2025

resemble-ai / Resemblyzer

A python package to analyze and compare voices with deep learning

Python 3,241 477 Updated Oct 12, 2023

dashscope / dash-cookbook

Receipts for creating AI Applications with APIs from DashScope (and friends)!

Jupyter Notebook 73 21 Updated Sep 26, 2024

SuperManito / LinuxMirrors

GNU/Linux 更换系统软件源脚本及 Docker 安装与换源脚本

Shell 7,203 671 Updated Mar 7, 2026

eosphoros-ai / Awesome-Text2SQL

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.

3,561 243 Updated Jan 26, 2026

langgenius / dify

Production-ready platform for agentic workflow development.

TypeScript 135,374 21,090 Updated Apr 1, 2026

modelscope / ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 4,012 329 Updated Aug 14, 2025

FlagOpen / FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Python 11,486 850 Updated Apr 1, 2026

Tangwego / mod_vad

a voice activity detection module for freeswitch.

C 23 23 Updated May 13, 2024

zc-passerby / mod_vad

a freeswitch mod

C 21 20 Updated Jul 30, 2019

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 20,350 2,317 Updated Mar 16, 2026

QwenLM / Qwen-Agent

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 15,865 1,528 Updated Mar 4, 2026

fishaudio / fish-speech

SOTA Open Source TTS

Python 28,994 2,435 Updated Mar 30, 2026

yangxianpku / milvus

使用Docker Stack搭建Milvus向量数据库集群

Python 39 4 Updated Sep 22, 2023

Evil0ctal / Douyin_TikTok_Download_API

🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具，支持API调用，在线批量解析及下载。

Python 16,884 2,489 Updated Oct 12, 2025

iawia002 / lux

👾 Fast and simple video download library and CLI tool written in Go

Go 31,000 3,245 Updated Mar 29, 2026

nilaoda / BBDown

Bilibili Downloader. 一个命令行式哔哩哔哩下载器.

C# 13,624 1,583 Updated Jan 10, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly