Stars
"🐈 nanobot: The Ultra-Lightweight Personal AI Agent"
Shared starter template configuration and CLAUDE.md memory bank system for Claude Code
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…
Recurrent neural network for audio noise reduction
用 onnx 和 gguf 格式混合运行 Fun-ASR-Nano 模型全流程
An agentic skills framework & software development methodology that works.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Official Python toolkit for the Qwen3-ASR API. Parallel high‑throughput calls, robust long‑audio transcription, multi‑sample‑rate support.
本项目为xiaozhi-esp32提供后端服务,帮助您快速搭建ESP32设备控制服务器。Backend service for xiaozhi-esp32, helps you quickly build an ESP32 device control server.
Robust Speech Recognition via Large-Scale Weak Supervision
Port of OpenAI's Whisper model in C/C++
💫 Toolkit to help you get started with Spec-Driven Development
The official Python SDK for Model Context Protocol servers and clients
一个提供开发中常用数据的一个稳定聚合Api接口源,运行于独立服务器,免费,且长期维护,会持续添加新的接口!【只fork不star是很没品的】
LangGPT: Empowering everyone to become a prompt expert! 🚀 📌 结构化提示词(Structured Prompt)提出者 📌 元提示词(Meta-Prompt)发起者 📌 最流行的提示词落地范式 | Language of GPT The pioneering framework for structured & meta-prompt…
一个批量转存、分享和下载夸克网盘文件的工具,可以快速地将大量分享文件转存到到自己的网盘内,或者将网盘文件批量生成分享链接
Official codes for ACL 2023 paper "WebCPM: Interactive Web Search for Chinese Long-form Question Answering"
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Multilingual Voice Understanding Model
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Retrieval and Retrieval-augmented LLMs
Netease Youdao's open-source embedding and reranker models for RAG products.