Zotero MCP: Connects your Zotero research library with Claude and other AI assistants via the Model Context Protocol to discuss papers, get summaries, analyze citations, and more.

Python 670 60 Updated Oct 24, 2025

bigai-nlco / UltraVoice

Official Repository of UltraVoice

JavaScript 44 1 Updated Oct 28, 2025

Soul-AILab / SoulX-Podcast

SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.

Python 1,606 166 Updated Nov 4, 2025

kyutai-labs / nanoGPTaudio

Forked from karpathy/nanoGPT

Code for the blog "Neural audio codecs: how to get audio into LLMs"

Python 122 3 Updated Oct 20, 2025

lifeiteng / Aligner-SUPERB

Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark

Python 33 4 Updated May 7, 2025

meta-pytorch / OpenEnv

An interface library for RL post training with environments.

Python 610 83 Updated Nov 4, 2025

OpenBMB / UltraEval-Audio

An easy-to-use, fast, and easily integrable tool for evaluating audio LLM

Python 160 8 Updated Oct 31, 2025

meta-pytorch / torchforge

PyTorch-native post-training at scale

Python 494 50 Updated Nov 5, 2025

meituan-longcat / LongCat-Video

Python 1,022 90 Updated Nov 4, 2025

markson14 / FinancialReportAnalysis

中国市场分析脚本是一个功能强大的Python工具，旨在为用户提供对中国A股市场的深入分析。该脚本利用Akshare库从多种数据源获取实时和历史股票数据，并计算关键财务指标，以帮助投资者做出明智的决策。

Python 19 1 Updated Oct 10, 2025

phildougherty / sesame_csm_openai

OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT

Python 421 80 Updated Sep 26, 2025

vogent / vogent-turn

Vogent Turn: fast, open-source turn-detection for Voice AI applications

Python 33 2 Updated Oct 28, 2025

ZhangXinWhut / SimWhisper-Codec

Python 13 2 Updated Oct 21, 2025

liduojia1 / MeanFlowSE

Python 16 2 Updated Oct 16, 2025

nanless

Lists (32)

academic

acoustic echo cancellation

AIGC

audio codec

audio codecs

audio separation

audio tools

bandwidth extension

beamforming

computer vision

deep learning

diffusion

entertainments

hearing aid

LLM

mircophone array

music tools

noise reduction

packet loss compensation

programming related

simulation tools

singing voice tools

sound source localization

spatial audio

speaker recognition

speech dereverberation

speech diarization

speech frontend

speech recognition

speech separation

speech signal processing

speech voice tools

Starred repositories

LaTeX

noise-reduction