Skip to content
View NormonisPing's full-sized avatar

Block or report NormonisPing

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 250 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A python API for reading and writing SOFA files (https://www.sofaconventions.org/)

Python 28 8 Updated Mar 8, 2021

SOFA Toolbox (API for Matlab, Octave)

MATLAB 138 32 Updated Oct 8, 2025

A Python library aimed at acousticians.

Python 537 146 Updated Dec 10, 2023

官方推荐的 ChatTTS 资源汇总项目,整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project

1,790 108 Updated Jul 3, 2024

Official repository of the work "Speaker Distance Estimation in Enclosures from Single-Channel Audio" published to IEEE/ACM Transactions on Audio, Speech, and Language Processing.

Python 23 4 Updated Jan 3, 2025

This is the official implementation of reverberant speech to room impulse response estimator

Python 38 5 Updated Aug 7, 2024

Frontier Open-Source Text-to-Speech

9,511 1,166 Updated Sep 5, 2025

Code for the paper "RIR-in-a-Box : Estimating Room Acoustics from 3D Mesh Data through Shoebox Approximation" presented at Interspeech 2024.

Jupyter Notebook 14 Updated Sep 1, 2024

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Python 1,683 469 Updated Sep 27, 2025

Model for selecting perceptually relevant early reflections for parametric spatial sound rendering

MATLAB 13 7 Updated Oct 26, 2023

Impulse response generation based on state-of-the-art geometric sound propagation engine.

C++ 168 25 Updated Jan 17, 2023

SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios

Python 245 23 Updated Jan 22, 2025

A collection of projects showcasing RAG, agents, workflows, and other AI use cases

Python 6,599 800 Updated Oct 8, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 46,743 3,818 Updated Oct 8, 2025

Production-ready platform for agentic workflow development.

TypeScript 116,027 17,894 Updated Oct 9, 2025

实时STT,连接OpenAI接口/智谱AI(流式LLM)和GPT-SOVITS/Edge-TTS,通过网页的方式,进行跨网络的服务调用,实现实时对话的效果

Python 420 50 Updated Dec 31, 2024

TTS with kokoro and onnx runtime

Python 2,215 222 Updated Jun 20, 2025

SuperSonic is the next-generation AI+BI platform that unifies Chat BI (powered by LLM) and Headless BI (powered by semantic layer) paradigms.

Java 4,235 843 Updated Sep 22, 2025

Self-hosted voice chat with LLMs

Rust 462 39 Updated Feb 28, 2025

The reproduced code for Google's SoundStorm

Python 269 20 Updated Oct 7, 2023

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 16,773 1,821 Updated Sep 28, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 59,701 10,580 Updated Oct 9, 2025

🚀 Next Generation AI One-Stop Internationalization Solution. 🚀 下一代 AI 一站式 B/C 端解决方案,支持 OpenAI,Midjourney,Claude,讯飞星火,Stable Diffusion,DALL·E,ChatGLM,通义千问,腾讯混元,360 智脑,百川 AI,火山方舟,新必应,Gemini,Moonshot …

TypeScript 8,776 1,174 Updated Aug 27, 2025

The official implementation of GTCRN, an ultra-lightweight SE model.

Python 440 76 Updated May 28, 2025

Efficient Multimodal Large Language Models: A Survey

373 21 Updated Apr 29, 2025

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,807 133 Updated Jul 5, 2024

A generative speech model for daily dialogue.

Python 37,939 4,106 Updated Jul 6, 2025

Control adaptive filters with neural networks.

Python 253 44 Updated Feb 2, 2025

Executable file for VITS inference

Python 2,396 245 Updated Aug 22, 2023
Next