Stars
Port of OpenAI's Whisper model in C/C++
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
A free and strong UCI chess engine
MuseScore is an open source and free music notation software. For support, contribution, bug reports, visit MuseScore.org. Fork and make pull requests!
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-metric spaces.
C++ library for audio and music analysis, description and synthesis, including Python bindings
Audio Plugin for Audio to MIDI transcription using deep learning.
Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON, RISC-V RVV)
A translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation
🇸Superpowered Audio, Networking and Cryptographics SDKs. High performance and cross platform on Android, iOS, macOS, tvOS, Linux, Windows and modern web browsers.
Official mirror of Rubber Band Library, an audio time-stretching and pitch-shifting library.
Fast implementation of the edit distance(Levenshtein distance)
PaulXStretch - Extreme Timestretching application and plugin
A fast K Nearest Neighbor library for low-dimensional spaces
speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription
Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech