Skip to content
View Ryuk17's full-sized avatar
👻
Who can save me?
👻
Who can save me?

Block or report Ryuk17

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
JavaScript 7 Updated May 15, 2026

Lightweight streaming Voice Activity Detection (VAD) tool with ONNX runtime

Python 19 Updated Mar 18, 2026

Public repository for Agent Skills

Python 136,374 16,086 Updated May 17, 2026

收集整理开源的数据标注工具

958 184 Updated Oct 9, 2019

speex aec kalman filter

Python 15 6 Updated Mar 17, 2024

DistantSpeech

Jupyter Notebook 22 6 Updated Oct 9, 2023

A training code template for DNN-based speech enhancement.

Python 195 45 Updated Sep 4, 2025

streaming codes for funasr-nano

Python 10 Updated Jan 26, 2026

每日学习笔记

Python 67 17 Updated Dec 12, 2025

Voice Activity Detector (VAD) : low-latency, high-performance and lightweight

C 2,121 168 Updated Feb 2, 2026

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 57,547 6,278 Updated Apr 30, 2026

Official Implementation of GLAP - General Language Audio Pretraining

Python 72 3 Updated May 14, 2026

[TACL'26] VoiceBench: Benchmarking LLM-Based Voice Assistants

Python 363 24 Updated Apr 28, 2026

Official inference framework for 1-bit LLMs

Python 39,025 3,555 Updated Mar 10, 2026

Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…

Python 1,884 161 Updated Feb 25, 2026

https://hf.co/hexgrad/Kokoro-82M

JavaScript 7,058 766 Updated Aug 6, 2025

pHash - the open source perceptual hash library

C++ 631 82 Updated Jan 30, 2023

partitioned block based frequency domain Kalman filter

Python 58 9 Updated Jan 14, 2023

TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loudness normalization operations.

Python 111 15 Updated Dec 20, 2024

Different implementations of "Weighted Prediction Error" for speech dereverberation

Python 562 169 Updated Mar 19, 2025

KWS demo based on CTC prefix beam search.

Python 18 2 Updated Oct 21, 2023

Python library & examples for Masked Language Model Scoring (ACL 2020)

Python 349 60 Updated Dec 20, 2022

使用Bert,ERNIE,进行中文文本分类

Python 4,424 926 Updated Jun 28, 2024

Experiments with BitNet inference on CPU

C++ 57 4 Updated Apr 1, 2024

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

9,894 1,556 Updated Feb 6, 2026
Next