Skip to content
View kainguo's full-sized avatar

Block or report kainguo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

QVAC Fabric: cross-platform LLM inference and fine-tuning, optimized for edge devices and heterogenous GPUs

C++ 89 31 Updated Apr 7, 2026

收集并整理有关OCR的数据集并统一标注格式,以便实验需要

Python 971 201 Updated Nov 28, 2023

OpenOCR: An Open-Source Toolkit for General-OCR Research and Applications, integrates a unified training and evaluation benchmark, commercial-grade OCR and Document Parsing systems, and faithful re…

Python 1,315 119 Updated Mar 2, 2026

Ready-to-use SRT / WebRTC / RTSP / RTMP / LL-HLS / MPEG-TS / RTP media server and media proxy that allows to read, publish, proxy, record and playback video and audio streams.

Go 18,461 2,183 Updated Apr 13, 2026

The Galène videoconference server

Go 1,280 182 Updated Apr 7, 2026

Boost ASIO with Bluetooth RFCOMM

C++ 17 4 Updated Nov 4, 2019

Fast Multimodal LLM on Mobile Devices

C++ 1,467 188 Updated Apr 12, 2026

Official repository for the WenetSpeech-Chuan dataset.

Python 170 4 Updated Feb 5, 2026

FastBee开源物联网平台,简单易用,可用于搭建物联网平台以及二次开发和学习。适用于智能家居、智慧办公、智慧社区、农业监测、水利监测、工业控制等。

Java 2,149 577 Updated Apr 2, 2026

一个执着于让CPU\端侧-Model逼近GPU-Model性能的项目,CPU上的实时率(RTF)小于0.1

Python 474 114 Updated Mar 13, 2025

TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation

Python 409 60 Updated Oct 6, 2025

BSP kernel source

C 162 155 Updated Mar 31, 2026

My AI study

27 7 Updated Apr 10, 2026

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 11,535 1,324 Updated Apr 13, 2026

Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation, clipping, equalization (EQ) distortion, packet loss, codec…

Python 76 7 Updated Jul 29, 2024

Port of Funasr's Sense-voice model in C/C++

C 540 70 Updated Dec 19, 2025

Port of Funasr's Sense-voice model in C/C++

C 1 Updated Dec 11, 2025

Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.

Python 134 21 Updated Jan 28, 2026

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 23,096 4,412 Updated Apr 13, 2026

autocorrelation-based O(NlogN) pitch detection

C++ 645 75 Updated Jan 7, 2025

Deep learning for audio denoising

Python 755 131 Updated Oct 15, 2023

interest repositories

341 67 Updated Feb 6, 2024

speech enhancement\speech seperation\sound source localization

1,231 224 Updated Nov 14, 2023

Open-Source Large Vocabulary Continuous Speech Recognition Engine

C 1,927 305 Updated Jun 16, 2025

Machine learning framework for both deep learning and traditional algorithms

C++ 797 126 Updated Nov 26, 2025

Machine learning framework for both deep learning and traditional algorithms

C++ 1 Updated Dec 23, 2021

Espressif Advanced Development Framework for Multimedia Applications

C 2,212 849 Updated Apr 3, 2026
Next