This library provides common speech features for ASR including MFCCs and filterbank energies.

Python 2,422 609 Updated Oct 20, 2021

a small build system with a focus on speed

C++ 13,013 1,805 Updated May 21, 2026

Re-implementation of TensorFlow in pure Python, with an emphasis on code understandability

Jupyter Notebook 680 88 Updated Apr 11, 2021

A library to detect wake words and aid in responding to them

JavaScript 2 Updated Nov 18, 2019

A port of python_speech_features to C.

C 49 13 Updated May 17, 2017

Working POC of Mikrotik exploit from Vault 7 CIA Leaks

Python 2 Updated Jun 6, 2018

Working POC of Mikrotik exploit from Vault 7 CIA Leaks

Python 663 214 Updated Sep 20, 2022

TensorFlow examples in C, C++, Go and Python without bazel but with cmake and FindTensorFlow.cmake

CMake 443 86 Updated Aug 18, 2019

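A domain-specific question answering system that includes automatic knowledge base construction, question retrieval, and a QA system built on the WeChat platform. All code for this project is open source. With simple configuration, users can quickly and automatically build a fairly complete domain knowledge base. For how to set up the QA system on the WeChat platform through configuration, see readme.txt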

Java 71 28 Updated Nov 26, 2016

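A domain-specific question answering system that includes automatic knowledge base construction, question retrieval, and a QA system built on the WeChat platform. All code for this project is open source. With simple configuration, users can quickly and automatically build a fairly complete domain knowledge base. For how to set up the QA system on the WeChat platform through configuration, see readme.txt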

Java 1 Updated Nov 26, 2016

download URL for the ASR transcripts and the lattices

4 2 Updated Aug 3, 2018

CHiME-5 Baseline Array Synchronisation

Python 12 4 Updated Sep 24, 2018
C 2 2 Updated Apr 5, 2018

Rokid intelligent speech recognition demo (Android Studio project), running on the Android 6.0 platform

Java 5 3 Updated Nov 30, 2017

[Skype Silk Codec SDK] Decode silk v3 audio files (like wechat amr, aud files, qq slk files) and convert to other formats (like mp3). Batch conversion support.

C 3,151 688 Updated Jan 22, 2025

A Demo of Mandarin/Chinese TTS frontend

Python 284 122 Updated Apr 18, 2022

A collection of resources to make a smart speaker

477 94 Updated Dec 20, 2019

Source code of the model used in Tensorflow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in top 5% in private leaderboard.

Jupyter Notebook 58 28 Updated Mar 30, 2018

An End-to-End Architecture for Keyword Spotting and Voice Activity Detection

Python 386 77 Updated Mar 24, 2023

A Python wrapper for Kaldi

Python 1,036 251 Updated Nov 30, 2025

RecordRTC is a WebRTC JavaScript library for audio/video as well as screen activity recording. It supports Chrome, Firefox, Opera, Android, and Microsoft Edge. Platforms: Linux, Mac and Windows.

JavaScript 6,905 1,762 Updated May 13, 2024

Speaker embedding (verification and recognition) using PyTorch

Python 369 97 Updated Jul 24, 2020

VoCore2 firmware drivers

C 106 46 Updated Mar 13, 2025

This project has been deprecated

Python 75 37 Updated Feb 3, 2017

DOA, VAD and KWS for ReSpeaker Microphone Array

Python 331 90 Updated Aug 28, 2018

JAMS annotation files for the original and augmented UrbanSound8K dataset

35 5 Updated Jan 31, 2018

A library for augmenting annotated audio data

Python 237 32 Updated May 3, 2021

Audio Classifier in Keras using Convolutional Neural Network

Python 159 60 Updated May 6, 2019

Stretch any audio to extreme lengths

Python 6 1 Updated May 17, 2014

Manipulate audio with a simple and easy high level interface

Python 9,766 1,133 Updated Mar 19, 2026