-
TurnLite Public
TurnLite: Lightweight Interruption Detection (ICASSP 2026 Submission)
-
LLaMA-Omni Public
Forked from ictnlp/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Python Apache License 2.0 Updated Apr 17, 2025
-
s3prl Public
Forked from s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Python Apache License 2.0 Updated Mar 11, 2025 -
neural_sp Public
Forked from hirofumi0810/neural_sp
End-to-end ASR/LM implementation with PyTorch.
Python Apache License 2.0 Updated Jun 27, 2020 -
muduo Public
Forked from chenshuo/muduo
Event-driven network library for multi-threaded Linux server in C++11
C++ Other Updated May 31, 2020 -
pychain Public
Forked from YiwenShaoStephen/pychain
PyTorch implementation of LF-MMI for End-to-end ASR
C++ Updated May 29, 2020 -
tf-code-acoustics Public
Forked from datemoon/tf-code-acoustics
A code library for training acoustic models.
Python Updated May 20, 2020 -
speech-representations Public
Forked from awslabs/speech-representations
Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)
Python Apache License 2.0 Updated May 15, 2020 -
warp-transducer Public
Forked from HawkAaron/warp-transducer
A fast parallel implementation of RNN Transducer.
C++ Apache License 2.0 Updated Apr 27, 2020 -
Papers with code. Sorted by stars. Updated weekly.
Updated Jan 16, 2020 -
Application-of-Word2vec-in-Phoneme-Recognition Public
Forked from fengxin-bupt/Application-of-Word2vec-in-Phoneme-Recognition
Build an attention-based model for speech recognition. Use the Word2vec model to help train the attention model.
Python MIT License Updated Dec 18, 2019 -
learn-regex Public
Forked from ziishaned/learn-regex
Learn regex the easy way
MIT License Updated Dec 9, 2019 -
DeepXi Public
Forked from anicolson/DeepXi
Deep Xi: A Deep Learning Approach to A Priori SNR Estimation. Used for Speech Enhancement and robust ASR.
Python Mozilla Public License 2.0 Updated Dec 5, 2019 -
pykaldi2 Public
Forked from jzlianglu/pykaldi2
Yet another speech toolkit based on Kaldi and PyTorch
Python MIT License Updated Dec 4, 2019 -
onssen Public
Forked from speechLabBcCuny/onssen
An open-source speech separation and enhancement library
Python GNU General Public License v3.0 Updated Nov 29, 2019 -
TAC Public
Forked from yluo42/TAC
Transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.
Python Updated Nov 11, 2019 -
espresso Public
Forked from freewym/espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
Python Other Updated Oct 28, 2019 -
OR-NMT Public
Forked from libo8621696/OR-NMT
Source code for the paper "Bridging the Gap between Training and Inference for Neural Machine Translation"
Python Updated Oct 2, 2019 -
Wave-U-Net-for-Speech-Enhancement Public template
Forked from haoxiangsnr/Wave-U-Net-for-Speech-Enhancement
Implements [Wave-U-Net](https://arxiv.org/abs/1806.03185) in PyTorch and adapts it to speech enhancement.
Python MIT License Updated Sep 12, 2019 -
Speech-Recognition Public
A case study: building a speech recognition model in Python
Jupyter Notebook Updated Sep 6, 2019 -
dejavu Public
Forked from worldveil/dejavu
Audio fingerprinting and recognition in Python
Python MIT License Updated Aug 15, 2019 -
A-Convolutional-Recurrent-Neural-Network-for-Real-Time-Speech-Enhancement Public
Forked from haoxiangsnr/A-Convolutional-Recurrent-Neural-Network-for-Real-Time-Speech-Enhancement
PyTorch implementation of "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement".
Python Updated Aug 12, 2019 -
noise_adaptive_DAT_SE Public
Forked from jerrygood0703/noise_adaptive_DAT_SE
Python Updated Jul 25, 2019 -
TDengine Public
Forked from taosdata/TDengine
An open-source big data platform designed and optimized for the Internet of Things (IoT).
C GNU Affero General Public License v3.0 Updated Jul 15, 2019 -
boxx Public
Forked from DIYer22/boxx
Toolbox for efficient building and debugging in Python, especially for scientific computing and computer vision.
Python Updated Jul 13, 2019 -
leetcode-1 Public
Forked from azl397985856/leetcode
LeetCode Solutions: A Record of My Problem Solving Journey.
JavaScript Apache License 2.0 Updated Jul 3, 2019 -
Speech-enhancement-1 Public
Forked from jtkim-kaist/Speech-enhancement
Deep neural network based speech enhancement toolkit
MATLAB GNU General Public License v2.0 Updated Jun 14, 2019 -
Two-Stage-Polyphonic-Sound-Event-Detection-and-Localization Public
Forked from yinkalario/Two-Stage-Polyphonic-Sound-Event-Detection-and-Localization
A two-stage polyphonic sound event detection and localization method for both SED and DOA.
Python Updated Jun 13, 2019 -