-
University of Science and Technology of China
- Hefei, Anhui, P.R.China
- http://husencd.github.io/
Stars
PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio
Training & Inference Code of PRNet in PyTorch 1.1.0
Expressive Anechoic Recordings of Speech (EARS)
Source code for blog post: A Practical Introduction to Deep Learning with Caffe and Python
Official PyTorch code for Deep Audio-Signal Holistic Embeddings
Official code for SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound
Lightweight, zero-dependency proxy and storage RTSP server
FxNorm-Automix - Implementation of automatic music mixing systems. We show how we can use wet music data and repurpose it to train a fully automatic mixing system
Headless multitrack mixing console in Python
This is the demo of our paper "IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation".
Blind source separation with independent vector analysis family of algorithm in torch
use STN+CNN+BLSTM+CTC to do OCR
The EBU ADM Renderer, written in Python, is the reference implementation of EBU Tech 3388
Measuring room impulse responses with python and sounddevice
Real-Time Spherical Array Renderer for binaural reproduction in Python
Companion code of DAFx23 "Differentiable Feedback Delay Network for Colorless Reverberation"
SHN-based (Stacked Hourglass Network) methods for 2D face alignment
PyTorch-based Driver Posture Classification
An experiment to do acoustic beamforming and beamsteering with Arduino.
Neural Modeling of Magnetic Tape Recorders
use yolo2 to detect character
This is a project which contains all of modules used in Posetrack and I will write a tutorial to teach everyone who knows little about deep learning and computer vision to construct an entire PoseT…