-
University of Science and Technology of China
- Hefei, Anhui, P.R.China
- http://husencd.github.io/
Stars
Fast and accurate face landmark detection library using PyTorch; Support 68-point semi-frontal and 39-point profile landmark detection; Support both coordinate-based and heatmap-based inference; Up…
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
Open repository of simulated Room Impulse Responses (RIR) accompanying the paper "Hearing Anywhere in Any Environment"
Companion repository of the DAFx25 tutorial "Building Flexible Audio DDSP Pipelines: A Case Study on Artificial Reverb"
Lightweight, zero-dependency proxy and storage RTSP server
Audio Share can share Windows/Linux computer's audio to Android phone over network, so your phone becomes the speaker of computer. (You needn't buy a new speaker😄.)
Realtime human head pose estimation with ONNXRuntime and OpenCV.
Convert any wav audio files to apple .ahap file (Apple Haptic and Audio Pattern)
simple delaysum, MVDR and CGMM-MVDR
C++ library for audio and music analysis, description and synthesis, including Python bindings
Pitch tracking in real-time with the Kalman filter
RNNOISE Noise elimination, MCRA noise estimation, OMLSA post filtering
This is the code repository for the course: https://b23.tv/F3n4Oh7
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
A lightweight, high-performance Kalman Filter library in C, C++, and MATLAB, offering superior numerical stability and efficiency with minimal dependencies.
Differentiable signal processing on the sphere for PyTorch
Official PyTorch code for Deep Audio-Signal Holistic Embeddings
The End-to-End Magnitude Least Squares Binaural Renderer for Spherical Microphone Array Signals
The official implementation of GTCRN, an ultra-lightweight SE model.
Hearing Anything Anywhere Code Release