default search action
EURASIP Journal on Audio, Speech, and Music Processing, Volume 2021
Volume 2021, Number 1, December 2021
- Jorge Llombart, Dayana Ribas, Antonio Miguel, Luis Vicente, Alfonso Ortega, Eduardo Lleida:
Progressive loss functions for speech enhancement with deep neural networks. 1 - Yuval Dorfan, Boaz Schwartz, Sharon Gannot:
Forward-backward recursive expectation-maximization for concurrent speaker tracking. 2 - Vincent Lostanlen, Christian El-Hajj, Mathias Rossignol, Grégoire Lafay, Joakim Andén, Mathieu Lagrange:
Time-frequency scattering accurately models auditory similarities between instrumental playing techniques. 3 - Yuki Saishu, Amir Hossein Poorjam, Mads Græsbøll Christensen:
A CNN-based approach to identification of degradations in speech signals. 1-10 - Chunyan Ji, Thosini Bamunu Mudiyanselage, Yutong Gao, Yi Pan:
A review of infant cry analysis and classification. 1-17 - Rajat Hebbar, Pavlos Papadopoulos, Ramon Reyes, Alexander F. Danvers, Angelina J. Polsinelli, Suzanne A. Moseley, David A. Sbarra, Matthias R. Mehl, Shrikanth Narayanan:
Deep multiple instance learning for foreground speech localization in ambient audio from wearable devices. 1-8 - Sören Schulze, Emily J. King:
Sparse pursuit and dictionary learning for blind source separation in polyphonic music recordings. 1-25 - Bracha Laufer-Goldshtein, Ronen Talmon, Sharon Gannot:
Audio source separation by activity probability detection with maximum correlation and simplex geometry. 1-16 - Norihide Kitaoka, Bohan Chen, Yuya Obashi:
Dynamic out-of-vocabulary word registration to language model for speech recognition. 1-8 - Randall Ali, Toon van Waterschoot, Marc Moonen:
An integrated MVDR beamformer for speech enhancement using a local microphone array and external microphones. 10 - Kacper Radzikowski, Le Wang, Osamu Yoshie, Robert M. Nowak:
Accent modification for speech recognition of non-native speakers using neural style transfer. 11 - Beatriz Martínez-González, José M. Pardo, José A. Vallejo-Pinto, Rubén San Segundo, Javier Ferreiros:
Analysis of transition cost and model parameters in speaker diarization for meetings. 12 - Sushmita Thakallapalli, Suryakanth V. Gangashetty, Nilesh Madhu:
NMF-weighted SRP for multi-speaker direction of arrival estimation: robustness to spatial aliasing while exploiting sparsity in the atom-time domain. 13 - Zonglong Bai, Liming Shi, Jesper Rindom Jensen, Jinwei Sun, Mads Græsbøll Christensen:
Acoustic DOA estimation using space alternating sparse Bayesian learning. 14 - Randall Ali, Toon van Waterschoot, Marc Moonen:
Correction to: An integrated MVDR beamformer for speech enhancement using a local microphone array and external microphones. 15 - Hodaya Hammer, Shlomo E. Chazan, Jacob Goldberger, Sharon Gannot:
Dynamically localizing multiple speakers based on the time-frequency domain. 16 - Yuxuan Ke, Andong Li, Chengshi Zheng, Renhua Peng, Xiaodong Li:
Low-complexity artificial noise suppression methods for deep learning-based speech enhancement algorithms. 17 - Duowei Tang, Peter Kuppens, Luc Geurts, Toon van Waterschoot:
End-to-end speech emotion recognition using a novel context-stacking dilated convolution neural network. 18 - Sichen Liu, Feiran Yang, Yin Cao, Jun Yang:
Frequency-dependent auto-pooling function for weakly supervised sound event detection. 19 - Miao Liu, Jing Wang, Weiming Yi, Fang Liu:
Neural network-based non-intrusive speech quality assessment using attention pooling function. 20 - Marco Gimm, Philipp Bulling, Gerhard Schmidt:
Residual feedback suppression with extended model-based postfilters. 21 - Yong Lü, Han Lin, Pingping Wu, Yitao Chen:
Feature compensation based on independent noise estimation for robust speech recognition. 22 - Maoshen Jia, Shang Gao, Changchun Bao:
Multi-source localization by using offset residual weight. 23 - Ziyi Xu, Samy Elshamy, Ziyue Zhao, Tim Fingscheidt:
Components loss for neural networks in mask-based speech enhancement. 24 - Tobias Gburrek, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
Geometry calibration in wireless acoustic sensor networks utilizing DoA and distance information. 25 - Lujun Li, Yikai Kang, Yuchen Shi, Ludwig Kürzinger, Tobias Watzel, Gerhard Rigoll:
Adversarial joint training with self-attention mechanism for robust end-to-end speech recognition. 26 - Gui-Xin Shi, Wei-Qiang Zhang, Guan-Bo Wang, Jing Zhao, Shuzhou Chai, Ze-Yu Zhao:
Timestamp-aligning and keyword-biasing end-to-end ASR front-end for a KWS system. 27 - Alexandru-Lucian Georgescu, Alessandro Pappalardo, Horia Cucu, Michaela Blott:
Performance vs. hardware requirements in state-of-the-art automatic speech recognition. 28 - Linhui Sun, Yunyi Bu, Pingan Li, Zihao Wu:
Single-channel speech enhancement based on joint constrained dictionary learning. 29 - Mina Mounir, Peter Karsmakers, Toon van Waterschoot:
Musical note onset detection based on a spectral sparsity measure. 30 - Masoud Geravanchizadeh, Elnaz Forouhandeh, Meysam Bashirpour:
Feature compensation based on the normalization of vocal tract length for the improvement of emotion-affected speech recognition. 31 - Sujan Kumar Roy, Kuldip K. Paliwal:
A noise PSD estimation algorithm using derivative-based high-pass filter in non-stationary noise conditions. 32 - Nili Cohen, Gershon Hazan, Boaz Schwartz, Sharon Gannot:
An online algorithm for echo cancellation, dereverberation and noise reduction based on a Kalman-EM Method. 33 - Yanhua Long, Shuang Wei, Jie Lian, Yijie Li:
Pronunciation augmentation for Mandarin-English code-switching speech recognition. 34 - Hongsheng Chen, Guoliang Chen, Kai Chen, Jing Lu:
Nonlinear residual echo suppression based on dual-stream DPRNN. 35 - David Ackermann, Fabian Brinkmann, Franz Zotter, Malte Kob, Stefan Weinzierl:
Comparative evaluation of interpolation methods for the directivity of musical instruments. 36 - Johannes M. Arend, Tim Lübeck, Christoph Pörschmann:
Efficient binaural rendering of spherical microphone array data by linear filtering. 37 - You-Siang Chen, Zi-Jie Lin, Mingsian R. Bai:
A multichannel learning-based approach for sound source separation in reverberant environments. 38 - Diego Di Carlo, Pinchas Tandeitnik, Cédric Foy, Nancy Bertin, Antoine Deleforge, Sharon Gannot:
dEchorate: a calibrated room impulse response dataset for echo-aware signal processing. 39 - Fangkun Liu, Hui Wang, Renhua Peng, Chengshi Zheng, Xiaodong Li:
U2-VC: one-shot voice conversion using two-level nested U-structure. 40 - Yuancheng Luo:
Spherical harmonic covariance and magnitude function encodings for beamformer design. 41 - Zolzaya Byambadorj, Ryota Nishimura, Altangerel Ayush, Kengo Ohta, Norihide Kitaoka:
Text-to-speech system for low-resource language using cross-lingual transfer learning and data augmentation. 42 - Ofer Schwartz, Sharon Gannot:
A recursive expectation-maximization algorithm for speaker tracking and separation. 43 - Yuki Takashima, Ryoichi Takashima, Ryota Tsunoda, Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki, Nobuaki Motoyama:
Unsupervised domain adaptation for lip reading based on cross-modal knowledge distillation. 44 - Jiacheng Yao, Jing Zhang, Jiafeng Li, Li Zhuo:
Anchor voiceprint recognition in live streaming via RawNet-SA and gated recurrent unit. 45
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.