Shuai Wang 0016
Person information
- affiliation: Chinese University of Hong Kong-Shenzhen (CUHK-SZ), Shenzhen Research Institute of Big Data, Shenzhen, China
- affiliation (PhD 2020): Shanghai Jiao Tong University, Department of Computer Science and Engineering, China
Other persons with the same name
- Shuai Wang — disambiguation page
- Shuai Wang 0001 — Simula Research Laboratory, Oslo, Norway (and 1 more)
- Shuai Wang 0002 — Chinese Academy of Sciences, Academy of Mathematics and Systems Science, NCMIS, Beijing, China
- Shuai Wang 0003 — Hangzhou Dianzi University, School of Cyberspace, Lishui Institute, China (and 3 more)
- Shuai Wang 0004 — Chinese Academy of Sciences, Shenzhen Institute of Advanced Technology, China (and 2 more)
- Shuai Wang 0005 — Chinese Academy of Sciences, Institute of Automation, State Key Laboratory of Management and Control for Complex Systems, Beijing, China
- Shuai Wang 0006 — Nanjing University, Department of Computer Science and Technology, China (and 1 more)
- Shuai Wang 0007 — Tencent Robotics X, Shenzhen, China (and 1 more)
- Shuai Wang 0008 — Southeast University, School of Computer Science and Engineering, Nanjing, China (and 1 more)
- Shuai Wang 0009 — Sun Yat-sen University, Shenzhen, China (and 1 more)
- Shuai Wang 0010 — Ryerson University, Department of Electrical and Computer Engineering, Toronto, ON, Canada (and 1 more)
- Shuai Wang 0011 — Hong Kong University of Science and Technology, Hong Kong (and 2 more)
- Shuai Wang 0012 — Hong Kong Polytechnic University, Department of Computing, Hong Kong
- Shuai Wang 0013 — Beijing Institute of Technology, School of Information and Electronics, China
- Shuai Wang 0014 — Vrije Universiteit Amsterdam, The Netherlands (and 2 more)
- Shuai Wang 0015 — Wuhan University, School of Resource and Environmental Science, China
- Shuai Wang 0017 — Tianjin University, Tianjin Key Laboratory of Imaging and Sensing Microelectronic Technology, China
- Shuai Wang 0018 — University of Science and Technology of China, Department of Automation, Hefei, China
- Shuai Wang 0019 — Xidian University, State Key Laboratory of Integrated Service Networks, Xi'an, China
- Shuai Wang 0020 — University of Illinois at Chicago, Department of Computer Science, USA
- Shuai Wang 0021 — George Mason University, Department of Computer Science, Fairfax, VA, USA (and 1 more)
- Shuai Wang 0022 — SRI International, Center for Technology in Learning, Menlo Park, CA, USA (and 1 more)
- Shuai Wang 0023 — Changchun University of Science and Technology, School of Computer Science and Technology, China (and 1 more)
- Shuai Wang 0024 — Shenyang Agricultural University, College of Land and Environment, China (and 2 more)
- Shuai Wang 0025 — Changchun University of Science and Technology, School of Science, China
- Shuai Wang 0026 — Chinese Academy of Sciences, Aerospace Information Research Institute, Beijing, China (and 1 more)
- Shuai Wang 0027 — Beihang University, School of Computer Science and Engineering, Beijing, China (and 1 more)
- Shuai Wang 0028 — Tsinghua University, Department of Computer Science and Technology, Beijing, China
- Shuai Wang 0029 — Boston University, Division of Systems Engineering, Boston, MA, USA
- Shuai Wang 0030 — JOYY Inc, Beijing, China (and 2 more)
- Shuai Wang 0031 — China Three Gorges University, Key Laboratory of Intelligent Vision Based Monitoring for Hydroelectric Engineering, Yichang, China (and 1 more)
- Shuai Wang 0032 — University of Queensland, QLD, Australia
- Shuai Wang 0033 — Singapore University of Technology and Design, Information Systems Technology and Design Pillar, Tampines, Singapore (and 2 more)
- Shuai Wang 0034 — Yuncheng University, Shanxi Province Optoelectronic Information Science and Technology Laboratory, China (and 1 more)
2020 – today
- 2024
- [j12]Shuai Wang, Zhengyang Chen, Bing Han, Hongji Wang, Chengdong Liang, Binbin Zhang, Xu Xiang, Wen Ding, Johan Rohdin, Anna Silnova, Yanmin Qian, Haizhou Li:
Advancing speaker embedding learning: Wespeaker toolkit for research and production. Speech Commun. 162: 103104 (2024)
- [j11]Zhengyang Chen, Bing Han, Shuai Wang, Yanmin Qian:
Attention-Based Encoder-Decoder End-to-End Neural Diarization With Embedding Enhancer. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1636-1649 (2024)
- [j10]Wupeng Wang, Zexu Pan, Xinke Li, Shuai Wang, Haizhou Li:
Speech Separation With Pretrained Frontend to Minimize Domain Mismatch. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4184-4198 (2024)
- [j9]Shuai Wang, Zhengyang Chen, Kong Aik Lee, Yanmin Qian, Haizhou Li:
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4971-4998 (2024)
- [c45]Chenpeng Du, Yiwei Guo, Feiyu Shen, Zhijun Liu, Zheng Liang, Xie Chen, Shuai Wang, Hui Zhang, Kai Yu:
UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding. AAAI 2024: 17924-17932
- [c44]Jianwei Yu, Hangting Chen, Yanyao Bian, Xiang Li, Yi Luo, Jinchuan Tian, Mengyang Liu, Jiayi Jiang, Shuai Wang:
AutoPrep: An Automatic Preprocessing Framework for In-The-Wild Speech Data. ICASSP 2024: 1136-1140
- [c43]Sho Inoue, Kun Zhou, Shuai Wang, Haizhou Li:
Hierarchical Emotion Prediction and Control in Text-to-Speech Synthesis. ICASSP 2024: 10601-10605
- [c42]Junjie Li, Ruijie Tao, Zexu Pan, Meng Ge, Shuai Wang, Haizhou Li:
Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-Talker Speech. ICASSP 2024: 10666-10670
- [c41]Shuai Wang, Qibing Bai, Qi Liu, Jianwei Yu, Zhengyang Chen, Bing Han, Yanmin Qian, Haizhou Li:
Leveraging in-the-wild Data for Effective Self-supervised Pretraining in Speaker Recognition. ICASSP 2024: 10901-10905
- [c40]Ziqian Ning, Yuepeng Jiang, Pengcheng Zhu, Shuai Wang, Jixun Yao, Lei Xie, Mengxiao Bi:
Dualvc 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion. ICASSP 2024: 11106-11110
- [c39]Wen Huang, Bing Han, Shuai Wang, Zhengyang Chen, Yanmin Qian:
Robust Cross-Domain Speaker Verification with Multi-Level Domain Adapters. ICASSP 2024: 11781-11785
- [i42]Chenpeng Du, Yiwei Guo, Hankun Wang, Yifan Yang, Zhikang Niu, Shuai Wang, Hui Zhang, Xie Chen, Kai Yu:
VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech. CoRR abs/2401.14321 (2024)
- [i41]Sho Inoue, Kun Zhou, Shuai Wang, Haizhou Li:
Fine-Grained Quantitative Emotion Editing for Speech Generation. CoRR abs/2403.02002 (2024)
- [i40]Yiwei Guo, Chenrun Wang, Yifan Yang, Hankun Wang, Ziyang Ma, Chenpeng Du, Shuai Wang, Hanzheng Li, Shuai Fan, Hui Zhang, Xie Chen, Kai Yu:
The X-LANCE Technical Report for Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge. CoRR abs/2404.06079 (2024)
- [i39]Hankun Wang, Chenpeng Du, Yiwei Guo, Shuai Wang, Xie Chen, Kai Yu:
Attention-Constrained Inference for Robust Decoder-Only Text-to-Speech. CoRR abs/2404.19723 (2024)
- [i38]Sho Inoue, Kun Zhou, Shuai Wang, Haizhou Li:
Hierarchical Emotion Prediction and Control in Text-to-Speech Synthesis. CoRR abs/2405.09171 (2024)
- [i37]Zhijun Liu, Shuai Wang, Sho Inoue, Qibing Bai, Haizhou Li:
Autoregressive Diffusion Transformer for Text-to-Speech Synthesis. CoRR abs/2406.05551 (2024)
- [i36]Bohan Li, Feiyu Shen, Yiwei Guo, Shuai Wang, Xie Chen, Kai Yu:
On the Effectiveness of Acoustic BPE in Decoder-Only TTS. CoRR abs/2407.03892 (2024)
- [i35]Shuai Wang, Zhengyang Chen, Kong Aik Lee, Yanmin Qian, Haizhou Li:
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning. CoRR abs/2407.15188 (2024)
- [i34]Ziqian Ning, Shuai Wang, Yuepeng Jiang, Jixun Yao, Lei He, Shifeng Pan, Jie Ding, Lei Xie:
Drop the beat! Freestyler for Accompaniment Conditioned Rapping Voice Generation. CoRR abs/2408.15474 (2024)
- [i33]Yiyang Zhao, Shuai Wang, Guangzhi Sun, Zehua Chen, Chao Zhang, Mingxing Xu, Thomas Fang Zheng:
Whisper-PMFA: Partial Multi-Scale Feature Aggregation for Speaker Verification using Whisper Models. CoRR abs/2408.15585 (2024)
- [i32]Yiwei Guo, Zhihan Li, Junjie Li, Chenpeng Du, Hankun Wang, Shuai Wang, Xie Chen, Kai Yu:
vec2wav 2.0: Advancing Voice Conversion via Discrete Token Vocoders. CoRR abs/2409.01995 (2024)
- [i31]Zhengyang Chen, Bing Han, Shuai Wang, Yidi Jiang, Yanmin Qian:
Flow-TSVAD: Target-Speaker Voice Activity Detection via Latent Flow Matching. CoRR abs/2409.04859 (2024)
- [i30]Zhengyang Chen, Shuai Wang, Mingyang Zhang, Xuechen Liu, Junichi Yamagishi, Yanmin Qian:
Disentangling the Prosody and Semantic Information with Pre-trained Model for In-Context Learning based Zero-Shot Voice Conversion. CoRR abs/2409.05004 (2024)
- [i29]Zhijun Liu, Shuai Wang, Pengcheng Zhu, Mengxiao Bi, Haizhou Li:
E1 TTS: Simple and Fast Non-Autoregressive TTS. CoRR abs/2409.09351 (2024)
- [i28]Sho Inoue, Shuai Wang, Wanxing Wang, Pengcheng Zhu, Mengxiao Bi, Haizhou Li:
MacST: Multi-Accent Speech Synthesis via Text Transliteration for Accent Conversion. CoRR abs/2409.09352 (2024)
- [i27]Junjie Li, Ke Zhang, Shuai Wang, Haizhou Li, Man-Wai Mak, Kong Aik Lee:
On the effectiveness of enrollment speech augmentation for Target Speaker Extraction. CoRR abs/2409.09589 (2024)
- [i26]Shuai Wang, Pengcheng Zhu, Haizhou Li:
M-Vec: Matryoshka Speaker Embeddings with Flexible Dimensions. CoRR abs/2409.15782 (2024)
- [i25]Shuai Wang, Ke Zhang, Shaoxiong Lin, Junjie Li, Xuefei Wang, Meng Ge, Jianwei Yu, Yanmin Qian, Haizhou Li:
WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction. CoRR abs/2409.15799 (2024)
- [i24]Ke Zhang, Junjie Li, Shuai Wang, Yangjie Wei, Yi Wang, Yannan Wang, Haizhou Li:
Multi-Level Speaker Representation for Target Speaker Extraction. CoRR abs/2410.16059 (2024)
- [i23]Wen Huang, Bing Han, Zhengyang Chen, Shuai Wang, Yanmin Qian:
Prototype and Instance Contrastive Learning for Unsupervised Domain Adaptation in Speaker Verification. CoRR abs/2410.17033 (2024)
- [i22]Kangxiang Xia, Dake Guo, Jixun Yao, Liumeng Xue, Hanzhao Li, Shuai Wang, Zhao Guo, Lei Xie, Qingqing Zhang, Lei Luo, Minghui Dong, Peng Sun:
The ISCSLP 2024 Conversational Voice Clone (CoVoC) Challenge: Tasks, Results and Findings. CoRR abs/2411.00064 (2024)
- [i21]Wupeng Wang, Zexu Pan, Xinke Li, Shuai Wang, Haizhou Li:
Speech Separation with Pretrained Frontend to Minimize Domain Mismatch. CoRR abs/2411.03085 (2024)
- 2023
- [c38]Hongji Wang, Chengdong Liang, Shuai Wang, Zhengyang Chen, Binbin Zhang, Xu Xiang, Yanlei Deng, Yanmin Qian:
Wespeaker: A Research and Production Oriented Speaker Embedding Learning Toolkit. ICASSP 2023: 1-5
- [c37]Xintao Zhao, Shuai Wang, Yang Chao, Zhiyong Wu, Helen Meng:
Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation-based Voice Conversion. ICME 2023: 1691-1696
- [c36]Ziqian Ning, Yuepeng Jiang, Pengcheng Zhu, Jixun Yao, Shuai Wang, Lei Xie, Mengxiao Bi:
DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding. INTERSPEECH 2023: 2063-2067
- [c35]Zhengyang Chen, Bing Han, Shuai Wang, Yanmin Qian:
Attention-based Encoder-Decoder Network for End-to-End Neural Speaker Diarization with Target Speaker Attractor. INTERSPEECH 2023: 3552-3556
- [i20]Xintao Zhao, Shuai Wang, Yang Chao, Zhiyong Wu, Helen Meng:
Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion. CoRR abs/2305.09167 (2023)
- [i19]Zhengyang Chen, Bing Han, Shuai Wang, Yanmin Qian:
Attention-based Encoder-Decoder Network for End-to-End Neural Speaker Diarization with Target Speaker Attractor. CoRR abs/2305.10704 (2023)
- [i18]Ziqian Ning, Yuepeng Jiang, Pengcheng Zhu, Jixun Yao, Shuai Wang, Lei Xie, Mengxiao Bi:
DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding. CoRR abs/2305.12425 (2023)
- [i17]Chenpeng Du, Yiwei Guo, Feiyu Shen, Zhijun Liu, Zheng Liang, Xie Chen, Shuai Wang, Hui Zhang, Kai Yu:
UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding. CoRR abs/2306.07547 (2023)
- [i16]Shuai Wang, Chengdong Liang, Xu Xiang, Bing Han, Zhengyang Chen, Hongji Wang, Wen Ding:
Wespeaker baselines for VoxSRC2023. CoRR abs/2306.15161 (2023)
- [i15]Zhengyang Chen, Bing Han, Shuai Wang, Yanmin Qian:
Attention-based Encoder-Decoder End-to-End Neural Diarization with Embedding Enhancer. CoRR abs/2309.06672 (2023)
- [i14]Junjie Li, Ruijie Tao, Zexu Pan, Meng Ge, Shuai Wang, Haizhou Li:
Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-talker Speech. CoRR abs/2309.08408 (2023)
- [i13]Junyi Ao, Mehmet Sinan Yildirim, Meng Ge, Shuai Wang, Ruijie Tao, Yanmin Qian, Liqun Deng, Longshuai Xiao, Haizhou Li:
USED: Universal Speaker Extraction and Diarization. CoRR abs/2309.10674 (2023)
- [i12]Shuai Wang, Qibing Bai, Qi Liu, Jianwei Yu, Zhengyang Chen, Bing Han, Yanmin Qian, Haizhou Li:
Leveraging In-the-Wild Data for Effective Self-Supervised Pretraining in Speaker Recognition. CoRR abs/2309.11730 (2023)
- [i11]Jianwei Yu, Hangting Chen, Yanyao Bian, Xiang Li, Yi Luo, Jinchuan Tian, Mengyang Liu, Jiayi Jiang, Shuai Wang:
AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data. CoRR abs/2309.13905 (2023)
- [i10]Ziqian Ning, Yuepeng Jiang, Pengcheng Zhu, Shuai Wang, Jixun Yao, Lei Xie, Mengxiao Bi:
DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion. CoRR abs/2309.15496 (2023)
- [i9]Meng Ge, Yizhou Peng, Yidi Jiang, Jingru Lin, Junyi Ao, Mehmet Sinan Yildirim, Shuai Wang, Haizhou Li, Mengling Feng:
The NUS-HLT System for ICASSP2024 ICMC-ASR Grand Challenge. CoRR abs/2312.16002 (2023)
- 2022
- [c34]Aiwen Deng, Shuai Wang, Wenxiong Kang, Feiqi Deng:
On the Importance of Different Frequency Bins for Speaker Verification. ICASSP 2022: 7537-7541
- [c33]Bei Liu, Haoyu Wang, Zhengyang Chen, Shuai Wang, Yanmin Qian:
Self-Knowledge Distillation via Feature Enhancement for Speaker Verification. ICASSP 2022: 7542-7546
- [c32]Bei Liu, Zhengyang Chen, Shuai Wang, Haoyu Wang, Bing Han, Yanmin Qian:
DF-ResNet: Boosting Speaker Verification Performance with Depth-First Design. INTERSPEECH 2022: 296-300
- [c31]Jinchao Li, Shuai Wang, Yang Chao, Xunying Liu, Helen Meng:
Context-aware Multimodal Fusion for Emotion Recognition. INTERSPEECH 2022: 2013-2017
- [i8]Hongji Wang, Chengdong Liang, Shuai Wang, Zhengyang Chen, Binbin Zhang, Xu Xiang, Yanlei Deng, Yanmin Qian:
Wespeaker: A Research and Production oriented Speaker Embedding Learning Toolkit. CoRR abs/2210.17016 (2022)
- 2021
- [j8]Yanmin Qian, Zhengyang Chen, Shuai Wang:
Audio-Visual Deep Neural Network for Robust Person Verification. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1079-1092 (2021)
- [j7]Heinrich Dinkel, Shuai Wang, Xuenan Xu, Mengyue Wu, Kai Yu:
Voice Activity Detection in the Wild: A Data-Driven Approach Using Teacher-Student Training. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1542-1555 (2021)
- [c30]Zhengyang Chen, Shuai Wang, Yanmin Qian:
Self-Supervised Learning Based Domain Adaptation for Robust Speaker Verification. ICASSP 2021: 5834-5838
- [c29]Chenpeng Du, Bing Han, Shuai Wang, Yanmin Qian, Kai Yu:
SynAug: Synthesis-Based Data Augmentation for Text-Dependent Speaker Verification. ICASSP 2021: 5844-5848
- [c28]Houjun Huang, Xu Xiang, Fei Zhao, Shuai Wang, Yanmin Qian:
Unit Selection Synthesis Based Data Augmentation for Fixed Phrase Speaker Verification. ICASSP 2021: 5849-5853
- [c27]Yufei Liu, Chengzhu Yu, Shuai Wang, Zhenchuan Yang, Yang Chao, Weibin Zhang:
Non-Parallel Any-to-Many Voice Conversion by Replacing Speaker Statistics. Interspeech 2021: 1369-1373
- [c26]Xun Gong, Zhengyang Chen, Yexin Yang, Shuai Wang, Lan Wang, Yanmin Qian:
Speaker Embedding Augmentation with Noise Distribution Matching. ISCSLP 2021: 1-5
- [c25]Shuai Wang, Yexin Yang, Yanmin Qian, Kai Yu:
Revisiting the Statistics Pooling Layer in Deep Speaker Embedding Learning. ISCSLP 2021: 1-5
- [i7]Houjun Huang, Xu Xiang, Fei Zhao, Shuai Wang, Yanmin Qian:
Unit selection synthesis based data augmentation for fixed phrase speaker verification. CoRR abs/2102.09817 (2021)
- [i6]Heinrich Dinkel, Shuai Wang, Xuenan Xu, Mengyue Wu, Kai Yu:
Voice activity detection in the wild: A data-driven approach using teacher-student training. CoRR abs/2105.04065 (2021)
- [i5]Zhengyang Chen, Shuai Wang, Yanmin Qian:
Self-Supervised Learning Based Domain Adaptation for Robust Speaker Verification. CoRR abs/2108.13843 (2021)
- 2020
- [j6]Shuai Wang, Yexin Yang, Zhanghao Wu, Yanmin Qian, Kai Yu:
Data Augmentation Using Deep Generative Models for Embedding Based Speaker Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2598-2609 (2020)
- [c24]Yexin Yang, Shuai Wang, Xun Gong, Yanmin Qian, Kai Yu:
Text Adaptation for Speaker Verification with Speaker-Text Factorized Embeddings. ICASSP 2020: 6454-6458
- [c23]Mireia Díez, Lukás Burget, Federico Landini, Shuai Wang, Honza Cernocký:
Optimizing Bayesian Hmm Based X-Vector Clustering for the Second Dihard Speech Diarization Challenge. ICASSP 2020: 6519-6523
- [c22]Federico Landini, Shuai Wang, Mireia Díez, Lukás Burget, Pavel Matejka, Katerina Zmolíková, Ladislav Mosner, Anna Silnova, Oldrich Plchot, Ondrej Novotný, Hossein Zeinali, Johan Rohdin:
But System for the Second Dihard Speech Diarization Challenge. ICASSP 2020: 6529-6533
- [c21]Zhengyang Chen, Shuai Wang, Yanmin Qian, Kai Yu:
Channel Invariant Speaker Embedding Learning with Joint Multi-Task and Adversarial Training. ICASSP 2020: 6574-6578
- [c20]Shuai Wang, Johan Rohdin, Oldrich Plchot, Lukás Burget, Kai Yu, Jan Cernocký:
Investigation of Specaugment for Deep Speaker Embedding Learning. ICASSP 2020: 7139-7143
- [c19]Hongji Wang, Heinrich Dinkel, Shuai Wang, Yanmin Qian, Kai Yu:
Dual-Adversarial Domain Adaptation for Generalized Replay Attack Detection. INTERSPEECH 2020: 1086-1090
- [c18]Zhengyang Chen, Shuai Wang, Yanmin Qian:
Multi-Modality Matters: A Performance Leap on VoxCeleb. INTERSPEECH 2020: 2252-2256
- [c17]Zhengyang Chen, Shuai Wang, Yanmin Qian:
Adversarial Domain Adaptation for Speaker Verification Using Partially Shared Network. INTERSPEECH 2020: 3017-3021
- [c16]Jahangir Alam, Gilles Boulianne, Lukás Burget, Mohamed Dahmane, Mireia Díez Sánchez, Alicia Lozano-Diez, Ondrej Glembek, Pierre-Luc St-Charles, Marc Lalonde, Pavel Matejka, Petr Mizera, João Monteiro, Ladislav Mosner, Cedric Noiseux, Ondrej Novotný, Oldrich Plchot, Johan Rohdin, Anna Silnova, Josef Slavícek, Themos Stafylakis, Shuai Wang, Hossein Zeinali:
Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge. Odyssey 2020: 289-295
- [i4]Yefei Chen, Shuai Wang, Yanmin Qian, Kai Yu:
End-to-End Speaker-Dependent Voice Activity Detection. CoRR abs/2009.09906 (2020)
2010 – 2019
- 2019
- [j5]Yanmin Qian, Chao Weng, Xuankai Chang, Shuai Wang, Dong Yu:
Erratum to: Past review, current progress, and challenges ahead on the cocktail party problem. Frontiers Inf. Technol. Electron. Eng. 20(3): 438 (2019)
- [j4]Shuai Wang, Zili Huang, Yanmin Qian, Kai Yu:
Discriminative Neural Embedding Learning for Short-Duration Text-Independent Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 27(11): 1686-1696 (2019)
- [c15]Xu Xiang, Shuai Wang, Houjun Huang, Yanmin Qian, Kai Yu:
Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition. APSIPA 2019: 1652-1656
- [c14]Shuai Wang, Yexin Yang, Tianzhe Wang, Yanmin Qian, Kai Yu:
Knowledge Distillation for Small Foot-print Deep Speaker Embedding. ICASSP 2019: 6021-6025
- [c13]Mireia Díez, Lukás Burget, Shuai Wang, Johan Rohdin, Jan Cernocký:
Bayesian HMM Based x-Vector Clustering for Speaker Diarization. INTERSPEECH 2019: 346-350
- [c12]Yexin Yang, Hongji Wang, Heinrich Dinkel, Zhengyang Chen, Shuai Wang, Yanmin Qian, Kai Yu:
The SJTU Robust Anti-Spoofing System for the ASVspoof 2019 Challenge. INTERSPEECH 2019: 1038-1042
- [c11]Shuai Wang, Johan Rohdin, Lukás Burget, Oldrich Plchot, Yanmin Qian, Kai Yu, Jan Cernocký:
On the Usage of Phonetic Information for Text-Independent Speaker Embedding Extraction. INTERSPEECH 2019: 1148-1152
- [c10]Zhanghao Wu, Shuai Wang, Yanmin Qian, Kai Yu:
Data Augmentation Using Variational Autoencoder for Embedding Based Speaker Verification. INTERSPEECH 2019: 1163-1167
- [c9]Hongji Wang, Heinrich Dinkel, Shuai Wang, Yanmin Qian, Kai Yu:
Cross-Domain Replay Spoofing Attack Detection Using Domain Adversarial Training. INTERSPEECH 2019: 2938-2942
- [i3]Xu Xiang, Shuai Wang, Houjun Huang, Yanmin Qian, Kai Yu:
Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition. CoRR abs/1906.07317 (2019)
- [i2]Hossein Zeinali, Shuai Wang, Anna Silnova, Pavel Matejka, Oldrich Plchot:
BUT System Description to VoxCeleb Speaker Recognition Challenge 2019. CoRR abs/1910.12592 (2019)
- 2018
- [j3]Yanmin Qian, Chao Weng, Xuankai Chang, Shuai Wang, Dong Yu:
Past review, current progress, and challenges ahead on the cocktail party problem. Frontiers Inf. Technol. Electron. Eng. 19(1): 40-63 (2018)
- [j2]Yanmin Qian, Chao Weng, Xuankai Chang, Shuai Wang, Dong Yu:
Erratum to: Past review, current progress, and challenges ahead on the cocktail party problem. Frontiers Inf. Technol. Electron. Eng. 19(4): 582 (2018)
- [c8]Zili Huang, Shuai Wang, Yanmin Qian:
Joint I-Vector with End-to-End System for Short Duration Text-Independent Speaker Verification. ICASSP 2018: 4869-4873
- [c7]Shuai Wang, Yanmin Qian, Kai Yu:
Focal Kl-Divergence Based Dilated Convolutional Neural Networks for Co-Channel Speaker Identification. ICASSP 2018: 5339-5343
- [c6]Zili Huang, Shuai Wang, Kai Yu:
Angular Softmax for Short-Duration Text-independent Speaker Verification. INTERSPEECH 2018: 3623-3627
- [c5]Shuai Wang, Heinrich Dinkel, Yanmin Qian, Kai Yu:
Covariance Based Deep Feature for Text-Dependent Speaker Verification. IScIDE 2018: 231-242
- [c4]Shuai Wang, Zili Huang, Yanmin Qian, Kai Yu:
Deep Discriminant Analysis for i-vector Based Robust Speaker Recognition. ISCSLP 2018: 195-199
- [c3]Yexin Yang, Shuai Wang, Man Sun, Yanmin Qian, Kai Yu:
Generative Adversarial Networks based X-vector Augmentation for Robust Probabilistic Linear Discriminant Analysis in Speaker Verification. ISCSLP 2018: 205-209
- [i1]Shuai Wang, Zili Huang, Yanmin Qian, Kai Yu:
Deep Discriminant Analysis for i-vector Based Robust Speaker Recognition. CoRR abs/1805.01344 (2018)
- 2017
- [c2]Xiaowei Jiang, Shuai Wang, Xu Xiang, Yanmin Qian:
Integrating online i-vector into GMM-UBM for text-dependent speaker verification. APSIPA 2017: 1628-1632
- [c1]Shuai Wang, Yanmin Qian, Kai Yu:
What Does the Speaker Embedding Encode? INTERSPEECH 2017: 1497-1501
- 2012
- [j1]Yizhong Zhang, Huamin Wang, Shuai Wang, Yiying Tong, Kun Zhou:
A Deformable Surface Model for Real-Time Water Drop Animation. IEEE Trans. Vis. Comput. Graph. 18(8): 1281-1289 (2012)
last updated on 2025-01-13 02:00 CET by the dblp team
all metadata released as open data under CC0 1.0 license