Shuai Wang 0016

Person information

affiliation: Chinese University of Hong Kong-Shenzhen (CUHK-SZ), Shenzhen Research Institute of Big Data, Shenzhen, China
affiliation (PhD 2020): Shanghai Jiao Tong University, Department of Computer Science and Engineering, China

Other persons with the same name


Shuai Wang — disambiguation page
Shuai Wang 0001 — Simula Research Laboratory, Oslo, Norway (and 1 more)
Shuai Wang 0002 — Chinese Academy of Sciences, Academy of Mathematics and Systems Science, NCMIS, Beijing, China
Shuai Wang 0003 — Hangzhou Dianzi University, School of Cyberspace, Lishui Institute, China (and 3 more)
Shuai Wang 0004 — Chinese Academy of Sciences, Shenzhen Institute of Advanced Technology, China (and 2 more)
Shuai Wang 0005 — Chinese Academy of Sciences, Institute of Automation, State Key Laboratory of Management and Control for Complex Systems, Beijing, China
Shuai Wang 0006 — Nanjing University, Department of Computer Science and Technology, China (and 1 more)
Shuai Wang 0007 — Tencent Robotics X, Shenzhen, China (and 1 more)
Shuai Wang 0008 — Southeast University, School of Computer Science and Engineering, Nanjing, China (and 1 more)
Shuai Wang 0009 — Sun Yat-sen University, Shenzhen, China (and 1 more)

Shuai Wang 0010 — Ryerson University, Department of Electrical and Computer Engineering, Toronto, ON, Canada (and 1 more)
Shuai Wang 0011 — Hong Kong University of Science and Technology, Hong Kong (and 2 more)
Shuai Wang 0012 — Hong Kong Polytechnic University, Department of Computing, Hong Kong
Shuai Wang 0013 — Beijing Institute of Technology, School of Information and Electronics, China
Shuai Wang 0014 — Vrije Universiteit Amsterdam, The Netherlands (and 2 more)
Shuai Wang 0015 — Wuhan University, School of Resource and Environmental Science, China
Shuai Wang 0017 — Tianjin University, Tianjin Key Laboratory of Imaging and Sensing Microelectronic Technology, China
Shuai Wang 0018 — University of Science and Technology of China, Department of Automation, Hefei, China
Shuai Wang 0019 — Xidian University, State Key Laboratory of Integrated Service Networks, Xian, China
Shuai Wang 0020 — University of Illinois at Chicago, Department of Computer Science, USA
Shuai Wang 0021 — George Mason University, Department of Computer Science, Fairfax, VA, USA (and 1 more)
Shuai Wang 0022 — SRI International, Center for Technology in Learning, Menlo Park, CA, USA (and 1 more)
Shuai Wang 0023 — Changchun University of Science and Technology, School of Computer Science and Technology, China (and 1 more)
Shuai Wang 0024 — Shenyang Agricultural University, College of Land and Environment, China (and 2 more)
Shuai Wang 0025 — Changchun University of Science and Technology, School of Science, China
Shuai Wang 0026 — Chinese Academy of Sciences, Aerospace Information Research Institute, Beijing, China (and 1 more)
Shuai Wang 0027 — Beihang University, School of Computer Science and Engineering, Beijing, China (and 1 more)
Shuai Wang 0028 — Tsinghua University, Department of Computer Science and Technology, Beijing, China
Shuai Wang 0029 — Boston University, Division of Systems Engineering, Boston, MA, USA
Shuai Wang 0030 — JOYY Inc, Beijing, China (and 2 more)
Shuai Wang 0031 — China Three Gorges University, Key Laboratory of Intelligent Vision Based Monitoring for Hydroelectric Engineering, Yichang, China (and 1 more)
Shuai Wang 0032 — University of Queensland, QLD, Australia
Shuai Wang 0033 — Singapore University of Technology and Design, Information Systems Technology and Design Pillar, Tampines, Singapore (and 2 more)
Shuai Wang 0034 — Yuncheng University, Shanxi Province Optoelectronic Information Science and Technology Laboratory, China (and 1 more)

2020 – today

2024
[j12]
  - https://dblp.org/rec/journals/speech/WangCHWLZXDRSQL24
Shuai Wang, Zhengyang Chen, Bing Han, Hongji Wang, Chengdong Liang, Binbin Zhang, Xu Xiang, Wen Ding, Johan Rohdin, Anna Silnova, Yanmin Qian, Haizhou Li:
Advancing speaker embedding learning: Wespeaker toolkit for research and production. Speech Commun. 162: 103104 (2024)
[j11]
  - https://dblp.org/rec/journals/taslp/ChenHWQ24
Zhengyang Chen, Bing Han, Shuai Wang, Yanmin Qian:
Attention-Based Encoder-Decoder End-to-End Neural Diarization With Embedding Enhancer. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1636-1649 (2024)
[j10]
  - https://dblp.org/rec/journals/taslp/WangPLWL24
Wupeng Wang, Zexu Pan, Xinke Li, Shuai Wang, Haizhou Li:
Speech Separation With Pretrained Frontend to Minimize Domain Mismatch. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4184-4198 (2024)
[j9]
  - https://dblp.org/rec/journals/taslp/WangCLQL24
Shuai Wang, Zhengyang Chen, Kong Aik Lee, Yanmin Qian, Haizhou Li:
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4971-4998 (2024)
[c45]
  - https://dblp.org/rec/conf/aaai/DuGSLLCWZ024
Chenpeng Du, Yiwei Guo, Feiyu Shen, Zhijun Liu, Zheng Liang, Xie Chen, Shuai Wang, Hui Zhang, Kai Yu:
UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding. AAAI 2024: 17924-17932
[c44]
  - https://dblp.org/rec/conf/icassp/YuCBLLTLJW24
Jianwei Yu, Hangting Chen, Yanyao Bian, Xiang Li, Yi Luo, Jinchuan Tian, Mengyang Liu, Jiayi Jiang, Shuai Wang:
AutoPrep: An Automatic Preprocessing Framework for In-The-Wild Speech Data. ICASSP 2024: 1136-1140
[c43]
  - https://dblp.org/rec/conf/icassp/Inoue0W024
Sho Inoue, Kun Zhou, Shuai Wang, Haizhou Li:
Hierarchical Emotion Prediction and Control in Text-to-Speech Synthesis. ICASSP 2024: 10601-10605
[c42]
  - https://dblp.org/rec/conf/icassp/LiTPGW024
Junjie Li, Ruijie Tao, Zexu Pan, Meng Ge, Shuai Wang, Haizhou Li:
Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-Talker Speech. ICASSP 2024: 10666-10670
[c41]
  - https://dblp.org/rec/conf/icassp/WangBLYCHQ024
Shuai Wang, Qibing Bai, Qi Liu, Jianwei Yu, Zhengyang Chen, Bing Han, Yanmin Qian, Haizhou Li:
Leveraging in-the-wild Data for Effective Self-supervised Pretraining in Speaker Recognition. ICASSP 2024: 10901-10905
[c40]
  - https://dblp.org/rec/conf/icassp/NingJ0WY0B24
Ziqian Ning, Yuepeng Jiang, Pengcheng Zhu, Shuai Wang, Jixun Yao, Lei Xie, Mengxiao Bi:
DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion. ICASSP 2024: 11106-11110
[c39]
  - https://dblp.org/rec/conf/icassp/HuangHWCQ24
Wen Huang, Bing Han, Shuai Wang, Zhengyang Chen, Yanmin Qian:
Robust Cross-Domain Speaker Verification with Multi-Level Domain Adapters. ICASSP 2024: 11781-11785
[i42]
  - https://dblp.org/rec/journals/corr/abs-2401-14321
Chenpeng Du, Yiwei Guo, Hankun Wang, Yifan Yang, Zhikang Niu, Shuai Wang, Hui Zhang, Xie Chen, Kai Yu:
VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech. CoRR abs/2401.14321 (2024)
[i41]
  - https://dblp.org/rec/journals/corr/abs-2403-02002
Sho Inoue, Kun Zhou, Shuai Wang, Haizhou Li:
Fine-Grained Quantitative Emotion Editing for Speech Generation. CoRR abs/2403.02002 (2024)
[i40]
  - https://dblp.org/rec/journals/corr/abs-2404-06079
Yiwei Guo, Chenrun Wang, Yifan Yang, Hankun Wang, Ziyang Ma, Chenpeng Du, Shuai Wang, Hanzheng Li, Shuai Fan, Hui Zhang, Xie Chen, Kai Yu:
The X-LANCE Technical Report for Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge. CoRR abs/2404.06079 (2024)
[i39]
  - https://dblp.org/rec/journals/corr/abs-2404-19723
Hankun Wang, Chenpeng Du, Yiwei Guo, Shuai Wang, Xie Chen, Kai Yu:
Attention-Constrained Inference for Robust Decoder-Only Text-to-Speech. CoRR abs/2404.19723 (2024)
[i38]
  - https://dblp.org/rec/journals/corr/abs-2405-09171
Sho Inoue, Kun Zhou, Shuai Wang, Haizhou Li:
Hierarchical Emotion Prediction and Control in Text-to-Speech Synthesis. CoRR abs/2405.09171 (2024)
[i37]
  - https://dblp.org/rec/journals/corr/abs-2406-05551
Zhijun Liu, Shuai Wang, Sho Inoue, Qibing Bai, Haizhou Li:
Autoregressive Diffusion Transformer for Text-to-Speech Synthesis. CoRR abs/2406.05551 (2024)
[i36]
  - https://dblp.org/rec/journals/corr/abs-2407-03892
Bohan Li, Feiyu Shen, Yiwei Guo, Shuai Wang, Xie Chen, Kai Yu:
On the Effectiveness of Acoustic BPE in Decoder-Only TTS. CoRR abs/2407.03892 (2024)
[i35]
  - https://dblp.org/rec/journals/corr/abs-2407-15188
Shuai Wang, Zhengyang Chen, Kong Aik Lee, Yanmin Qian, Haizhou Li:
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning. CoRR abs/2407.15188 (2024)
[i34]
  - https://dblp.org/rec/journals/corr/abs-2408-15474
Ziqian Ning, Shuai Wang, Yuepeng Jiang, Jixun Yao, Lei He, Shifeng Pan, Jie Ding, Lei Xie:
Drop the beat! Freestyler for Accompaniment Conditioned Rapping Voice Generation. CoRR abs/2408.15474 (2024)
[i33]
  - https://dblp.org/rec/journals/corr/abs-2408-15585
Yiyang Zhao, Shuai Wang, Guangzhi Sun, Zehua Chen, Chao Zhang, Mingxing Xu, Thomas Fang Zheng:
Whisper-PMFA: Partial Multi-Scale Feature Aggregation for Speaker Verification using Whisper Models. CoRR abs/2408.15585 (2024)
[i32]
  - https://dblp.org/rec/journals/corr/abs-2409-01995
Yiwei Guo, Zhihan Li, Junjie Li, Chenpeng Du, Hankun Wang, Shuai Wang, Xie Chen, Kai Yu:
vec2wav 2.0: Advancing Voice Conversion via Discrete Token Vocoders. CoRR abs/2409.01995 (2024)
[i31]
  - https://dblp.org/rec/journals/corr/abs-2409-04859
Zhengyang Chen, Bing Han, Shuai Wang, Yidi Jiang, Yanmin Qian:
Flow-TSVAD: Target-Speaker Voice Activity Detection via Latent Flow Matching. CoRR abs/2409.04859 (2024)
[i30]
  - https://dblp.org/rec/journals/corr/abs-2409-05004
Zhengyang Chen, Shuai Wang, Mingyang Zhang, Xuechen Liu, Junichi Yamagishi, Yanmin Qian:
Disentangling the Prosody and Semantic Information with Pre-trained Model for In-Context Learning based Zero-Shot Voice Conversion. CoRR abs/2409.05004 (2024)
[i29]
  - https://dblp.org/rec/journals/corr/abs-2409-09351
Zhijun Liu, Shuai Wang, Pengcheng Zhu, Mengxiao Bi, Haizhou Li:
E1 TTS: Simple and Fast Non-Autoregressive TTS. CoRR abs/2409.09351 (2024)
[i28]
  - https://dblp.org/rec/journals/corr/abs-2409-09352
Sho Inoue, Shuai Wang, Wanxing Wang, Pengcheng Zhu, Mengxiao Bi, Haizhou Li:
MacST: Multi-Accent Speech Synthesis via Text Transliteration for Accent Conversion. CoRR abs/2409.09352 (2024)
[i27]
  - https://dblp.org/rec/journals/corr/abs-2409-09589
Junjie Li, Ke Zhang, Shuai Wang, Haizhou Li, Man-Wai Mak, Kong Aik Lee:
On the effectiveness of enrollment speech augmentation for Target Speaker Extraction. CoRR abs/2409.09589 (2024)
[i26]
  - https://dblp.org/rec/journals/corr/abs-2409-15782
Shuai Wang, Pengcheng Zhu, Haizhou Li:
M-Vec: Matryoshka Speaker Embeddings with Flexible Dimensions. CoRR abs/2409.15782 (2024)
[i25]
  - https://dblp.org/rec/journals/corr/abs-2409-15799
Shuai Wang, Ke Zhang, Shaoxiong Lin, Junjie Li, Xuefei Wang, Meng Ge, Jianwei Yu, Yanmin Qian, Haizhou Li:
WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction. CoRR abs/2409.15799 (2024)
[i24]
  - https://dblp.org/rec/journals/corr/abs-2410-16059
Ke Zhang, Junjie Li, Shuai Wang, Yangjie Wei, Yi Wang, Yannan Wang, Haizhou Li:
Multi-Level Speaker Representation for Target Speaker Extraction. CoRR abs/2410.16059 (2024)
[i23]
  - https://dblp.org/rec/journals/corr/abs-2410-17033
Wen Huang, Bing Han, Zhengyang Chen, Shuai Wang, Yanmin Qian:
Prototype and Instance Contrastive Learning for Unsupervised Domain Adaptation in Speaker Verification. CoRR abs/2410.17033 (2024)
[i22]
  - https://dblp.org/rec/journals/corr/abs-2411-00064
Kangxiang Xia, Dake Guo, Jixun Yao, Liumeng Xue, Hanzhao Li, Shuai Wang, Zhao Guo, Lei Xie, Qingqing Zhang, Lei Luo, Minghui Dong, Peng Sun:
The ISCSLP 2024 Conversational Voice Clone (CoVoC) Challenge: Tasks, Results and Findings. CoRR abs/2411.00064 (2024)
[i21]
  - https://dblp.org/rec/journals/corr/abs-2411-03085
Wupeng Wang, Zexu Pan, Xinke Li, Shuai Wang, Haizhou Li:
Speech Separation with Pretrained Frontend to Minimize Domain Mismatch. CoRR abs/2411.03085 (2024)
2023
[c38]
  - https://dblp.org/rec/conf/icassp/WangLWCZXDQ23
Hongji Wang, Chengdong Liang, Shuai Wang, Zhengyang Chen, Binbin Zhang, Xu Xiang, Yanlei Deng, Yanmin Qian:
Wespeaker: A Research and Production Oriented Speaker Embedding Learning Toolkit. ICASSP 2023: 1-5
[c37]
  - https://dblp.org/rec/conf/icmcs/ZhaoWCWM23
Xintao Zhao, Shuai Wang, Yang Chao, Zhiyong Wu, Helen Meng:
Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation-based Voice Conversion. ICME 2023: 1691-1696
[c36]
  - https://dblp.org/rec/conf/interspeech/NingJ0YW0B23
Ziqian Ning, Yuepeng Jiang, Pengcheng Zhu, Jixun Yao, Shuai Wang, Lei Xie, Mengxiao Bi:
DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding. INTERSPEECH 2023: 2063-2067
[c35]
  - https://dblp.org/rec/conf/interspeech/ChenHWQ23
Zhengyang Chen, Bing Han, Shuai Wang, Yanmin Qian:
Attention-based Encoder-Decoder Network for End-to-End Neural Speaker Diarization with Target Speaker Attractor. INTERSPEECH 2023: 3552-3556
[i20]
  - https://dblp.org/rec/journals/corr/abs-2305-09167
Xintao Zhao, Shuai Wang, Yang Chao, Zhiyong Wu, Helen Meng:
Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion. CoRR abs/2305.09167 (2023)
[i19]
  - https://dblp.org/rec/journals/corr/abs-2305-10704
Zhengyang Chen, Bing Han, Shuai Wang, Yanmin Qian:
Attention-based Encoder-Decoder Network for End-to-End Neural Speaker Diarization with Target Speaker Attractor. CoRR abs/2305.10704 (2023)
[i18]
  - https://dblp.org/rec/journals/corr/abs-2305-12425
Ziqian Ning, Yuepeng Jiang, Pengcheng Zhu, Jixun Yao, Shuai Wang, Lei Xie, Mengxiao Bi:
DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding. CoRR abs/2305.12425 (2023)
[i17]
  - https://dblp.org/rec/journals/corr/abs-2306-07547
Chenpeng Du, Yiwei Guo, Feiyu Shen, Zhijun Liu, Zheng Liang, Xie Chen, Shuai Wang, Hui Zhang, Kai Yu:
UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding. CoRR abs/2306.07547 (2023)
[i16]
  - https://dblp.org/rec/journals/corr/abs-2306-15161
Shuai Wang, Chengdong Liang, Xu Xiang, Bing Han, Zhengyang Chen, Hongji Wang, Wen Ding:
Wespeaker baselines for VoxSRC2023. CoRR abs/2306.15161 (2023)
[i15]
  - https://dblp.org/rec/journals/corr/abs-2309-06672
Zhengyang Chen, Bing Han, Shuai Wang, Yanmin Qian:
Attention-based Encoder-Decoder End-to-End Neural Diarization with Embedding Enhancer. CoRR abs/2309.06672 (2023)
[i14]
  - https://dblp.org/rec/journals/corr/abs-2309-08408
Junjie Li, Ruijie Tao, Zexu Pan, Meng Ge, Shuai Wang, Haizhou Li:
Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-talker Speech. CoRR abs/2309.08408 (2023)
[i13]
  - https://dblp.org/rec/journals/corr/abs-2309-10674
Junyi Ao, Mehmet Sinan Yildirim, Meng Ge, Shuai Wang, Ruijie Tao, Yanmin Qian, Liqun Deng, Longshuai Xiao, Haizhou Li:
USED: Universal Speaker Extraction and Diarization. CoRR abs/2309.10674 (2023)
[i12]
  - https://dblp.org/rec/journals/corr/abs-2309-11730
Shuai Wang, Qibing Bai, Qi Liu, Jianwei Yu, Zhengyang Chen, Bing Han, Yanmin Qian, Haizhou Li:
Leveraging In-the-Wild Data for Effective Self-Supervised Pretraining in Speaker Recognition. CoRR abs/2309.11730 (2023)
[i11]
  - https://dblp.org/rec/journals/corr/abs-2309-13905
Jianwei Yu, Hangting Chen, Yanyao Bian, Xiang Li, Yi Luo, Jinchuan Tian, Mengyang Liu, Jiayi Jiang, Shuai Wang:
AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data. CoRR abs/2309.13905 (2023)
[i10]
  - https://dblp.org/rec/journals/corr/abs-2309-15496
Ziqian Ning, Yuepeng Jiang, Pengcheng Zhu, Shuai Wang, Jixun Yao, Lei Xie, Mengxiao Bi:
DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion. CoRR abs/2309.15496 (2023)
[i9]
  - https://dblp.org/rec/journals/corr/abs-2312-16002
Meng Ge, Yizhou Peng, Yidi Jiang, Jingru Lin, Junyi Ao, Mehmet Sinan Yildirim, Shuai Wang, Haizhou Li, Mengling Feng:
The NUS-HLT System for ICASSP2024 ICMC-ASR Grand Challenge. CoRR abs/2312.16002 (2023)
2022
[c34]
  - https://dblp.org/rec/conf/icassp/DengWKD22
Aiwen Deng, Shuai Wang, Wenxiong Kang, Feiqi Deng:
On the Importance of Different Frequency Bins for Speaker Verification. ICASSP 2022: 7537-7541
[c33]
  - https://dblp.org/rec/conf/icassp/LiuWCWQ22
Bei Liu, Haoyu Wang, Zhengyang Chen, Shuai Wang, Yanmin Qian:
Self-Knowledge Distillation via Feature Enhancement for Speaker Verification. ICASSP 2022: 7542-7546
[c32]
  - https://dblp.org/rec/conf/interspeech/LiuCWWHQ22
Bei Liu, Zhengyang Chen, Shuai Wang, Haoyu Wang, Bing Han, Yanmin Qian:
DF-ResNet: Boosting Speaker Verification Performance with Depth-First Design. INTERSPEECH 2022: 296-300
[c31]
  - https://dblp.org/rec/conf/interspeech/LiWCLM22
Jinchao Li, Shuai Wang, Yang Chao, Xunying Liu, Helen Meng:
Context-aware Multimodal Fusion for Emotion Recognition. INTERSPEECH 2022: 2013-2017
[i8]
  - https://dblp.org/rec/journals/corr/abs-2210-17016
Hongji Wang, Chengdong Liang, Shuai Wang, Zhengyang Chen, Binbin Zhang, Xu Xiang, Yanlei Deng, Yanmin Qian:
Wespeaker: A Research and Production oriented Speaker Embedding Learning Toolkit. CoRR abs/2210.17016 (2022)
2021
[j8]
  - https://dblp.org/rec/journals/taslp/QianCW21
Yanmin Qian, Zhengyang Chen, Shuai Wang:
Audio-Visual Deep Neural Network for Robust Person Verification. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1079-1092 (2021)
[j7]
  - https://dblp.org/rec/journals/taslp/DinkelWXWY21
Heinrich Dinkel, Shuai Wang, Xuenan Xu, Mengyue Wu, Kai Yu:
Voice Activity Detection in the Wild: A Data-Driven Approach Using Teacher-Student Training. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1542-1555 (2021)
[c30]
  - https://dblp.org/rec/conf/icassp/ChenWQ21
Zhengyang Chen, Shuai Wang, Yanmin Qian:
Self-Supervised Learning Based Domain Adaptation for Robust Speaker Verification. ICASSP 2021: 5834-5838
[c29]
  - https://dblp.org/rec/conf/icassp/DuHWQ021
Chenpeng Du, Bing Han, Shuai Wang, Yanmin Qian, Kai Yu:
SynAug: Synthesis-Based Data Augmentation for Text-Dependent Speaker Verification. ICASSP 2021: 5844-5848
[c28]
  - https://dblp.org/rec/conf/icassp/HuangXZWQ21
Houjun Huang, Xu Xiang, Fei Zhao, Shuai Wang, Yanmin Qian:
Unit Selection Synthesis Based Data Augmentation for Fixed Phrase Speaker Verification. ICASSP 2021: 5849-5853
[c27]
  - https://dblp.org/rec/conf/interspeech/LiuYSYCZ21
Yufei Liu, Chengzhu Yu, Shuai Wang, Zhenchuan Yang, Yang Chao, Weibin Zhang:
Non-Parallel Any-to-Many Voice Conversion by Replacing Speaker Statistics. Interspeech 2021: 1369-1373
[c26]
  - https://dblp.org/rec/conf/iscslp/GongCYWWQ21
Xun Gong, Zhengyang Chen, Yexin Yang, Shuai Wang, Lan Wang, Yanmin Qian:
Speaker Embedding Augmentation with Noise Distribution Matching. ISCSLP 2021: 1-5
[c25]
  - https://dblp.org/rec/conf/iscslp/WangYQ021
Shuai Wang, Yexin Yang, Yanmin Qian, Kai Yu:
Revisiting the Statistics Pooling Layer in Deep Speaker Embedding Learning. ISCSLP 2021: 1-5
[i7]
  - https://dblp.org/rec/journals/corr/abs-2102-09817
Houjun Huang, Xu Xiang, Fei Zhao, Shuai Wang, Yanmin Qian:
Unit selection synthesis based data augmentation for fixed phrase speaker verification. CoRR abs/2102.09817 (2021)
[i6]
  - https://dblp.org/rec/journals/corr/abs-2105-04065
Heinrich Dinkel, Shuai Wang, Xuenan Xu, Mengyue Wu, Kai Yu:
Voice activity detection in the wild: A data-driven approach using teacher-student training. CoRR abs/2105.04065 (2021)
[i5]
  - https://dblp.org/rec/journals/corr/abs-2108-13843
Zhengyang Chen, Shuai Wang, Yanmin Qian:
Self-Supervised Learning Based Domain Adaptation for Robust Speaker Verification. CoRR abs/2108.13843 (2021)
2020
[j6]
  - https://dblp.org/rec/journals/taslp/WangYWQY20
Shuai Wang, Yexin Yang, Zhanghao Wu, Yanmin Qian, Kai Yu:
Data Augmentation Using Deep Generative Models for Embedding Based Speaker Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2598-2609 (2020)
[c24]
  dblp key:
  - conf/icassp/Yang0GQ020
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Yang0GQ020
Yexin Yang, Shuai Wang, Xun Gong, Yanmin Qian, Kai Yu:
Text Adaptation for Speaker Verification with Speaker-Text Factorized Embeddings. ICASSP 2020: 6454-6458
[c23]
  dblp key:
  - conf/icassp/DiezBLWC20
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DiezBLWC20
Mireia Díez, Lukás Burget, Federico Landini, Shuai Wang, Honza Cernocký:
Optimizing Bayesian HMM Based X-Vector Clustering for the Second DIHARD Speech Diarization Challenge. ICASSP 2020: 6519-6523
[c22]
  dblp key:
  - conf/icassp/LandiniWDBMZMSP20
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LandiniWDBMZMSP20
Federico Landini, Shuai Wang, Mireia Díez, Lukás Burget, Pavel Matejka, Katerina Zmolíková, Ladislav Mosner, Anna Silnova, Oldrich Plchot, Ondrej Novotný, Hossein Zeinali, Johan Rohdin:
BUT System for the Second DIHARD Speech Diarization Challenge. ICASSP 2020: 6529-6533
[c21]
  dblp key:
  - conf/icassp/Chen0Q020
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Chen0Q020
Zhengyang Chen, Shuai Wang, Yanmin Qian, Kai Yu:
Channel Invariant Speaker Embedding Learning with Joint Multi-Task and Adversarial Training. ICASSP 2020: 6574-6578
[c20]
  dblp key:
  - conf/icassp/0016RPBYC20
  persistent URL:
  - https://dblp.org/rec/conf/icassp/0016RPBYC20
Shuai Wang, Johan Rohdin, Oldrich Plchot, Lukás Burget, Kai Yu, Jan Cernocký:
Investigation of SpecAugment for Deep Speaker Embedding Learning. ICASSP 2020: 7139-7143
[c19]
  dblp key:
  - conf/interspeech/WangD0Q020
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangD0Q020
Hongji Wang, Heinrich Dinkel, Shuai Wang, Yanmin Qian, Kai Yu:
Dual-Adversarial Domain Adaptation for Generalized Replay Attack Detection. INTERSPEECH 2020: 1086-1090
[c18]
  dblp key:
  - conf/interspeech/ChenWQ20
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenWQ20
Zhengyang Chen, Shuai Wang, Yanmin Qian:
Multi-Modality Matters: A Performance Leap on VoxCeleb. INTERSPEECH 2020: 2252-2256
[c17]
  dblp key:
  - conf/interspeech/ChenWQ20a
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenWQ20a
Zhengyang Chen, Shuai Wang, Yanmin Qian:
Adversarial Domain Adaptation for Speaker Verification Using Partially Shared Network. INTERSPEECH 2020: 3017-3021
[c16]
  dblp key:
  - conf/odyssey/AlamBBDSLGSLMMM20
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/AlamBBDSLGSLMMM20
Jahangir Alam, Gilles Boulianne, Lukás Burget, Mohamed Dahmane, Mireia Díez Sánchez, Alicia Lozano-Diez, Ondrej Glembek, Pierre-Luc St-Charles, Marc Lalonde, Pavel Matejka, Petr Mizera, João Monteiro, Ladislav Mosner, Cedric Noiseux, Ondrej Novotný, Oldrich Plchot, Johan Rohdin, Anna Silnova, Josef Slavícek, Themos Stafylakis, Shuai Wang, Hossein Zeinali:
Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge. Odyssey 2020: 289-295
[i4]
  dblp key:
  - journals/corr/abs-2009-09906
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-09906
Yefei Chen, Shuai Wang, Yanmin Qian, Kai Yu:
End-to-End Speaker-Dependent Voice Activity Detection. CoRR abs/2009.09906 (2020)

2010 – 2019


2019
[j5]
  dblp key:
  - journals/jzusc/QianWCWY19
  persistent URL:
  - https://dblp.org/rec/journals/jzusc/QianWCWY19
Yanmin Qian, Chao Weng, Xuankai Chang, Shuai Wang, Dong Yu:
Erratum to: Past review, current progress, and challenges ahead on the cocktail party problem. Frontiers Inf. Technol. Electron. Eng. 20(3): 438 (2019)
[j4]
  dblp key:
  - journals/taslp/WangHQY19
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WangHQY19
Shuai Wang, Zili Huang, Yanmin Qian, Kai Yu:
Discriminative Neural Embedding Learning for Short-Duration Text-Independent Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 27(11): 1686-1696 (2019)
[c15]
  dblp key:
  - conf/apsipa/XiangWHQ019
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/XiangWHQ019
Xu Xiang, Shuai Wang, Houjun Huang, Yanmin Qian, Kai Yu:
Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition. APSIPA 2019: 1652-1656
[c14]
  dblp key:
  - conf/icassp/WangYWQ019
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangYWQ019
Shuai Wang, Yexin Yang, Tianzhe Wang, Yanmin Qian, Kai Yu:
Knowledge Distillation for Small Foot-print Deep Speaker Embedding. ICASSP 2019: 6021-6025
[c13]
  dblp key:
  - conf/interspeech/DiezBWRC19
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DiezBWRC19
Mireia Díez, Lukás Burget, Shuai Wang, Johan Rohdin, Jan Cernocký:
Bayesian HMM Based x-Vector Clustering for Speaker Diarization. INTERSPEECH 2019: 346-350
[c12]
  dblp key:
  - conf/interspeech/YangWDCWQ019
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YangWDCWQ019
Yexin Yang, Hongji Wang, Heinrich Dinkel, Zhengyang Chen, Shuai Wang, Yanmin Qian, Kai Yu:
The SJTU Robust Anti-Spoofing System for the ASVspoof 2019 Challenge. INTERSPEECH 2019: 1038-1042
[c11]
  dblp key:
  - conf/interspeech/WangRBPQ0C19
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangRBPQ0C19
Shuai Wang, Johan Rohdin, Lukás Burget, Oldrich Plchot, Yanmin Qian, Kai Yu, Jan Cernocký:
On the Usage of Phonetic Information for Text-Independent Speaker Embedding Extraction. INTERSPEECH 2019: 1148-1152
[c10]
  dblp key:
  - conf/interspeech/WuWQ019
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuWQ019
Zhanghao Wu, Shuai Wang, Yanmin Qian, Kai Yu:
Data Augmentation Using Variational Autoencoder for Embedding Based Speaker Verification. INTERSPEECH 2019: 1163-1167
[c9]
  dblp key:
  - conf/interspeech/WangD0Q019
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangD0Q019
Hongji Wang, Heinrich Dinkel, Shuai Wang, Yanmin Qian, Kai Yu:
Cross-Domain Replay Spoofing Attack Detection Using Domain Adversarial Training. INTERSPEECH 2019: 2938-2942
[i3]
  dblp key:
  - journals/corr/abs-1906-07317
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-07317
Xu Xiang, Shuai Wang, Houjun Huang, Yanmin Qian, Kai Yu:
Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition. CoRR abs/1906.07317 (2019)
[i2]
  dblp key:
  - journals/corr/abs-1910-12592
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-12592
Hossein Zeinali, Shuai Wang, Anna Silnova, Pavel Matejka, Oldrich Plchot:
BUT System Description to VoxCeleb Speaker Recognition Challenge 2019. CoRR abs/1910.12592 (2019)
2018
[j3]
  dblp key:
  - journals/jzusc/QianWCWY18
  persistent URL:
  - https://dblp.org/rec/journals/jzusc/QianWCWY18
Yanmin Qian, Chao Weng, Xuankai Chang, Shuai Wang, Dong Yu:
Past review, current progress, and challenges ahead on the cocktail party problem. Frontiers Inf. Technol. Electron. Eng. 19(1): 40-63 (2018)
[j2]
  dblp key:
  - journals/jzusc/QianWCWY18a
  persistent URL:
  - https://dblp.org/rec/journals/jzusc/QianWCWY18a
Yanmin Qian, Chao Weng, Xuankai Chang, Shuai Wang, Dong Yu:
Erratum to: Past review, current progress, and challenges ahead on the cocktail party problem. Frontiers Inf. Technol. Electron. Eng. 19(4): 582 (2018)
[c8]
  dblp key:
  - conf/icassp/HuangWQ18
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuangWQ18
Zili Huang, Shuai Wang, Yanmin Qian:
Joint I-Vector with End-to-End System for Short Duration Text-Independent Speaker Verification. ICASSP 2018: 4869-4873
[c7]
  dblp key:
  - conf/icassp/WangQ018
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangQ018
Shuai Wang, Yanmin Qian, Kai Yu:
Focal KL-Divergence Based Dilated Convolutional Neural Networks for Co-Channel Speaker Identification. ICASSP 2018: 5339-5343
[c6]
  dblp key:
  - conf/interspeech/HuangW018
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuangW018
Zili Huang, Shuai Wang, Kai Yu:
Angular Softmax for Short-Duration Text-independent Speaker Verification. INTERSPEECH 2018: 3623-3627
[c5]
  dblp key:
  - conf/iscide/WangDQ018
  persistent URL:
  - https://dblp.org/rec/conf/iscide/WangDQ018
Shuai Wang, Heinrich Dinkel, Yanmin Qian, Kai Yu:
Covariance Based Deep Feature for Text-Dependent Speaker Verification. IScIDE 2018: 231-242
[c4]
  dblp key:
  - conf/iscslp/WangHQ018
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/WangHQ018
Shuai Wang, Zili Huang, Yanmin Qian, Kai Yu:
Deep Discriminant Analysis for i-vector Based Robust Speaker Recognition. ISCSLP 2018: 195-199
[c3]
  dblp key:
  - conf/iscslp/YangWSQ018
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/YangWSQ018
Yexin Yang, Shuai Wang, Man Sun, Yanmin Qian, Kai Yu:
Generative Adversarial Networks based X-vector Augmentation for Robust Probabilistic Linear Discriminant Analysis in Speaker Verification. ISCSLP 2018: 205-209
[i1]
  dblp key:
  - journals/corr/abs-1805-01344
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-01344
Shuai Wang, Zili Huang, Yanmin Qian, Kai Yu:
Deep Discriminant Analysis for i-vector Based Robust Speaker Recognition. CoRR abs/1805.01344 (2018)
2017
[c2]
  dblp key:
  - conf/apsipa/JiangWXQ17
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/JiangWXQ17
Xiaowei Jiang, Shuai Wang, Xu Xiang, Yanmin Qian:
Integrating online i-vector into GMM-UBM for text-dependent speaker verification. APSIPA 2017: 1628-1632
[c1]
  dblp key:
  - conf/interspeech/WangQ017
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangQ017
Shuai Wang, Yanmin Qian, Kai Yu:
What Does the Speaker Embedding Encode? INTERSPEECH 2017: 1497-1501
2012
[j1]
  dblp key:
  - journals/tvcg/ZhangWWTZ12
  persistent URL:
  - https://dblp.org/rec/journals/tvcg/ZhangWWTZ12
Yizhong Zhang, Huamin Wang, Shuai Wang, Yiying Tong, Kun Zhou:
A Deformable Surface Model for Real-Time Water Drop Animation. IEEE Trans. Vis. Comput. Graph. 18(8): 1281-1289 (2012)
