Zexu Pan

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j7]
https://dblp.org/rec/journals/taslp/WangPLWL24
Wupeng Wang, Zexu Pan, Xinke Li, Shuai Wang, Haizhou Li:
Speech Separation With Pretrained Frontend to Minimize Domain Mismatch. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4184-4198 (2024)
[j6]
https://dblp.org/rec/journals/taslp/PanBCSL24
Zexu Pan, Marvin Borsdorf, Siqi Cai, Tanja Schultz, Haizhou Li:
NeuroHeed: Neuro-Steered Speaker Extraction Using EEG Signals. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4456-4470 (2024)
[j5]
https://dblp.org/rec/journals/tci/ZhangPLL24
Shuo Zhang, Zexu Pan, Yichang Lv, Youfang Lin:
Hierarchical Edge Refinement Network for Guided Depth Map Super-Resolution. IEEE Trans. Computational Imaging 10: 469-478 (2024)
[c20]
https://dblp.org/rec/conf/aaai/WangPZT024
Jiadong Wang, Zexu Pan, Malu Zhang, Robby T. Tan, Haizhou Li:
Restoring Speaking Lips from Occlusion for Audio-Visual Speech Recognition. AAAI 2024: 19144-19152
[c19]
https://dblp.org/rec/conf/icassp/PanWGSR24
Zexu Pan, Gordon Wichern, François G. Germain, Aswin Shanmugam Subramanian, Jonathan Le Roux:
Late Audio-Visual Fusion for in-the-Wild Speaker Diarization. ICASSP Workshops 2024: 174-178
[c18]
https://dblp.org/rec/conf/icassp/MasuyamaWGPKHR24
Yoshiki Masuyama, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization. ICASSP 2024: 1016-1020
[c17]
https://dblp.org/rec/conf/icassp/BraliosWGPKHR24
Dimitrios Bralios, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
Generation or Replication: Auscultating Audio Latent Diffusion Models. ICASSP 2024: 1156-1160
[c16]
https://dblp.org/rec/conf/icassp/QianPZCL24
Xinyuan Qian, Zexu Pan, Qiquan Zhang, Kainan Chen, Shoufeng Lin:
GLMB 3D Speaker Tracking with Video-Assisted Multi-Channel Audio Optimization Functions. ICASSP 2024: 8100-8104
[c15]
https://dblp.org/rec/conf/icassp/ChenQPC024
Yu Chen, Xinyuan Qian, Zexu Pan, Kainan Chen, Haizhou Li:
LOCSELECT: Target Speaker Localization with an Auditory Selective Hearing Mechanism. ICASSP 2024: 8696-8700
[c14]
https://dblp.org/rec/conf/icassp/LiTPGW024
Junjie Li, Ruijie Tao, Zexu Pan, Meng Ge, Shuai Wang, Haizhou Li:
Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-Talker Speech. ICASSP 2024: 10666-10670
[c13]
https://dblp.org/rec/conf/icassp/PanWGKR24
Zexu Pan, Gordon Wichern, François G. Germain, Sameer Khurana, Jonathan Le Roux:
NeuroHeed+: Improving Neuro-Steered Speaker Extraction with Joint Auditory Attention Detection. ICASSP 2024: 11456-11460
[c12]
https://dblp.org/rec/conf/iwaenc/SaijoWGPR24
Kohei Saijo, Gordon Wichern, François G. Germain, Zexu Pan, Jonathan Le Roux:
TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement. IWAENC 2024: 205-209
[i21]
https://dblp.org/rec/journals/corr/abs-2402-17907
Yoshiki Masuyama, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization. CoRR abs/2402.17907 (2024)
[i20]
https://dblp.org/rec/journals/corr/abs-2408-03438
Kohei Saijo, Gordon Wichern, François G. Germain, Zexu Pan, Jonathan Le Roux:
Enhanced Reverberation as Supervision for Unsupervised Speech Separation. CoRR abs/2408.03438 (2024)
[i19]
https://dblp.org/rec/journals/corr/abs-2408-03440
Kohei Saijo, Gordon Wichern, François G. Germain, Zexu Pan, Jonathan Le Roux:
TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement. CoRR abs/2408.03440 (2024)
[i18]
https://dblp.org/rec/journals/corr/abs-2409-16681
Kun Zhou, You Zhang, Shengkui Zhao, Hao Wang, Zexu Pan, Dianwen Ng, Chong Zhang, Chongjia Ni, Yukun Ma, Trung Hieu Nguyen, Jia Qi Yip, Bin Ma:
Emotional Dimension Control in Language Model-Based Text-to-Speech: Spanning a Broad Spectrum of Human Emotions. CoRR abs/2409.16681 (2024)
2023
[j4]
https://dblp.org/rec/journals/spl/WangPGYL23
Tingting Wang, Zexu Pan, Meng Ge, Zhen Yang, Haizhou Li:
Time-Domain Speech Separation Networks With Graph Encoding Auxiliary. IEEE Signal Process. Lett. 30: 110-114 (2023)
[c11]
https://dblp.org/rec/conf/asru/PanWMGKHR23
Zexu Pan, Gordon Wichern, Yoshiki Masuyama, François G. Germain, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
Scenario-Aware Audio-Visual TF-Gridnet for Target Speech Extraction. ASRU 2023: 1-8
[c10]
https://dblp.org/rec/conf/icassp/PanWBL23
Zexu Pan, Wupeng Wang, Marvin Borsdorf, Haizhou Li:
ImagineNet: Target Speaker Extraction with Intermittent Visual Cue Through Embedding Inpainting. ICASSP 2023: 1-5
[c9]
https://dblp.org/rec/conf/interspeech/JiangTP023
Yidi Jiang, Ruijie Tao, Zexu Pan, Haizhou Li:
Target Active Speaker Detection with Audio-visual Cues. INTERSPEECH 2023: 3152-3156
[c8]
https://dblp.org/rec/conf/interspeech/ZhangBP0WW23
Ke Zhang, Marvin Borsdorf, Zexu Pan, Haizhou Li, Yangjie Wei, Yi Wang:
Speaker Extraction with Detection of Presence and Absence of Target Speakers. INTERSPEECH 2023: 3714-3718
[c7]
https://dblp.org/rec/conf/interspeech/LiGPCW0Z23
Junjie Li, Meng Ge, Zexu Pan, Rui Cao, Longbiao Wang, Jianwu Dang, Shiliang Zhang:
Rethinking the Visual Cues in Audio-Visual Speaker Extraction. INTERSPEECH 2023: 3754-3758
[i17]
https://dblp.org/rec/journals/corr/abs-2305-12831
Yidi Jiang, Ruijie Tao, Zexu Pan, Haizhou Li:
Target Active Speaker Detection with Audio-visual Cues. CoRR abs/2305.12831 (2023)
[i16]
https://dblp.org/rec/journals/corr/abs-2306-02625
Junjie Li, Meng Ge, Zexu Pan, Rui Cao, Longbiao Wang, Jianwu Dang, Shiliang Zhang:
Rethinking the visual cues in audio-visual speaker extraction. CoRR abs/2306.02625 (2023)
[i15]
https://dblp.org/rec/journals/corr/abs-2309-08408
Junjie Li, Ruijie Tao, Zexu Pan, Meng Ge, Shuai Wang, Haizhou Li:
Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-talker Speech. CoRR abs/2309.08408 (2023)
[i14]
https://dblp.org/rec/journals/corr/abs-2310-10497
Yu Chen, Xinyuan Qian, Zexu Pan, Kainan Chen, Haizhou Li:
LocSelect: Target Speaker Localization with an Auditory Selective Hearing Mechanism. CoRR abs/2310.10497 (2023)
[i13]
https://dblp.org/rec/journals/corr/abs-2310-10604
Dimitrios Bralios, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
Generation or Replication: Auscultating Audio Latent Diffusion Models. CoRR abs/2310.10604 (2023)
[i12]
https://dblp.org/rec/journals/corr/abs-2310-19644
Zexu Pan, Gordon Wichern, Yoshiki Masuyama, François G. Germain, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction. CoRR abs/2310.19644 (2023)
[i11]
https://dblp.org/rec/journals/corr/abs-2312-07513
Zexu Pan, Gordon Wichern, François G. Germain, Sameer Khurana, Jonathan Le Roux:
NeuroHeed+: Improving Neuro-steered Speaker Extraction with Joint Auditory Attention Detection. CoRR abs/2312.07513 (2023)
2022
[j3]
https://dblp.org/rec/journals/spl/PanQL22
Zexu Pan, Xinyuan Qian, Haizhou Li:
Speaker Extraction With Co-Speech Gestures Cue. IEEE Signal Process. Lett. 29: 1467-1471 (2022)
[j2]
https://dblp.org/rec/journals/taslp/PanTXL22
Zexu Pan, Ruijie Tao, Chenglin Xu, Haizhou Li:
Selective Listening by Synchronizing Speech With Lips. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1650-1664 (2022)
[j1]
https://dblp.org/rec/journals/taslp/PanGL22
Zexu Pan, Meng Ge, Haizhou Li:
USEV: Universal Speaker Extraction With Visual Cue. IEEE ACM Trans. Audio Speech Lang. Process. 30: 3032-3045 (2022)
[c6]
https://dblp.org/rec/conf/interspeech/LiGPWD22
Junjie Li, Meng Ge, Zexu Pan, Longbiao Wang, Jianwu Dang:
VCSE: Time-Domain Visual-Contextual Speaker Extraction Network. INTERSPEECH 2022: 906-910
[c5]
https://dblp.org/rec/conf/interspeech/PanG022
Zexu Pan, Meng Ge, Haizhou Li:
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction. INTERSPEECH 2022: 1786-1790
[i10]
https://dblp.org/rec/journals/corr/abs-2203-16840
Zexu Pan, Xinyuan Qian, Haizhou Li:
Speaker Extraction with Co-Speech Gestures Cue. CoRR abs/2203.16840 (2022)
[i9]
https://dblp.org/rec/journals/corr/abs-2203-16843
Zexu Pan, Meng Ge, Haizhou Li:
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction. CoRR abs/2203.16843 (2022)
[i8]
https://dblp.org/rec/journals/corr/abs-2210-06177
Junjie Li, Meng Ge, Zexu Pan, Longbiao Wang, Jianwu Dang:
VCSE: Time-Domain Visual-Contextual Speaker Extraction Network. CoRR abs/2210.06177 (2022)
[i7]
https://dblp.org/rec/journals/corr/abs-2211-00109
Zexu Pan, Wupeng Wang, Marvin Borsdorf, Haizhou Li:
ImagineNET: Target Speaker Extraction with Intermittent Visual Cue through Embedding Inpainting. CoRR abs/2211.00109 (2022)
[i6]
https://dblp.org/rec/journals/corr/abs-2211-01299
Zexu Pan, Gordon Wichern, François G. Germain, Aswin Shanmugam Subramanian, Jonathan Le Roux:
Towards End-to-end Speaker Diarization in the Wild. CoRR abs/2211.01299 (2022)
2021
[c4]
https://dblp.org/rec/conf/icassp/QianMPW021
Xinyuan Qian, Maulik C. Madhavi, Zexu Pan, Jiadong Wang, Haizhou Li:
Multi-Target DoA Estimation with an Audio-Visual Fusion Mechanism. ICASSP 2021: 4280-4284
[c3]
https://dblp.org/rec/conf/icassp/PanTX021
Zexu Pan, Ruijie Tao, Chenglin Xu, Haizhou Li:
Muse: Multi-Modal Target Speaker Extraction with Visual Cues. ICASSP 2021: 6678-6682
[c2]
https://dblp.org/rec/conf/mm/TaoPDQS021
Ruijie Tao, Zexu Pan, Rohan Kumar Das, Xinyuan Qian, Mike Zheng Shou, Haizhou Li:
Is Someone Speaking?: Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection. ACM Multimedia 2021: 3927-3935
[i5]
https://dblp.org/rec/journals/corr/abs-2105-06107
Xinyuan Qian, Maulik C. Madhavi, Zexu Pan, Jiadong Wang, Haizhou Li:
Multi-target DoA Estimation with an Audio-visual Fusion Mechanism. CoRR abs/2105.06107 (2021)
[i4]
https://dblp.org/rec/journals/corr/abs-2107-06592
Ruijie Tao, Zexu Pan, Rohan Kumar Das, Xinyuan Qian, Mike Zheng Shou, Haizhou Li:
Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection. CoRR abs/2107.06592 (2021)
[i3]
https://dblp.org/rec/journals/corr/abs-2109-14831
Zexu Pan, Meng Ge, Haizhou Li:
USEV: Universal Speaker Extraction with Visual Cue. CoRR abs/2109.14831 (2021)
2020
[c1]
https://dblp.org/rec/conf/interspeech/PanLY020
Zexu Pan, Zhaojie Luo, Jichen Yang, Haizhou Li:
Multi-Modal Attention for Speech Emotion Recognition. INTERSPEECH 2020: 364-368
[i2]
https://dblp.org/rec/journals/corr/abs-2009-04107
Zexu Pan, Zhaojie Luo, Jichen Yang, Haizhou Li:
Multi-modal Attention for Speech Emotion Recognition. CoRR abs/2009.04107 (2020)
[i1]
https://dblp.org/rec/journals/corr/abs-2010-07775
Zexu Pan, Ruijie Tao, Chenglin Xu, Haizhou Li:
Muse: Multi-modal target speaker extraction with visual cues. CoRR abs/2010.07775 (2020)
