default search action

combined dblp search
author search
venue search
publication search

ask others

Pavel Denisov

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/ecai/ShuklaDT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ecai/ShuklaDT24
Sakshi Deo Shukla, Pavel Denisov, Tugtekin Turan:
Advancing Topic Segmentation of Broadcasted Speech with Multilingual Semantic Embeddings. ECAI 2024: 3956-3963
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChangYCJLMSST0F24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChangYCJLMSST0F24
Xuankai Chang, Brian Yan, Kwanghee Choi, Jee-Weon Jung, Yichen Lu, Soumi Maiti, Roshan S. Sharma, Jiatong Shi, Jinchuan Tian, Shinji Watanabe, Yuya Fujita, Takashi Maekaku, Pengcheng Guo, Yao-Fei Cheng, Pavel Denisov, Kohei Saijo, Hsiu-Hsuan Wang:
Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study. ICASSP 2024: 11481-11485
[c18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/naacl/DenisovV24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/DenisovV24
Pavel Denisov, Thang Vu:
Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training. NAACL-HLT (Findings) 2024: 814-834
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-10922
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-10922
Pavel Denisov, Ngoc Thang Vu:
Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training. CoRR abs/2404.10922 (2024)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-06222
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-06222
Sakshi Deo Shukla, Pavel Denisov, Tugtekin Turan:
Advancing Topic Segmentation of Broadcasted Speech with Multilingual Semantic Embeddings. CoRR abs/2409.06222 (2024)
2023
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/DenisovV23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/DenisovV23
Pavel Denisov, Ngoc Thang Vu:
Leveraging Multilingual Self-Supervised Pretrained Models for Sequence-to-Sequence End-to-End Spoken Language Understanding. ASRU 2023: 1-8
[c16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/LuxKMBSDSV23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/LuxKMBSDSV23
Florian Lux, Julia Koch, Sarina Meyer, Thomas Bott, Nadja Schauffler, Pavel Denisov, Antje Schweitzer, Ngoc Thang Vu:
The IMS Toucan System for the Blizzard Challenge 2023. Blizzard Challenge 2023
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MeyerLKDTV23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MeyerLKDTV23
Sarina Meyer, Florian Lux, Julia Koch, Pavel Denisov, Pascal Tilli, Ngoc Thang Vu:
Prosody Is Not Identity: A Speaker Anonymization Approach Using Prosody Cloning. ICASSP 2023: 1-5
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-15800
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-15800
Xuankai Chang, Brian Yan, Kwanghee Choi, Jee-Weon Jung, Yichen Lu, Soumi Maiti, Roshan S. Sharma, Jiatong Shi, Jinchuan Tian, Shinji Watanabe, Yuya Fujita, Takashi Maekaku, Pengcheng Guo, Yao-Fei Cheng, Pavel Denisov, Kohei Saijo, Hsiu-Hsuan Wang:
Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study. CoRR abs/2309.15800 (2023)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-06103
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-06103
Pavel Denisov, Ngoc Thang Vu:
Leveraging Multilingual Self-Supervised Pretrained Models for Sequence-to-Sequence End-to-End Spoken Language Understanding. CoRR abs/2310.06103 (2023)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-17499
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-17499
Florian Lux, Julia Koch, Sarina Meyer, Thomas Bott, Nadja Schauffler, Pavel Denisov, Antje Schweitzer, Ngoc Thang Vu:
The IMS Toucan System for the Blizzard Challenge 2023. CoRR abs/2310.17499 (2023)
2022
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/csl/HamedDLEAV22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/HamedDLEAV22
Injy Hamed, Pavel Denisov, Chia-Yu Li, Mohamed Elmahdy, Slim Abdennadher, Ngoc Thang Vu:
Investigations on speech recognition systems for low-resource dialectal Arabic-English code-switching speech. Comput. Speech Lang. 72: 101278 (2022)
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/AroraDDCUPZKGYV22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/AroraDDCUPZKGYV22
Siddhant Arora, Siddharth Dalmia, Pavel Denisov, Xuankai Chang, Yushi Ueda, Yifan Peng, Yuekai Zhang, Sujay Kumar, Karthik Ganesan, Brian Yan, Ngoc Thang Vu, Alan W. Black, Shinji Watanabe:
ESPnet-SLU: Advancing Spoken Language Understanding Through ESPnet. ICASSP 2022: 7167-7171
[c13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MeyerLDKTV22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MeyerLDKTV22
Sarina Meyer, Florian Lux, Pavel Denisov, Julia Koch, Pascal Tilli, Ngoc Thang Vu:
Speaker Anonymization with Phonetic Intermediate Representations. INTERSPEECH 2022: 4925-4929
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/MeyerTDLKV22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/MeyerTDLKV22
Sarina Meyer, Pascal Tilli, Pavel Denisov, Florian Lux, Julia Koch, Ngoc Thang Vu:
Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy. SLT 2022: 912-919
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-04834
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-04834
Sarina Meyer, Florian Lux, Pavel Denisov, Julia Koch, Pascal Tilli, Ngoc Thang Vu:
Speaker Anonymization with Phonetic Intermediate Representations. CoRR abs/2207.04834 (2022)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-07002
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-07002
Sarina Meyer, Pascal Tilli, Pavel Denisov, Florian Lux, Julia Koch, Ngoc Thang Vu:
Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy. CoRR abs/2210.07002 (2022)
2021
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/iwslt/DenisovMV21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwslt/DenisovMV21
Pavel Denisov, Manuel Mager, Ngoc Thang Vu:
IMS' Systems for the IWSLT 2021 Low-Resource Speech Translation Task. IWSLT 2021: 175-181
[c10]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/EbrahimiMWDOLKULNRTAKPBCSATIACCCFLMOPZ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/EbrahimiMWDOLKULNRTAKPBCSATIACCCFLMOPZ21
Abteen Ebrahimi, Manuel Mager, Adam Wiemerslage, Pavel Denisov, Arturo Oncevay, Danni Liu, Sai Koneru, Enes Yavuz Ugan, Zhaolin Li, Jan Niehues, Monica Romero, Iván G. Torre, Tanel Alumäe, Jiaming Kong, Sergey Polezhaev, Yury Belousov, Wei-Rui Chen, Peter Sullivan, Ife Adebara, Bashar Talafha, Alcides Alcoba Inciarte, Muhammad Abdul-Mageed, Luis Chiruzzo, Rolando Coto-Solano, Hilaria Cruz, Sofía Flores-Solórzano, Aldo Andrés Alvarez López, Iván V. Meza-Ruíz, John E. Ortega, Alexis Palmer, Rodolfo Zevallos, Kristine Stenzel, Thang Vu, Katharina Kann:
Findings of the Second AmericasNLP Competition on Speech-to-Text Translation. NeurIPS (Competition and Demos) 2021: 217-232
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/RajDCEHH0DYLKLW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/RajDCEHH0DYLKLW21
Desh Raj, Pavel Denisov, Zhuo Chen, Hakan Erdogan, Zili Huang, Maokui He, Shinji Watanabe, Jun Du, Takuya Yoshioka, Yi Luo, Naoyuki Kanda, Jinyu Li, Scott Wisdom, John R. Hershey:
Integration of Speech Separation, Diarization, and Recognition for Multi-Speaker Meetings: System Description, Comparison, and Analysis. SLT 2021: 897-904
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-16055
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-16055
Pavel Denisov, Manuel Mager, Ngoc Thang Vu:
IMS' Systems for the IWSLT 2021 Low-Resource Speech Translation Task. CoRR abs/2106.16055 (2021)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2108-12881
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-12881
Injy Hamed, Pavel Denisov, Chia-Yu Li, Mohamed Elmahdy, Slim Abdennadher, Ngoc Thang Vu:
Investigations on Speech Recognition Systems for Low-Resource Dialectal Arabic-English Code-Switching Speech. CoRR abs/2108.12881 (2021)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2111-14706
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-14706
Siddhant Arora, Siddharth Dalmia, Pavel Denisov, Xuankai Chang, Yushi Ueda, Yifan Peng, Yuekai Zhang, Sujay Kumar, Karthik Ganesan, Brian Yan, Ngoc Thang Vu, Alan W. Black, Shinji Watanabe:
ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet. CoRR abs/2111.14706 (2021)
2020
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/LiOVLVSNVDJKV20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/LiOVLVSNVDJKV20
Chia-Yu Li, Daniel Ortega, Dirk Väth, Florian Lux, Lindsey Vanderlyn, Maximilian Schmidt, Michael Neumann, Moritz Völkel, Pavel Denisov, Sabrina Jenne, Zorica Kacarevic, Ngoc Thang Vu:
ADVISER: A Toolkit for Developing Multi-modal, Multi-domain and Socially-engaged Conversational Agents. ACL (demo) 2020: 279-286
[c7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DenisovV20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DenisovV20
Pavel Denisov, Ngoc Thang Vu:
Pretrained Semantic Speech Embeddings for End-to-End Spoken Language Understanding via Cross-Modal Teacher-Student Learning. INTERSPEECH 2020: 881-885
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-01777
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-01777
Chia-Yu Li, Daniel Ortega, Dirk Väth, Florian Lux, Lindsey Vanderlyn, Maximilian Schmidt, Michael Neumann, Moritz Völkel, Pavel Denisov, Sabrina Jenne, Zorica Kacarevic, Ngoc Thang Vu:
ADVISER: A Toolkit for Developing Multi-modal, Multi-domain and Socially-engaged Conversational Agents. CoRR abs/2005.01777 (2020)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2007-01836
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-01836
Pavel Denisov, Ngoc Thang Vu:
Pretrained Semantic Speech Embeddings for End-to-End Spoken Language Understanding via Cross-Modal Teacher-Student Learning. CoRR abs/2007.01836 (2020)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-02014
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-02014
Desh Raj, Pavel Denisov, Zhuo Chen, Hakan Erdogan, Zili Huang, Mao-Kui He, Shinji Watanabe, Jun Du, Takuya Yoshioka, Yi Luo, Naoyuki Kanda, Jinyu Li, Scott Wisdom, John R. Hershey:
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis. CoRR abs/2011.02014 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/OrtegaLVDV19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/OrtegaLVDV19
Daniel Ortega, Chia-Yu Li, Gisela Vallejo, Pavel Denisov, Ngoc Thang Vu:
Context-aware Neural-based Dialog Act Classification on Automatically Generated Transcriptions. ICASSP 2019: 7265-7269
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/igarss/ZakharovD19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/igarss/ZakharovD19
Alexander Zakharov, Pavel Denisov:
Advantages and Limitations of Forward Squint SAR In Single Pass Interferometric Mapping Of Topography. IGARSS 2019: 8614-8616
[c4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DenisovV19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DenisovV19
Pavel Denisov, Ngoc Thang Vu:
End-to-End Multi-Speaker Speech Recognition Using Speaker Embeddings and Transfer Learning. INTERSPEECH 2019: 4425-4429
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1902-11060
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-11060
Daniel Ortega, Chia-Yu Li, Gisela Vallejo, Pavel Denisov, Ngoc Thang Vu:
Context-aware Neural-based Dialog Act Classification on Automatically Generated Transcriptions. CoRR abs/1902.11060 (2019)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1908-04737
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1908-04737
Pavel Denisov, Ngoc Thang Vu:
End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning. CoRR abs/1908.04737 (2019)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1908-04743
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1908-04743
Pavel Denisov, Ngoc Thang Vu:
IMS-Speech: A Speech to Text Tool. CoRR abs/1908.04743 (2019)
2018
[c3]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/ITGspeech/DenisovVF18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ITGspeech/DenisovVF18
Pavel Denisov, Ngoc Thang Vu, Marc Ferras Font:
Unsupervised Domain Adaptation by Adversarial Learning for Robust Speech Recognition. ITG Symposium on Speech Communication 2018: 1-5
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/aist/KustikovaKPDZIM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aist/KustikovaKPDZIM18
Valentina Kustikova, Mikhail Krivonosov, Alexey S. Pimashkin, Pavel Denisov, Alexey Zaikin, Mikhail Ivanchenko, Iosif B. Meyerov, Alexey V. Semyanov:
CalciumCV: Computer Vision Software for Calcium Signaling in Astrocytes. AIST 2018: 168-179
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/igarss/ZakharovZMD18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/igarss/ZakharovZMD18
Alexander Zakharov, Ludmila Zakharova, Polina Mikhaylyukova, Pavel Denisov:
Atmospheric Effects on Radarsat-2 Interferograms of Tolbachik Volcanic Complex. IGARSS 2018: 2192-2195
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1807-11284
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1807-11284
Pavel Denisov, Ngoc Thang Vu, Marc Ferras Font:
Unsupervised Domain Adaptation by Adversarial Learning for Robust Speech Recognition. CoRR abs/1807.11284 (2018)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.