Genshun Wan
2020 – today
- 2024
- [j2] Hang Chen, Qing Wang, Jun Du, Genshun Wan, Shifu Xiong, Baocai Yin, Jia Pan, Chin-Hui Lee:
Collaborative Viseme Subword and End-to-End Modeling for Word-Level Lip Reading. IEEE Trans. Multim. 26: 9358-9371 (2024)
- [c10] Minghui Wu, Haitao Tang, Jiahuan Fan, Ruoyu Wang, Hang Chen, Yanyong Zhang, Jun Du, Hengshun Zhou, Lei Sun, Xin Fang, Tian Gao, Genshun Wan, Jia Pan, Jianqing Gao:
Implicit Enhancement of Target Speaker in Speaker-Adaptive ASR through Efficient Joint Optimization. ICASSP 2024: 10051-10055
- [c9] Genshun Wan, Zhongfu Ye:
Multi-Modal Knowledge Transfer for Target Speaker Lipreading with Improved Audio-Visual Pretraining and Cross-Lingual Fine-Tuning. ICME Workshops 2024: 1-6
- [i9] Shutong Niu, Ruoyu Wang, Jun Du, Gaobin Yang, Yanhui Tu, Siyuan Wu, Shuangqing Qian, Huaxin Wu, Haitao Xu, Xueyang Zhang, Guolong Zhong, Xindi Yu, Jieru Chen, Mengzhi Wang, Di Cai, Tian Gao, Genshun Wan, Feng Ma, Jia Pan, Jianqing Gao:
The USTC-NERCSLIP Systems for the CHiME-8 NOTSOFAR-1 Challenge. CoRR abs/2409.02041 (2024)
- [i8] Genshun Wan, Mengzhi Wang, Tingzhi Mao, Hang Chen, Zhongfu Ye:
Lightweight Transducer Based on Frame-Level Criterion. CoRR abs/2409.13698 (2024)
- 2023
- [c8] Genshun Wan, Hang Chen, Tan Liu, Chenxi Wang, Jia Pan, Zhongfu Ye:
Progressive Multi-scale Self-supervised Learning for Speech Recognition. APSIPA ASC 2023: 978-982
- [c7] Genshun Wan, Hang Chen, Pengcheng Li, Jia Pan, Zhongfu Ye:
Improved Data2vec with Soft Supervised Hidden Unit for Mandarin Speech Recognition. APSIPA ASC 2023: 983-987
- [c6] Haitao Tang, Yu Fu, Lei Sun, Jiabin Xue, Dan Liu, Yongchao Li, Zhiqiang Ma, Minghui Wu, Jia Pan, Genshun Wan, Ming'en Zhao:
Reducing the GAP Between Streaming and Non-Streaming Transducer-Based ASR by Adaptive Two-Stage Knowledge Distillation. ICASSP 2023: 1-5
- [c5] Jing-Xuan Zhang, Genshun Wan, Zhen-Hua Ling, Jia Pan, Jianqing Gao, Cong Liu:
Self-Supervised Audio-Visual Speech Representations Learning by Multimodal Self-Distillation. ICASSP 2023: 1-5
- [i7] Haitao Tang, Yu Fu, Lei Sun, Jiabin Xue, Dan Liu, Yongchao Li, Zhiqiang Ma, Minghui Wu, Jia Pan, Genshun Wan, Ming'en Zhao:
Reducing the gap between streaming and non-streaming Transducer-based ASR by adaptive two-stage knowledge distillation. CoRR abs/2306.15171 (2023)
- [i6] Ruoyu Wang, Maokui He, Jun Du, Hengshun Zhou, Shutong Niu, Hang Chen, Yanyan Yue, Gaobin Yang, Shilong Wu, Lei Sun, Yanhui Tu, Haitao Tang, Shuangqing Qian, Tian Gao, Mengzhi Wang, Genshun Wan, Jia Pan, Jianqing Gao, Chin-Hui Lee:
The USTC-NERCSLIP Systems for the CHiME-7 DASR Challenge. CoRR abs/2308.14638 (2023)
- 2022
- [c4] Jing-Xuan Zhang, Genshun Wan, Jia Pan:
Is Lip Region-of-Interest Sufficient for Lipreading? ICMI 2022: 368-372
- [i5] Jing-Xuan Zhang, Genshun Wan, Jia Pan:
Is Lip Region-of-Interest Sufficient for Lipreading? CoRR abs/2205.14295 (2022)
- [i4] Jing-Xuan Zhang, Genshun Wan, Zhen-Hua Ling, Jia Pan, Jianqing Gao, Cong Liu:
Self-Supervised Audio-Visual Speech Representations Learning By Multimodal Self-Distillation. CoRR abs/2212.02782 (2022)
- [i3] Fenglin Ding, Genshun Wan, Pengcheng Li, Jia Pan, Cong Liu:
Improved Self-Supervised Multilingual Speech Representation Learning Combined with Auxiliary Language Information. CoRR abs/2212.03476 (2022)
- [i2] Genshun Wan, Tan Liu, Hang Chen, Jia Pan, Cong Liu, Zhongfu Ye:
Progressive Multi-Scale Self-Supervised Learning for Speech Recognition. CoRR abs/2212.03480 (2022)
- [i1] Pengcheng Li, Genshun Wan, Fenglin Ding, Hang Chen, Jianqing Gao, Jia Pan, Cong Liu:
Improved Speech Pre-Training with Supervision-Enhanced Acoustic Unit. CoRR abs/2212.03482 (2022)
- 2020
- [j1] Jia Pan, Genshun Wan, Jun Du, Zhongfu Ye:
Online Speaker Adaptation Using Memory-Aware Networks for Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1025-1037 (2020)
- [c3] Genshun Wan, Jia Pan, Qingran Wang, Jianqing Gao, Zhongfu Ye:
Speaker Adaptive Training for Speech Recognition Based on Attention-Over-Attention Mechanism. INTERSPEECH 2020: 1251-1255
- [c2] Huaxin Wu, Genshun Wan, Jia Pan:
Speaker Code Based Speaker Adaptive Training Using Model Agnostic Meta-Learning. INTERSPEECH 2020: 4362-4366
2010 – 2019
- 2018
- [c1] Jia Pan, Diyuan Liu, Genshun Wan, Jun Du, Qingfeng Liu, Zhongfu Ye:
Online Speaker Adaptation for LVCSR Based on Attention Mechanism. APSIPA 2018: 183-186
last updated on 2024-12-08 01:30 CET by the dblp team
all metadata released as open data under CC0 1.0 license