default search action
Yun Tang 0002
Person information
- affiliation: Facebook AI, USA
Other persons with the same name
- Yun Tang — disambiguation page
- Yun Tang 0001 — East China University of Science and Technology, School of Pharmacy, Shanghai, China
- Yun Tang 0003 — Nanyang Technological University, Alibaba-NTU Singapore Joint Research Institute, Singapore, Singapore
- Yun Tang 0004 — Central China Normal University, Wuhan, China
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2023
- [c32]Brian Yan, Jiatong Shi, Yun Tang, Hirofumi Inaguma, Yifan Peng, Siddharth Dalmia, Peter Polak, Patrick Fernandes, Dan Berrebbi, Tomoki Hayashi, Xiaohui Zhang, Zhaoheng Ni, Moto Hira, Soumi Maiti, Juan Pino, Shinji Watanabe:
ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit. ACL (demo) 2023: 400-411 - [c31]Changhan Wang, Hirofumi Inaguma, Peng-Jen Chen, Ilia Kulikov, Yun Tang, Wei-Ning Hsu, Michael Auli, Juan Pino:
Simple and Effective Unsupervised Speech Translation. ACL (1) 2023: 10771-10784 - [c30]Yun Tang, Anna Y. Sun, Hirofumi Inaguma, Xinyue Chen, Ning Dong, Xutai Ma, Paden Tomasello, Juan Pino:
Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks. ACL (1) 2023: 12441-12455 - [c29]Hirofumi Inaguma, Sravya Popuri, Ilia Kulikov, Peng-Jen Chen, Changhan Wang, Yu-An Chung, Yun Tang, Ann Lee, Shinji Watanabe, Juan Pino:
UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units. ACL (1) 2023: 15655-15680 - [c28]Marco Gaido, Yun Tang, Ilia Kulikov, Rongqing Huang, Hongyu Gong, Hirofumi Inaguma:
Named Entity Detection and Injection for Direct Speech Translation. ICASSP 2023: 1-5 - [c27]Xuan-Phi Nguyen, Sravya Popuri, Changhan Wang, Yun Tang, Ilia Kulikov, Hongyu Gong:
Improving Speech-to-Speech Translation Through Unlabeled Text. ICASSP 2023: 1-5 - [c26]Jiatong Shi, Yun Tang, Ann Lee, Hirofumi Inaguma, Changhan Wang, Juan Pino, Shinji Watanabe:
Enhancing Speech-To-Speech Translation with Multiple TTS Targets. ICASSP 2023: 1-5 - [c25]Jiatong Shi, Yun Tang, Hirofumi Inaguma, Hongyu Gong, Juan Pino, Shinji Watanabe:
Exploration on HuBERT with Multiple Resolution. INTERSPEECH 2023: 3287-3291 - [c24]Sweta Agrawal, Antonios Anastasopoulos, Luisa Bentivogli, Ondrej Bojar, Claudia Borg, Marine Carpuat, Roldano Cattoni, Mauro Cettolo, Mingda Chen, William Chen, Khalid Choukri, Alexandra Chronopoulou, Anna Currey, Thierry Declerck, Qianqian Dong, Kevin Duh, Yannick Estève, Marcello Federico, Souhir Gahbiche, Barry Haddow, Benjamin Hsu, Phu Mon Htut, Hirofumi Inaguma, Dávid Javorský, John Judge, Yasumasa Kano, Tom Ko, Rishu Kumar, Pengwei Li, Xutai Ma, Prashant Mathur, Evgeny Matusov, Paul McNamee, John P. McCrae, Kenton Murray, Maria Nadejde, Satoshi Nakamura, Matteo Negri, Ha Nguyen, Jan Niehues, Xing Niu, Atul Kr. Ojha, John E. Ortega, Proyag Pal, Juan Pino, Lonneke van der Plas, Peter Polák, Elijah Rippeth, Elizabeth Salesky, Jiatong Shi, Matthias Sperber, Sebastian Stüker, Katsuhito Sudoh, Yun Tang, Brian Thompson, Kevin Tran, Marco Turchi, Alex Waibel, Mingxuan Wang, Shinji Watanabe, Rodolfo Zevallos:
Findings of the IWSLT 2023 Evaluation Campaign. IWSLT@ACL 2023: 1-61 - [i26]Jiatong Shi, Yun Tang, Ann Lee, Hirofumi Inaguma, Changhan Wang, Juan Pino, Shinji Watanabe:
Enhancing Speech-to-Speech Translation with Multiple TTS Targets. CoRR abs/2304.04618 (2023) - [i25]Yun Tang, Anna Y. Sun, Hirofumi Inaguma, Xinyue Chen, Ning Dong, Xutai Ma, Paden D. Tomasello, Juan Pino:
Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks. CoRR abs/2305.03101 (2023) - [i24]Jiatong Shi, Yun Tang, Hirofumi Inaguma, Hongyu Gong, Juan Pino, Shinji Watanabe:
Exploration on HuBERT with Multiple Resolutions. CoRR abs/2306.01084 (2023) - 2022
- [c23]Yun Tang, Hongyu Gong, Ning Dong, Changhan Wang, Wei-Ning Hsu, Jiatao Gu, Alexei Baevski, Xian Li, Abdelrahman Mohamed, Michael Auli, Juan Miguel Pino:
Unified Speech-Text Pre-training for Speech Translation and Recognition. ACL (1) 2022: 1488-1499 - [c22]Ann Lee, Peng-Jen Chen, Changhan Wang, Jiatao Gu, Sravya Popuri, Xutai Ma, Adam Polyak, Yossi Adi, Qing He, Yun Tang, Juan Pino, Wei-Ning Hsu:
Direct Speech-to-Speech Translation With Discrete Units. ACL (1) 2022: 3327-3339 - [c21]Xuan-Phi Nguyen, Hongyu Gong, Yun Tang, Changhan Wang, Philipp Koehn, Shafiq R. Joty:
Contrastive Clustering to Mine Pseudo Parallel Data for Unsupervised Translation. ICLR 2022 - [c20]Danni Liu, Changhan Wang, Hongyu Gong, Xutai Ma, Yun Tang, Juan Miguel Pino:
From Start to Finish: Latency Reduction Strategies for Incremental Speech Synthesis in Simultaneous Speech-to-Speech Translation. INTERSPEECH 2022: 1771-1775 - [i23]Yun Tang, Hongyu Gong, Ning Dong, Changhan Wang, Wei-Ning Hsu, Jiatao Gu, Alexei Baevski, Xian Li, Abdelrahman Mohamed, Michael Auli, Juan Miguel Pino:
Unified Speech-Text Pre-training for Speech Translation and Recognition. CoRR abs/2204.05409 (2022) - [i22]Changhan Wang, Hirofumi Inaguma, Peng-Jen Chen, Ilia Kulikov, Yun Tang, Wei-Ning Hsu, Michael Auli, Juan Pino:
Simple and Effective Unsupervised Speech Translation. CoRR abs/2210.10191 (2022) - [i21]Marco Gaido, Yun Tang, Ilia Kulikov, Rongqing Huang, Hongyu Gong, Hirofumi Inaguma:
Named Entity Detection and Injection for Direct Speech Translation. CoRR abs/2210.11981 (2022) - [i20]Xuan-Phi Nguyen, Sravya Popuri, Changhan Wang, Yun Tang, Ilia Kulikov, Hongyu Gong:
Improving Speech-to-Speech Translation Through Unlabeled Text. CoRR abs/2210.14514 (2022) - [i19]Hirofumi Inaguma, Sravya Popuri, Ilia Kulikov, Peng-Jen Chen, Changhan Wang, Yu-An Chung, Yun Tang, Ann Lee, Shinji Watanabe, Juan Pino:
UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units. CoRR abs/2212.08055 (2022) - 2021
- [c19]Xian Li, Changhan Wang, Yun Tang, Chau Tran, Yuqing Tang, Juan Miguel Pino, Alexei Baevski, Alexis Conneau, Michael Auli:
Multilingual Speech Translation from Efficient Finetuning of Pretrained Models. ACL/IJCNLP (1) 2021: 827-838 - [c18]Yun Tang, Juan Miguel Pino, Xian Li, Changhan Wang, Dmitriy Genzel:
Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task. ACL/IJCNLP (1) 2021: 4252-4261 - [c17]Yun Tang, Juan Miguel Pino, Changhan Wang, Xutai Ma, Dmitriy Genzel:
A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks. ICASSP 2021: 6209-6213 - [c16]Yun Tang, Hongyu Gong, Xian Li, Changhan Wang, Juan Miguel Pino, Holger Schwenk, Naman Goyal:
FST: the FAIR Speech Translation System for the IWSLT21 Multilingual Shared Task. IWSLT 2021: 131-137 - [c15]Hongyu Gong, Yun Tang, Juan Miguel Pino, Xian Li:
Pay Better Attention to Attention: Head Selection in Multilingual and Multi-Domain Sequence Modeling. NeurIPS 2021: 2668-2681 - [i18]Hongyu Gong, Yun Tang, Juan Miguel Pino, Xian Li:
Pay Better Attention to Attention: Head Selection in Multilingual and Multi-Domain Sequence Modeling. CoRR abs/2106.10840 (2021) - [i17]Ann Lee, Peng-Jen Chen, Changhan Wang, Jiatao Gu, Xutai Ma, Adam Polyak, Yossi Adi, Qing He, Yun Tang, Juan Miguel Pino, Wei-Ning Hsu:
Direct speech-to-speech translation with discrete units. CoRR abs/2107.05604 (2021) - [i16]Yun Tang, Juan Miguel Pino, Xian Li, Changhan Wang, Dmitriy Genzel:
Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task. CoRR abs/2107.05782 (2021) - [i15]Yun Tang, Hongyu Gong, Xian Li, Changhan Wang, Juan Miguel Pino, Holger Schwenk, Naman Goyal:
FST: the FAIR Speech Translation System for the IWSLT21 Multilingual Shared Task. CoRR abs/2107.06959 (2021) - [i14]Danni Liu, Changhan Wang, Hongyu Gong, Xutai Ma, Yun Tang, Juan Miguel Pino:
Incremental Speech Synthesis For Speech-To-Speech Translation. CoRR abs/2110.08214 (2021) - [i13]Xutai Ma, Hongyu Gong, Danni Liu, Ann Lee, Yun Tang, Peng-Jen Chen, Wei-Ning Hsu, Kenneth Heafield, Phillip Koehn, Juan Miguel Pino:
Direct simultaneous speech to speech translation. CoRR abs/2110.08250 (2021) - 2020
- [c14]Shuaichen Chang, Pengfei Liu, Yun Tang, Jing Huang, Xiaodong He, Bowen Zhou:
Zero-Shot Text-to-SQL Learning with Auxiliary Task. AAAI 2020: 7488-7495 - [c13]Yun Tang, Jing Huang, Guangtao Wang, Xiaodong He, Bowen Zhou:
Orthogonal Relation Transforms with Graph Context Modeling for Knowledge Graph Embedding. ACL 2020: 2713-2722 - [c12]Changhan Wang, Yun Tang, Xutai Ma, Anne Wu, Dmytro Okhonko, Juan Miguel Pino:
Fairseq S2T: Fast Speech-to-Text Modeling with Fairseq. AACL/IJCNLP (System Demonstrations) 2020: 33-39 - [c11]Juan Miguel Pino, Qiantong Xu, Xutai Ma, Mohammad Javad Dousti, Yun Tang:
Self-Training for End-to-End Speech Translation. INTERSPEECH 2020: 1476-1480 - [i12]Juan Miguel Pino, Qiantong Xu, Xutai Ma, Mohammad Javad Dousti, Yun Tang:
Self-Training for End-to-End Speech Translation. CoRR abs/2006.02490 (2020) - [i11]Changhan Wang, Yun Tang, Xutai Ma, Anne Wu, Dmytro Okhonko, Juan Miguel Pino:
fairseq S2T: Fast Speech-to-Text Modeling with fairseq. CoRR abs/2010.05171 (2020) - [i10]Yun Tang, Juan Miguel Pino, Changhan Wang, Xutai Ma, Dmitriy Genzel:
A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks. CoRR abs/2010.11338 (2020) - [i9]Chau Tran, Changhan Wang, Yuqing Tang, Yun Tang, Juan Miguel Pino, Xian Li:
Cross-Modal Transfer Learning for Multilingual Speech-to-Text Translation. CoRR abs/2010.12829 (2020)
2010 – 2019
- 2019
- [c10]Chao Shang, Yun Tang, Jing Huang, Jinbo Bi, Xiaodong He, Bowen Zhou:
End-to-End Structure-Aware Convolutional Networks for Knowledge Base Completion. AAAI 2019: 3060-3067 - [c9]Ming Tu, Guangtao Wang, Jing Huang, Yun Tang, Xiaodong He, Bowen Zhou:
Multi-hop Reading Comprehension across Multiple Documents by Reasoning over Heterogeneous Graphs. ACL (1) 2019: 2704-2713 - [c8]Kevin Huang, Yun Tang, Jing Huang, Xiaodong He, Bowen Zhou:
Relation Module for Non-Answerable Predictions on Reading Comprehension. CoNLL 2019: 747-756 - [c7]Yun Tang, Guohong Ding, Jing Huang, Xiaodong He, Bowen Zhou:
Deep Speaker Embedding Learning with Multi-level Pooling for Text-independent Speaker Verification. ICASSP 2019: 6116-6120 - [c6]Kyu Jeong Han, Jing Huang, Yun Tang, Xiaodong He, Bowen Zhou:
Multi-Stride Self-Attention for Speech Recognition. INTERSPEECH 2019: 2788-2792 - [i8]Yun Tang, Guohong Ding, Jing Huang, Xiaodong He, Bowen Zhou:
Deep Speaker Embedding Learning with Multi-Level Pooling for Text-Independent Speaker Verification. CoRR abs/1902.07821 (2019) - [i7]Ming Tu, Yun Tang, Jing Huang, Xiaodong He, Bowen Zhou:
Towards adversarial learning of speaker-invariant representation for speech emotion recognition. CoRR abs/1903.09606 (2019) - [i6]Kong Aik Lee, Ville Hautamäki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Héctor Delgado, Jose Patino, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda, Trung Ngo Trong, Md. Sahidullah, Fan Lu, Yun Tang, Ming Tu, Kah Kuan Teh, Tran Huy Dat, Kuruvachan K. George, Ivan Kukanov, Florent Desnous, Jichen Yang, Emre Yilmaz, Longting Xu, Jean-François Bonastre, Chenglin Xu, Zhi Hao Lim, Eng Siong Chng, Shivesh Ranjan, John H. L. Hansen, Massimiliano Todisco, Nicholas W. D. Evans:
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences. CoRR abs/1904.07386 (2019) - [i5]Ming Tu, Guangtao Wang, Jing Huang, Yun Tang, Xiaodong He, Bowen Zhou:
Multi-hop Reading Comprehension across Multiple Documents by Reasoning over Heterogeneous Graphs. CoRR abs/1905.07374 (2019) - [i4]Shuaichen Chang, Pengfei Liu, Yun Tang, Jing Huang, Xiaodong He, Bowen Zhou:
Zero-shot Text-to-SQL Learning with Auxiliary Task. CoRR abs/1908.11052 (2019) - [i3]Kevin Huang, Yun Tang, Jing Huang, Xiaodong He, Bowen Zhou:
Relation Module for Non-answerable Prediction on Question Answering. CoRR abs/1910.10843 (2019) - [i2]Yun Tang, Jing Huang, Guangtao Wang, Xiaodong He, Bowen Zhou:
Orthogonal Relation Transforms with Graph Context Modeling for Knowledge Graph Embedding. CoRR abs/1911.04910 (2019) - 2018
- [i1]Chao Shang, Yun Tang, Jing Huang, Jinbo Bi, Xiaodong He, Bowen Zhou:
End-to-end Structure-Aware Convolutional Networks for Knowledge Base Completion. CoRR abs/1811.04441 (2018)
2000 – 2009
- 2006
- [j1]Yun Tang, Wenju Liu, Yiyan Zhang, Bo Xu:
A Fast Framework for the Constrained Mean Trajectory Segment Model by Avoidance of Redundant Computation on Segment. Int. J. Comput. Linguistics Chin. Lang. Process. 11(1) (2006) - [c5]Yun Tang, Wenju Liu, Hua Zhang, Bo Xu, Guo-Hong Ding:
One-Pass Coarse-to-Fine Segmental Speech Decoding Algorithm. ICASSP (1) 2006: 441-444 - [c4]Hua Zhang, Yun Tang, Wenju Liu, Bo Xu:
Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition. ISCSLP 2006 - [c3]Yun Tang, Wenju Liu, Bo Xu:
All-Path Decoding Algorithm for Segmental Based Speech Recognition. ISCSLP (Selected Papers) 2006: 435-444 - 2004
- [c2]Yun Tang, Wenju Liu, Yiyan Zhang, Bo Xu:
A framework for fast segment model by avoidance of redundant computation on segment. ISCSLP 2004: 117-120 - [c1]Yun Tang, Wenju Liu, Bo Xu:
Trigram duration modeling in speech recognition. ISCSLP 2004: 225-228
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-18 18:21 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint