Di He 0004

Person information

affiliation: Amazon Alexa, Seattle, WA, USA
affiliation (PhD 2019): University of Illinois at Urbana-Champaign, Department of Electrical and Computer Engineering, Coordinated Science Lab, Urbana, IL, USA
affiliation: Inspirit IoT, Inc., Champaign, IL, USA

2020 – today

2024
[c12]
Jinhan Wang, Long Chen, Aparna Khare, Anirudh Raju, Pranav Dheram, Di He, Minhua Wu, Andreas Stolcke, Venkatesh Ravichandran:
Turn-Taking and Backchannel Prediction with Acoustic and Large Language Model Fusion. ICASSP 2024: 12121-12125
[i10]
Anirudh Raju, Aparna Khare, Di He, Ilya Sklyar, Long Chen, Sam Alptekin, Viet Anh Trinh, Zhe Zhang, Colin Vaz, Venkatesh Ravichandran, Roland Maas, Ariya Rastrow:
Two-pass Endpoint Detection for Speech Recognition. CoRR abs/2401.08916 (2024)
[i9]
Jinhan Wang, Long Chen, Aparna Khare, Anirudh Raju, Pranav Dheram, Di He, Minhua Wu, Andreas Stolcke, Venkatesh Ravichandran:
Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion. CoRR abs/2401.14717 (2024)
2023
[c11]
Anirudh Raju, Aparna Khare, Di He, Ilya Sklyar, Long Chen, Sam Alptekin, Viet Anh Trinh, Zhe Zhang, Colin Vaz, Venkatesh Ravichandran, Roland Maas, Ariya Rastrow:
Two-Pass Endpoint Detection for Speech Recognition. ASRU 2023: 1-8
[c10]
Yifeng Fan, Colin Vaz, Di He, Jahn Heymann, Viet Anh Trinh, Zhe Zhang, Venkatesh Ravichandran:
Towards Accurate and Real-Time End-of-Speech Estimation. ICASSP 2023: 1-5
[c9]
Do June Min, Andreas Stolcke, Anirudh Raju, Colin Vaz, Di He, Venkatesh Ravichandran, Viet Anh Trinh:
Adaptive Endpointing with Deep Contextual Multi-Armed Bandits. ICASSP 2023: 1-5
[c8]
Andreas Schwarz, Di He, Maarten Van Segbroeck, Mohammed Hethnawi, Ariya Rastrow:
Personalized Predictive ASR for Latency Reduction in Voice Assistants. INTERSPEECH 2023: 745-749
[i8]
Do June Min, Andreas Stolcke, Anirudh Raju, Colin Vaz, Di He, Venkatesh Ravichandran, Viet Anh Trinh:
Adaptive Endpointing with Deep Contextual Multi-armed Bandits. CoRR abs/2303.13407 (2023)
[i7]
Andreas Schwarz, Di He, Maarten Van Segbroeck, Mohammed Hethnawi, Ariya Rastrow:
Personalized Predictive ASR for Latency Reduction in Voice Assistants. CoRR abs/2305.13794 (2023)
2022
[c7]
Jinhan Wang, Xiaosu Tong, Jinxi Guo, Di He, Roland Maas:
VADOI: Voice-Activity-Detection Overlapping Inference for End-To-End Long-Form Speech Recognition. ICASSP 2022: 6977-6981
[i6]
Jinhan Wang, Xiaosu Tong, Jinxi Guo, Di He, Roland Maas:
VADOI: Voice-Activity-Detection Overlapping Inference For End-to-end Long-form Speech Recognition. CoRR abs/2202.10593 (2022)
2021
[c6]
Samik Sadhu, Di He, Che-Wei Huang, Sri Harish Mallidi, Minhua Wu, Ariya Rastrow, Andreas Stolcke, Jasha Droppo, Roland Maas:
wav2vec-C: A Self-Supervised Model for Speech Representation Learning. Interspeech 2021: 711-715
[i5]
Samik Sadhu, Di He, Che-Wei Huang, Sri Harish Mallidi, Minhua Wu, Ariya Rastrow, Andreas Stolcke, Jasha Droppo, Roland Maas:
Wav2vec-C: A Self-supervised Model for Speech Representation Learning. CoRR abs/2103.08393 (2021)

2010 – 2019

2019
[b1]
Di He:
The benefits of acoustic perceptual information for speech processing systems. University of Illinois Urbana-Champaign, USA, 2019
[c5]
Di He, Xuesong Yang, Boon Pang Lim, Yi Liang, Mark Hasegawa-Johnson, Deming Chen:
When CTC Training Meets Acoustic Landmarks. ICASSP 2019: 5996-6000
2018
[c4]
Di He, Boon Pang Lim, Xuesong Yang, Mark Hasegawa-Johnson, Deming Chen:
Improved ASR for Under-resourced Languages through Multi-task Learning with Acoustic Landmarks. INTERSPEECH 2018: 2618-2622
[i4]
Di He, Boon Pang Lim, Xuesong Yang, Mark Hasegawa-Johnson, Deming Chen:
Improved ASR for Under-Resourced Languages Through Multi-Task Learning with Acoustic Landmarks. CoRR abs/1805.05574 (2018)
[i3]
Di He:
Augmenting Input Method Language Model with user Location Type Information. CoRR abs/1809.08349 (2018)
[i2]
Di He, Xuesong Yang, Boon Pang Lim, Yi Liang, Mark Hasegawa-Johnson, Deming Chen:
When CTC Training Meets Acoustic Landmarks. CoRR abs/1811.02063 (2018)
2017
[c3]
Xiaofan Zhang, Anand Ramachandran, Chuanhao Zhuge, Di He, Wei Zuo, Zuofu Cheng, Kyle Rupnow, Deming Chen:
Machine learning on FPGAs to face the IoT revolution. ICCAD 2017: 819-826
[c2]
Xiaofan Zhang, Anand Ramachandran, Chuanhao Zhuge, Di He, Wei Zuo, Zuofu Cheng, Kyle Rupnow, Deming Chen:
Machine learning on FPGAs to face the IoT revolution. ICCAD 2017: 894-901
[c1]
Di He, Zuofu Cheng, Mark Hasegawa-Johnson, Deming Chen:
Using Approximated Auditory Roughness as a Pre-Filtering Feature for Human Screaming and Affective Speech AED. INTERSPEECH 2017: 1914-1918
[i1]
Di He, Boon Pang Lim, Xuesong Yang, Mark Hasegawa-Johnson, Deming Chen:
Acoustic Landmarks Contain More Information About the Phone String than Other Frames. CoRR abs/1710.09985 (2017)