Ozlem Kalinli

2020 – today

2024
[c51]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SunAM0KPK24
Chuanneng Sun, Zeeshan Ahmed, Yingyi Ma, Zhe Liu, Lucas Kabela, Yutong Pang, Ozlem Kalinli:
Contextual Biasing of Named-Entities with Large Language Models. ICASSP 2024: 10151-10155
[c50]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShangguanYLWFWD24
Yuan Shangguan, Haichuan Yang, Danni Li, Chunyang Wu, Yassir Fathullah, Dilin Wang, Ayushi Dalmia, Raghuraman Krishnamoorthi, Ozlem Kalinli, Junteng Jia, Jay Mahadeokar, Xin Lei, Mike Seltzer, Vikas Chandra:
TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-Device ASR Models. ICASSP 2024: 10216-10220
[c49]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/0011K24
Zhe Liu, Ozlem Kalinli:
Forgetting Private Textual Sequences in Language Models Via Leave-One-Out Ensemble. ICASSP 2024: 10261-10265
[c48]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/VatsLSPMPAK24
Arpita Vats, Zhe Liu, Peng Su, Debjyoti Paul, Yingyi Ma, Yutong Pang, Zeeshan Ahmed, Ozlem Kalinli:
Recovering from Privacy-Preserving Masking with Large Language Models. ICASSP 2024: 10771-10775
[c47]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Ma0K24
Yingyi Ma, Zhe Liu, Ozlem Kalinli:
Correction Focused Language Model Training For Speech Recognition. ICASSP 2024: 10856-10860
[c46]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XieLGTSSWJMK24
Jiamin Xie, Ke Li, Jinxi Guo, Andros Tjandra, Yuan Shangguan, Leda Sari, Chunyang Wu, Junteng Jia, Jay Mahadeokar, Ozlem Kalinli:
Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of a Multilingual ASR Model. ICASSP 2024: 12201-12205
[c45]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LakomkinWFKSF24
Egor Lakomkin, Chunyang Wu, Yassir Fathullah, Ozlem Kalinli, Michael L. Seltzer, Christian Fuegen:
End-to-End Speech Recognition Contextualization with Large Language Models. ICASSP 2024: 12406-12410
[c44]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GuoMMSWMKFS24
Jinxi Guo, Niko Moritz, Yingyi Ma, Frank Seide, Chunyang Wu, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Effective Internal Language Model Training and Fusion for Factorized Transducer Model. ICASSP 2024: 12687-12691
[c43]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/FathullahWLJSLG24
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Ke Li, Jinxi Guo, Wenhan Xiong, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Prompting Large Language Models with Speech Recognition Abilities. ICASSP 2024: 13351-13355
[c42]
  persistent URL:
  - https://dblp.org/rec/conf/naacl/FathullahWLLJSM24
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Ke Li, Junteng Jia, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs. NAACL-HLT 2024: 5522-5532
[i42]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-01716
Jinxi Guo, Niko Moritz, Yingyi Ma, Frank Seide, Chunyang Wu, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Effective internal language model training and fusion for factorized transducer model. CoRR abs/2404.01716 (2024)
[i41]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-18108
Gil Keren, Wei Zhou, Ozlem Kalinli:
Token-Weighted RNN-T for Learning from Flawed Data. CoRR abs/2406.18108 (2024)
[i40]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-12734
Irina-Elena Veliche, Zhuangqun Huang, Vineeth Ayyat Kochaniyan, Fuchun Peng, Ozlem Kalinli, Michael L. Seltzer:
Towards measuring fairness in speech recognition: Fair-Speech dataset. CoRR abs/2408.12734 (2024)
[i39]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-08148
Desh Raj, Gil Keren, Junteng Jia, Jay Mahadeokar, Ozlem Kalinli:
Faster Speech-LLaMA Inference with Multi-token Prediction. CoRR abs/2409.08148 (2024)
[i38]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-11494
Yufeng Yang, Desh Raj, Ju Lin, Niko Moritz, Junteng Jia, Gil Keren, Egor Lakomkin, Yiteng Huang, Jacob Donley, Jay Mahadeokar, Ozlem Kalinli:
M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses. CoRR abs/2409.11494 (2024)
2023
[c41]
  persistent URL:
  - https://dblp.org/rec/conf/asru/JiaLMMMKS23
Junteng Jia, Ke Li, Mani Malek, Kshitiz Malik, Jay Mahadeokar, Ozlem Kalinli, Frank Seide:
Joint Federated Learning and Personalization for on-Device ASR. ASRU 2023: 1-8
[c40]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LeSWLSKS23
Duc Le, Frank Seide, Yuhao Wang, Yang Li, Kjell Schubert, Ozlem Kalinli, Michael L. Seltzer:
Factorized Blank Thresholding for Improved Runtime Efficiency of Neural Transducers. ICASSP 2023: 1-5
[c39]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiMGSKKSL23
Ke Li, Jay Mahadeokar, Jinxi Guo, Yangyang Shi, Gil Keren, Ozlem Kalinli, Michael L. Seltzer, Duc Le:
Improving fast-slow Encoder based Transducer with Streaming Deliberation. ICASSP 2023: 1-5
[c38]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/RajJMWMZK23
Desh Raj, Junteng Jia, Jay Mahadeokar, Chunyang Wu, Niko Moritz, Xiaohui Zhang, Ozlem Kalinli:
Anchored Speech Recognition with Neural Transducers. ICASSP 2023: 1-5
[c37]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TjandraSZKMLS23
Andros Tjandra, Nayan Singhal, David Zhang, Ozlem Kalinli, Abdelrahman Mohamed, Duc Le, Michael L. Seltzer:
Massively Multilingual ASR on 70 Languages: Tokenization, Architecture, and Generalization Capabilities. ICASSP 2023: 1-5
[c36]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YangTLZLK23
Mu Yang, Andros Tjandra, Chunxi Liu, David Zhang, Duc Le, Ozlem Kalinli:
Learning ASR Pathways: A Sparse Multilingual ASR Model. ICASSP 2023: 1-5
[c35]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FathullahWSJXML23
Yassir Fathullah, Chunyang Wu, Yuan Shangguan, Junteng Jia, Wenhan Xiong, Jay Mahadeokar, Chunxi Liu, Yangyang Shi, Ozlem Kalinli, Mike Seltzer, Mark J. F. Gales:
Multi-Head State Space Model for Speech Recognition. INTERSPEECH 2023: 241-245
[c34]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimSLLKS23
Suyoun Kim, Akshat Shrivastava, Duc Le, Ju Lin, Ozlem Kalinli, Michael L. Seltzer:
Modality Confidence Aware Training for Robust End-to-End Spoken Language Understanding. INTERSPEECH 2023: 1119-1123
[i37]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-12498
Yassir Fathullah, Chunyang Wu, Yuan Shangguan, Junteng Jia, Wenhan Xiong, Jay Mahadeokar, Chunxi Liu, Yangyang Shi, Ozlem Kalinli, Mike Seltzer, Mark J. F. Gales:
Multi-Head State Space Model for Speech Recognition. CoRR abs/2305.12498 (2023)
[i36]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-00998
Shuo Liu, Leda Sari, Chunyang Wu, Gil Keren, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli:
Towards Selection of Text-to-speech Data to Augment ASR Training. CoRR abs/2306.00998 (2023)
[i35]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-11795
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Ke Li, Jinxi Guo, Wenhan Xiong, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Prompting Large Language Models with Speech Recognition Abilities. CoRR abs/2307.11795 (2023)
[i34]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-12134
Suyoun Kim, Akshat Shrivastava, Duc Le, Ju Lin, Ozlem Kalinli, Michael L. Seltzer:
Modality Confidence Aware Training for Robust End-to-End Spoken Language Understanding. CoRR abs/2307.12134 (2023)
[i33]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-00723
Chuanneng Sun, Zeeshan Ahmed, Yingyi Ma, Zhe Liu, Lucas Kabela, Yutong Pang, Ozlem Kalinli:
Contextual Biasing of Named-Entities with Large Language Models. CoRR abs/2309.00723 (2023)
[i32]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-01947
Yuan Shangguan, Haichuan Yang, Danni Li, Chunyang Wu, Yassir Fathullah, Dilin Wang, Ayushi Dalmia, Raghuraman Krishnamoorthi, Ozlem Kalinli, Junteng Jia, Jay Mahadeokar, Xin Lei, Mike Seltzer, Vikas Chandra:
TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-device ASR Models. CoRR abs/2309.01947 (2023)
[i31]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-08628
Arpita Vats, Zhe Liu, Peng Su, Debjyoti Paul, Yingyi Ma, Yutong Pang, Zeeshan Ahmed, Ozlem Kalinli:
Recovering from Privacy-Preserving Masking with Large Language Models. CoRR abs/2309.08628 (2023)
[i30]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-09390
Roshan Sharma, Suyoun Kim, Daniel Lazar, Trang Le, Akshat Shrivastava, Kwanghoon Ahn, Piyush Kansal, Leda Sari, Ozlem Kalinli, Michael L. Seltzer:
Augmenting text for spoken language understanding with Large Language Models. CoRR abs/2309.09390 (2023)
[i29]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-10917
Egor Lakomkin, Chunyang Wu, Yassir Fathullah, Ozlem Kalinli, Michael L. Seltzer, Christian Fuegen:
End-to-End Speech Recognition Contextualization with Large Language Models. CoRR abs/2309.10917 (2023)
[i28]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-13018
Jiamin Xie, Ke Li, Jinxi Guo, Andros Tjandra, Yuan Shangguan, Leda Sari, Chunyang Wu, Junteng Jia, Jay Mahadeokar, Ozlem Kalinli:
Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of A Multilingual ASR Model. CoRR abs/2309.13018 (2023)
[i27]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-16082
Zhe Liu, Ozlem Kalinli:
Forgetting Private Textual Sequences in Language Models via Leave-One-Out Ensemble. CoRR abs/2309.16082 (2023)
[i26]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-11003
Yingyi Ma, Zhe Liu, Ozlem Kalinli:
Correction Focused Language Model Training for Speech Recognition. CoRR abs/2310.11003 (2023)
[i25]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-06753
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Towards General-Purpose Speech Abilities for Large Language Models Using Unpaired Data. CoRR abs/2311.06753 (2023)
2022
[c33]
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/KimLKHZKL22
Suyoun Kim, Ke Li, Lucas Kabela, Ron Huang, Jiedan Zhu, Ozlem Kalinli, Duc Le:
Joint Audio/Text Training for Transformer Rescorer of Streaming Speech Recognition. EMNLP (Findings) 2022: 5717-5722
[c32]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BruguierLPLLWCP22
Antoine Bruguier, Duc Le, Rohit Prabhavalkar, Dangna Li, Zhe Liu, Bo Wang, Eun Chang, Fuchun Peng, Ozlem Kalinli, Michael L. Seltzer:
Neural-FST Class Language Model for End-to-End Speech Recognition. ICASSP 2022: 6107-6111
[c31]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YangSWLCZVKC22
Haichuan Yang, Yuan Shangguan, Dilin Wang, Meng Li, Pierce Chuang, Xiaohui Zhang, Ganesh Venkatesh, Ozlem Kalinli, Vikas Chandra:
Omni-Sparsity DNN: Fast Sparsity Optimization for On-Device Streaming E2E ASR Via Supernet. ICASSP 2022: 8197-8201
[c30]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShiWWXMZLLSNKS22
Yangyang Shi, Chunyang Wu, Dilin Wang, Alex Xiao, Jay Mahadeokar, Xiaohui Zhang, Chunxi Liu, Ke Li, Yuan Shangguan, Varun Nagaraja, Ozlem Kalinli, Mike Seltzer:
Streaming Transformer Transducer based Speech Recognition Using Non-Causal Convolution. ICASSP 2022: 8277-8281
[c29]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JiaMZSKS22
Junteng Jia, Jay Mahadeokar, Weiyi Zheng, Yuan Shangguan, Ozlem Kalinli, Frank Seide:
Federated Domain Adaptation for ASR with Full Self-Supervision. INTERSPEECH 2022: 536-540
[c28]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MahadeokarSLLZC22
Jay Mahadeokar, Yangyang Shi, Ke Li, Duc Le, Jiedan Zhu, Vikas Chandra, Ozlem Kalinli, Michael L. Seltzer:
Streaming parallel transducer beam search with fast slow cascaded encoders. INTERSPEECH 2022: 2083-2087
[c27]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeSTKLKS22
Duc Le, Akshat Shrivastava, Paden D. Tomasello, Suyoun Kim, Aleksandr Livshits, Ozlem Kalinli, Michael L. Seltzer:
Deliberation Model for On-Device Spoken Language Understanding. INTERSPEECH 2022: 3468-3472
[c26]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimLZSAZFKS22
Suyoun Kim, Duc Le, Weiyi Zheng, Tarun Singh, Abhinav Arora, Xiaoyu Zhai, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric. INTERSPEECH 2022: 3978-3982
[c25]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhengXKL0FKSM22
Weiyi Zheng, Alex Xiao, Gil Keren, Duc Le, Frank Zhang, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Abdelrahman Mohamed:
Scaling ASR Improves Zero and Few Shot Learning. INTERSPEECH 2022: 5135-5139
[c24]
  persistent URL:
  - https://dblp.org/rec/conf/slt/LiuSYSKK22
Chunxi Liu, Yuan Shangguan, Haichuan Yang, Yangyang Shi, Raghuraman Krishnamoorthi, Ozlem Kalinli:
Learning a Dual-Mode Speech Recognition Model VIA Self-Pruning. SLT 2022: 273-279
[i24]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-11867
Antoine Bruguier, Duc Le, Rohit Prabhavalkar, Dangna Li, Zhe Liu, Bo Wang, Eun Chang, Fuchun Peng, Ozlem Kalinli, Michael L. Seltzer:
Neural-FST Class Language Model for End-to-End Speech Recognition. CoRR abs/2201.11867 (2022)
[i23]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-15773
Jay Mahadeokar, Yangyang Shi, Ke Li, Duc Le, Jiedan Zhu, Vikas Chandra, Ozlem Kalinli, Michael L. Seltzer:
Streaming parallel transducer beam search with fast-slow cascaded encoders. CoRR abs/2203.15773 (2022)
[i22]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-15966
Junteng Jia, Jay Mahadeokar, Weiyi Zheng, Yuan Shangguan, Ozlem Kalinli, Frank Seide:
Federated Domain Adaptation for ASR with Full Self-Supervision. CoRR abs/2203.15966 (2022)
[i21]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-01893
Duc Le, Akshat Shrivastava, Paden Tomasello, Suyoun Kim, Aleksandr Livshits, Ozlem Kalinli, Michael L. Seltzer:
Deliberation Model for On-Device Spoken Language Understanding. CoRR abs/2204.01893 (2022)
[i20]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-11906
Chunxi Liu, Yuan Shangguan, Haichuan Yang, Yangyang Shi, Raghuraman Krishnamoorthi, Ozlem Kalinli:
Learning a Dual-Mode Speech Recognition Model via Self-Pruning. CoRR abs/2207.11906 (2022)
[i19]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-05735
Mu Yang, Andros Tjandra, Chunxi Liu, David Zhang, Duc Le, John H. L. Hansen, Ozlem Kalinli:
Learning ASR pathways: A sparse multilingual ASR model. CoRR abs/2209.05735 (2022)
[i18]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-11588
Desh Raj, Junteng Jia, Jay Mahadeokar, Chunyang Wu, Niko Moritz, Xiaohui Zhang, Ozlem Kalinli:
Anchored Speech Recognition with Neural Transducers. CoRR abs/2210.11588 (2022)
[i17]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-00174
Suyoun Kim, Ke Li, Lucas Kabela, Rongqing Huang, Jiedan Zhu, Ozlem Kalinli, Duc Le:
Joint Audio/Text Training for Transformer Rescorer of Streaming Speech Recognition. CoRR abs/2211.00174 (2022)
[i16]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-00896
Duc Le, Frank Seide, Yuhao Wang, Yang Li, Kjell Schubert, Ozlem Kalinli, Michael L. Seltzer:
Factorized Blank Thresholding for Improved Runtime Efficiency of Neural Transducers. CoRR abs/2211.00896 (2022)
[i15]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-05756
Andros Tjandra, Nayan Singhal, David Zhang, Ozlem Kalinli, Abdelrahman Mohamed, Duc Le, Michael L. Seltzer:
Massively Multilingual ASR on 70 Languages: Tokenization, Architecture, and Generalization Capabilities. CoRR abs/2211.05756 (2022)
2021
[c23]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuSCKBHKT21
Ting-Yao Hu, Ashish Shrivastava, Jen-Hao Rick Chang, Hema Koppula, Stefan Braun, Kyuyeon Hwang, Ozlem Kalinli, Oncel Tuzel:
SapAugment: Learning A Sample Adaptive Policy for Data Augmentation. ICASSP 2021: 4040-4044
[c22]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuXSKFKH21
Chunyang Wu, Zhiping Xiu, Yangyang Shi, Ozlem Kalinli, Christian Fuegen, Thilo Köhler, Qing He:
Transformer-Based Acoustic Modeling for Streaming Speech Synthesis. Interspeech 2021: 146-150
[c21]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeJKKSMCSFKSS21
Duc Le, Mahaveer Jain, Gil Keren, Suyoun Kim, Yangyang Shi, Jay Mahadeokar, Julian Chan, Yuan Shangguan, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Michael L. Seltzer:
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion. Interspeech 2021: 1772-1776
[c20]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimALYFKS21
Suyoun Kim, Abhinav Arora, Duc Le, Ching-Feng Yeh, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding. Interspeech 2021: 1977-1981
[c19]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShiNWMLPXYCFKS21
Yangyang Shi, Varun Nagaraja, Chunyang Wu, Jay Mahadeokar, Duc Le, Rohit Prabhavalkar, Alex Xiao, Ching-Feng Yeh, Julian Chan, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Dynamic Encoder Transducer: A Flexible Solution for Trading Off Accuracy for Latency. Interspeech 2021: 2042-2046
[c18]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MahadeokarSSWXS21
Jay Mahadeokar, Yangyang Shi, Yuan Shangguan, Chunyang Wu, Alex Xiao, Hang Su, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios. Interspeech 2021: 2107-2111
[c17]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShangguanPSMSZW21
Yuan Shangguan, Rohit Prabhavalkar, Hang Su, Jay Mahadeokar, Yangyang Shi, Jiatong Zhou, Chunyang Wu, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Dissecting User-Perceived Latency of On-Device E2E Speech Recognition. Interspeech 2021: 4553-4557
[c16]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NagarajaSVKSC21
Varun Nagaraja, Yangyang Shi, Ganesh Venkatesh, Ozlem Kalinli, Michael L. Seltzer, Vikas Chandra:
Collaborative Training of Acoustic Encoders for Speech Recognition. Interspeech 2021: 4573-4577
[i14]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-02138
Suyoun Kim, Abhinav Arora, Duc Le, Ching-Feng Yeh, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding. CoRR abs/2104.02138 (2021)
[i13]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-02176
Yangyang Shi, Varun Nagaraja, Chunyang Wu, Jay Mahadeokar, Duc Le, Rohit Prabhavalkar, Alex Xiao, Ching-Feng Yeh, Julian Chan, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency. CoRR abs/2104.02176 (2021)
[i12]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-02194
Duc Le, Mahaveer Jain, Gil Keren, Suyoun Kim, Yangyang Shi, Jay Mahadeokar, Julian Chan, Yuan Shangguan, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Michael L. Seltzer:
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion. CoRR abs/2104.02194 (2021)
[i11]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-02207
Yuan Shangguan, Rohit Prabhavalkar, Hang Su, Jay Mahadeokar, Yangyang Shi, Jiatong Zhou, Chunyang Wu, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Dissecting User-Perceived Latency of On-Device E2E Speech Recognition. CoRR abs/2104.02207 (2021)
[i10]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-02232
Jay Mahadeokar, Yangyang Shi, Yuan Shangguan, Chunyang Wu, Alex Xiao, Hang Su, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios. CoRR abs/2104.02232 (2021)
[i9]
  dblp key:
  - journals/corr/abs-2106-08960
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-08960
Varun Nagaraja, Yangyang Shi, Ganesh Venkatesh, Ozlem Kalinli, Michael L. Seltzer, Vikas Chandra:
Collaborative Training of Acoustic Encoders for Speech Recognition. CoRR abs/2106.08960 (2021)
[i8]
  dblp key:
  - journals/corr/abs-2107-04677
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-04677
Dilin Wang, Yuan Shangguan, Haichuan Yang, Pierce Chuang, Jiatong Zhou, Meng Li, Ganesh Venkatesh, Ozlem Kalinli, Vikas Chandra:
Noisy Training Improves E2E ASR for the Edge. CoRR abs/2107.04677 (2021)
[i7]
  dblp key:
  - journals/corr/abs-2110-03174
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-03174
Dawei Liang, Yangyang Shi, Yun Wang, Nayan Singhal, Alex Xiao, Jonathan Shaw, Edison Thomaz, Ozlem Kalinli, Mike Seltzer:
Transferring Voice Knowledge for Acoustic Event Detection: An Empirical Study. CoRR abs/2110.03174 (2021)
[i6]
  dblp key:
  - journals/corr/abs-2110-05241
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-05241
Yangyang Shi, Chunyang Wu, Dilin Wang, Alex Xiao, Jay Mahadeokar, Xiaohui Zhang, Chunxi Liu, Ke Li, Yuan Shangguan, Varun Nagaraja, Ozlem Kalinli, Mike Seltzer:
Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution. CoRR abs/2110.05241 (2021)
[i5]
  dblp key:
  - journals/corr/abs-2110-05376
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-05376
Suyoun Kim, Duc Le, Weiyi Zheng, Tarun Singh, Abhinav Arora, Xiaoyu Zhai, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric. CoRR abs/2110.05376 (2021)
[i4]
  dblp key:
  - journals/corr/abs-2110-08352
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-08352
Haichuan Yang, Yuan Shangguan, Dilin Wang, Meng Li, Pierce Chuang, Xiaohui Zhang, Ganesh Venkatesh, Ozlem Kalinli, Vikas Chandra:
Omni-sparsity DNN: Fast Sparsity Optimization for On-Device Streaming E2E ASR via Supernet. CoRR abs/2110.08352 (2021)
[i3]
  dblp key:
  - journals/corr/abs-2111-05948
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-05948
Alex Xiao, Weiyi Zheng, Gil Keren, Duc Le, Frank Zhang, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Abdelrahman Mohamed:
Scaling ASR Improves Zero and Few Shot Learning. CoRR abs/2111.05948 (2021)
2020
[i2]
  dblp key:
  - journals/corr/abs-2011-01156
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-01156
Ting-Yao Hu, Ashish Shrivastava, Jen-Hao Rick Chang, Hema Koppula, Stefan Braun, Kyuyeon Hwang, Ozlem Kalinli, Oncel Tuzel:
SapAugment: Learning A Sample Adaptive Policy for Data Augmentation. CoRR abs/2011.01156 (2020)

2010 – 2019

2019
[c15]
  dblp key:
  - conf/icassp/KalinliBW19
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KalinliBW19
Ozlem Kalinli, Gautam Bhattacharya, Chao Weng:
Parametric Cepstral Mean Normalization for Robust Speech Recognition. ICASSP 2019: 6735-6739
[c14]
  dblp key:
  - conf/interspeech/MantenaKAM19
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MantenaKAM19
Gautam Mantena, Ozlem Kalinli, Ossama Abdel-Hamid, Don McAllaster:
Bandwidth Embeddings for Mixed-Bandwidth Speech Recognition. INTERSPEECH 2019: 3203-3207
[i1]
  dblp key:
  - journals/corr/abs-1909-02667
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-02667
Gautam Mantena, Ozlem Kalinli, Ossama Abdel-Hamid, Don McAllaster:
Bandwidth Embeddings for Mixed-bandwidth Speech Recognition. CoRR abs/1909.02667 (2019)
2016
[c13]
  dblp key:
  - conf/interspeech/Kalinli16
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Kalinli16
Ozlem Kalinli:
Analysis of Multi-Lingual Emotion Recognition Using Auditory Attention Features. INTERSPEECH 2016: 3613-3617
2015
[c12]
  dblp key:
  - conf/interspeech/MehrabaniKC15
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MehrabaniKC15
Mahnoosh Mehrabani, Ozlem Kalinli, Ruxin Chen:
Emotion clustering based on probabilistic linear discriminant analysis. INTERSPEECH 2015: 1314-1318
2013
[c11]
  dblp key:
  - conf/interspeech/Kalinli13
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Kalinli13
Ozlem Kalinli:
Combination of auditory attention features with phone posteriors for better automatic phoneme segmentation. INTERSPEECH 2013: 2302-2305
2012
[c10]
  dblp key:
  - conf/interspeech/Kalinli12
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Kalinli12
Ozlem Kalinli:
Automatic Phoneme Segmentation Using Auditory Attention Features. INTERSPEECH 2012: 2270-2273
2011
[c9]
  dblp key:
  - conf/icassp/Kalinli11
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Kalinli11
Ozlem Kalinli:
Tone and pitch accent classification using auditory attention cues. ICASSP 2011: 5208-5211
[c8]
  dblp key:
  - conf/interspeech/Kalinli11
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Kalinli11
Ozlem Kalinli:
Syllable Segmentation of Continuous Speech Using Auditory Attention Cues. INTERSPEECH 2011: 425-428
2010
[j2]
  dblp key:
  - journals/taslp/KalinliSDA10
  persistent URL:
  - https://dblp.org/rec/journals/taslp/KalinliSDA10
Ozlem Kalinli, Michael L. Seltzer, Jasha Droppo, Alex Acero:
Noise Adaptive Training for Robust Automatic Speech Recognition. IEEE Trans. Speech Audio Process. 18(8): 1889-1901 (2010)

2000 – 2009

2009
[j1]
  dblp key:
  - journals/taslp/KalinliN09
  persistent URL:
  - https://dblp.org/rec/journals/taslp/KalinliN09
Ozlem Kalinli, Shrikanth S. Narayanan:
Prominence Detection Using Auditory Attention Cues and Task-Dependent High Level Information. IEEE Trans. Speech Audio Process. 17(5): 1009-1024 (2009)
[c7]
  dblp key:
  - conf/icassp/KalinliSA09
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KalinliSA09
Ozlem Kalinli, Michael L. Seltzer, Alex Acero:
Noise adaptive training using a vector taylor series approach for noise robust automatic speech recognition. ICASSP 2009: 3825-3828
[c6]
  dblp key:
  - conf/interspeech/KalinliN09
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KalinliN09
Ozlem Kalinli, Shrikanth S. Narayanan:
Continuous speech recognition using attention shift decoding with soft decision. INTERSPEECH 2009: 1927-1930
[c5]
  dblp key:
  - conf/mmsp/KalinliSN09
  persistent URL:
  - https://dblp.org/rec/conf/mmsp/KalinliSN09
Ozlem Kalinli, Shiva Sundaram, Shrikanth S. Narayanan:
Saliency-driven unstructured acoustic scene classification using latent perceptual indexing. MMSP 2009: 1-6
2008
[c4]
  dblp key:
  - conf/icassp/KalinliN08
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KalinliN08
Ozlem Kalinli, Shrikanth S. Narayanan:
A top-down auditory attention model for learning task dependent influences on prominence detection in speech. ICASSP 2008: 3981-3984
[c3]
  dblp key:
  - conf/interspeech/KalinliN08
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KalinliN08
Ozlem Kalinli, Shrikanth S. Narayanan:
Combining task-dependent information with auditory attention cues for prominence detection in speech. INTERSPEECH 2008: 1064-1067
2007
[c2]
  dblp key:
  - conf/eusipco/KalinliN07
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/KalinliN07
Ozlem Kalinli, Shrikanth S. Narayanan:
Early auditory processing inspired features for robust automatic speech recognition. EUSIPCO 2007: 2385-2389
[c1]
  dblp key:
  - conf/interspeech/KalinliN07
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KalinliN07
Ozlem Kalinli, Shrikanth S. Narayanan:
A saliency-based auditory attention model with applications to unsupervised prominent syllable detection in speech. INTERSPEECH 2007: 1941-1944
