default search action
Ozlem Kalinli
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c51]Chuanneng Sun, Zeeshan Ahmed, Yingyi Ma, Zhe Liu, Lucas Kabela, Yutong Pang, Ozlem Kalinli:
Contextual Biasing of Named-Entities with Large Language Models. ICASSP 2024: 10151-10155 - [c50]Yuan Shangguan, Haichuan Yang, Danni Li, Chunyang Wu, Yassir Fathullah, Dilin Wang, Ayushi Dalmia, Raghuraman Krishnamoorthi, Ozlem Kalinli, Junteng Jia, Jay Mahadeokar, Xin Lei, Mike Seltzer, Vikas Chandra:
TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-Device ASR Models. ICASSP 2024: 10216-10220 - [c49]Zhe Liu, Ozlem Kalinli:
Forgetting Private Textual Sequences in Language Models Via Leave-One-Out Ensemble. ICASSP 2024: 10261-10265 - [c48]Arpita Vats, Zhe Liu, Peng Su, Debjyoti Paul, Yingyi Ma, Yutong Pang, Zeeshan Ahmed, Ozlem Kalinli:
Recovering from Privacy-Preserving Masking with Large Language Models. ICASSP 2024: 10771-10775 - [c47]Yingyi Ma, Zhe Liu, Ozlem Kalinli:
Correction Focused Language Model Training For Speech Recognition. ICASSP 2024: 10856-10860 - [c46]Jiamin Xie, Ke Li, Jinxi Guo, Andros Tjandra, Yuan Shangguan, Leda Sari, Chunyang Wu, Junteng Jia, Jay Mahadeokar, Ozlem Kalinli:
Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of a Multilingual ASR Model. ICASSP 2024: 12201-12205 - [c45]Egor Lakomkin, Chunyang Wu, Yassir Fathullah, Ozlem Kalinli, Michael L. Seltzer, Christian Fuegen:
End-to-End Speech Recognition Contextualization with Large Language Models. ICASSP 2024: 12406-12410 - [c44]Jinxi Guo, Niko Moritz, Yingyi Ma, Frank Seide, Chunyang Wu, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Effective Internal Language Model Training and Fusion for Factorized Transducer Model. ICASSP 2024: 12687-12691 - [c43]Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Ke Li, Jinxi Guo, Wenhan Xiong, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Prompting Large Language Models with Speech Recognition Abilities. ICASSP 2024: 13351-13355 - [c42]Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Ke Li, Junteng Jia, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs. NAACL-HLT 2024: 5522-5532 - [i42]Jinxi Guo, Niko Moritz, Yingyi Ma, Frank Seide, Chunyang Wu, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Effective internal language model training and fusion for factorized transducer model. CoRR abs/2404.01716 (2024) - [i41]Gil Keren, Wei Zhou, Ozlem Kalinli:
Token-Weighted RNN-T for Learning from Flawed Data. CoRR abs/2406.18108 (2024) - [i40]Irina-Elena Veliche, Zhuangqun Huang, Vineeth Ayyat Kochaniyan, Fuchun Peng, Ozlem Kalinli, Michael L. Seltzer:
Towards measuring fairness in speech recognition: Fair-Speech dataset. CoRR abs/2408.12734 (2024) - [i39]Desh Raj, Gil Keren, Junteng Jia, Jay Mahadeokar, Ozlem Kalinli:
Faster Speech-LLaMA Inference with Multi-token Prediction. CoRR abs/2409.08148 (2024) - [i38]Yufeng Yang, Desh Raj, Ju Lin, Niko Moritz, Junteng Jia, Gil Keren, Egor Lakomkin, Yiteng Huang, Jacob Donley, Jay Mahadeokar, Ozlem Kalinli:
M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses. CoRR abs/2409.11494 (2024) - 2023
- [c41]Junteng Jia, Ke Li, Mani Malek, Kshitiz Malik, Jay Mahadeokar, Ozlem Kalinli, Frank Seide:
Joint Federated Learning and Personalization for on-Device ASR. ASRU 2023: 1-8 - [c40]Duc Le, Frank Seide, Yuhao Wang, Yang Li, Kjell Schubert, Ozlem Kalinli, Michael L. Seltzer:
Factorized Blank Thresholding for Improved Runtime Efficiency of Neural Transducers. ICASSP 2023: 1-5 - [c39]Ke Li, Jay Mahadeokar, Jinxi Guo, Yangyang Shi, Gil Keren, Ozlem Kalinli, Michael L. Seltzer, Duc Le:
Improving fast-slow Encoder based Transducer with Streaming Deliberation. ICASSP 2023: 1-5 - [c38]Desh Raj, Junteng Jia, Jay Mahadeokar, Chunyang Wu, Niko Moritz, Xiaohui Zhang, Ozlem Kalinli:
Anchored Speech Recognition with Neural Transducers. ICASSP 2023: 1-5 - [c37]Andros Tjandra, Nayan Singhal, David Zhang, Ozlem Kalinli, Abdelrahman Mohamed, Duc Le, Michael L. Seltzer:
Massively Multilingual ASR on 70 Languages: Tokenization, Architecture, and Generalization Capabilities. ICASSP 2023: 1-5 - [c36]Mu Yang, Andros Tjandra, Chunxi Liu, David Zhang, Duc Le, Ozlem Kalinli:
Learning ASR Pathways: A Sparse Multilingual ASR Model. ICASSP 2023: 1-5 - [c35]Yassir Fathullah, Chunyang Wu, Yuan Shangguan, Junteng Jia, Wenhan Xiong, Jay Mahadeokar, Chunxi Liu, Yangyang Shi, Ozlem Kalinli, Mike Seltzer, Mark J. F. Gales:
Multi-Head State Space Model for Speech Recognition. INTERSPEECH 2023: 241-245 - [c34]Suyoun Kim, Akshat Shrivastava, Duc Le, Ju Lin, Ozlem Kalinli, Michael L. Seltzer:
Modality Confidence Aware Training for Robust End-to-End Spoken Language Understanding. INTERSPEECH 2023: 1119-1123 - [i37]Yassir Fathullah, Chunyang Wu, Yuan Shangguan, Junteng Jia, Wenhan Xiong, Jay Mahadeokar, Chunxi Liu, Yangyang Shi, Ozlem Kalinli, Mike Seltzer, Mark J. F. Gales:
Multi-Head State Space Model for Speech Recognition. CoRR abs/2305.12498 (2023) - [i36]Shuo Liu, Leda Sari, Chunyang Wu, Gil Keren, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli:
Towards Selection of Text-to-speech Data to Augment ASR Training. CoRR abs/2306.00998 (2023) - [i35]Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Ke Li, Jinxi Guo, Wenhan Xiong, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Prompting Large Language Models with Speech Recognition Abilities. CoRR abs/2307.11795 (2023) - [i34]Suyoun Kim, Akshat Shrivastava, Duc Le, Ju Lin, Ozlem Kalinli, Michael L. Seltzer:
Modality Confidence Aware Training for Robust End-to-End Spoken Language Understanding. CoRR abs/2307.12134 (2023) - [i33]Chuanneng Sun, Zeeshan Ahmed, Yingyi Ma, Zhe Liu, Lucas Kabela, Yutong Pang, Ozlem Kalinli:
Contextual Biasing of Named-Entities with Large Language Models. CoRR abs/2309.00723 (2023) - [i32]Yuan Shangguan, Haichuan Yang, Danni Li, Chunyang Wu, Yassir Fathullah, Dilin Wang, Ayushi Dalmia, Raghuraman Krishnamoorthi, Ozlem Kalinli, Junteng Jia, Jay Mahadeokar, Xin Lei, Mike Seltzer, Vikas Chandra:
TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-device ASR Models. CoRR abs/2309.01947 (2023) - [i31]Arpita Vats, Zhe Liu, Peng Su, Debjyoti Paul, Yingyi Ma, Yutong Pang, Zeeshan Ahmed, Ozlem Kalinli:
Recovering from Privacy-Preserving Masking with Large Language Models. CoRR abs/2309.08628 (2023) - [i30]Roshan Sharma, Suyoun Kim, Daniel Lazar, Trang Le, Akshat Shrivastava, Kwanghoon Ahn, Piyush Kansal, Leda Sari, Ozlem Kalinli, Michael L. Seltzer:
Augmenting text for spoken language understanding with Large Language Models. CoRR abs/2309.09390 (2023) - [i29]Egor Lakomkin, Chunyang Wu, Yassir Fathullah, Ozlem Kalinli, Michael L. Seltzer, Christian Fuegen:
End-to-End Speech Recognition Contextualization with Large Language Models. CoRR abs/2309.10917 (2023) - [i28]Jiamin Xie, Ke Li, Jinxi Guo, Andros Tjandra, Yuan Shangguan, Leda Sari, Chunyang Wu, Junteng Jia, Jay Mahadeokar, Ozlem Kalinli:
Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of A Multilingual ASR Model. CoRR abs/2309.13018 (2023) - [i27]Zhe Liu, Ozlem Kalinli:
Forgetting Private Textual Sequences in Language Models via Leave-One-Out Ensemble. CoRR abs/2309.16082 (2023) - [i26]Yingyi Ma, Zhe Liu, Ozlem Kalinli:
Correction Focused Language Model Training for Speech Recognition. CoRR abs/2310.11003 (2023) - [i25]Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Towards General-Purpose Speech Abilities for Large Language Models Using Unpaired Data. CoRR abs/2311.06753 (2023) - 2022
- [c33]Suyoun Kim, Ke Li, Lucas Kabela, Ron Huang, Jiedan Zhu, Ozlem Kalinli, Duc Le:
Joint Audio/Text Training for Transformer Rescorer of Streaming Speech Recognition. EMNLP (Findings) 2022: 5717-5722 - [c32]Antoine Bruguier, Duc Le, Rohit Prabhavalkar, Dangna Li, Zhe Liu, Bo Wang, Eun Chang, Fuchun Peng, Ozlem Kalinli, Michael L. Seltzer:
Neural-FST Class Language Model for End-to-End Speech Recognition. ICASSP 2022: 6107-6111 - [c31]Haichuan Yang, Yuan Shangguan, Dilin Wang, Meng Li, Pierce Chuang, Xiaohui Zhang, Ganesh Venkatesh, Ozlem Kalinli, Vikas Chandra:
Omni-Sparsity DNN: Fast Sparsity Optimization for On-Device Streaming E2E ASR Via Supernet. ICASSP 2022: 8197-8201 - [c30]Yangyang Shi, Chunyang Wu, Dilin Wang, Alex Xiao, Jay Mahadeokar, Xiaohui Zhang, Chunxi Liu, Ke Li, Yuan Shangguan, Varun Nagaraja, Ozlem Kalinli, Mike Seltzer:
Streaming Transformer Transducer based Speech Recognition Using Non-Causal Convolution. ICASSP 2022: 8277-8281 - [c29]Junteng Jia, Jay Mahadeokar, Weiyi Zheng, Yuan Shangguan, Ozlem Kalinli, Frank Seide:
Federated Domain Adaptation for ASR with Full Self-Supervision. INTERSPEECH 2022: 536-540 - [c28]Jay Mahadeokar, Yangyang Shi, Ke Li, Duc Le, Jiedan Zhu, Vikas Chandra, Ozlem Kalinli, Michael L. Seltzer:
Streaming parallel transducer beam search with fast slow cascaded encoders. INTERSPEECH 2022: 2083-2087 - [c27]Duc Le, Akshat Shrivastava, Paden D. Tomasello, Suyoun Kim, Aleksandr Livshits, Ozlem Kalinli, Michael L. Seltzer:
Deliberation Model for On-Device Spoken Language Understanding. INTERSPEECH 2022: 3468-3472 - [c26]Suyoun Kim, Duc Le, Weiyi Zheng, Tarun Singh, Abhinav Arora, Xiaoyu Zhai, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric. INTERSPEECH 2022: 3978-3982 - [c25]Weiyi Zheng, Alex Xiao, Gil Keren, Duc Le, Frank Zhang, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Abdelrahman Mohamed:
Scaling ASR Improves Zero and Few Shot Learning. INTERSPEECH 2022: 5135-5139 - [c24]Chunxi Liu, Yuan Shangguan, Haichuan Yang, Yangyang Shi, Raghuraman Krishnamoorthi, Ozlem Kalinli:
Learning a Dual-Mode Speech Recognition Model VIA Self-Pruning. SLT 2022: 273-279 - [i24]Antoine Bruguier, Duc Le, Rohit Prabhavalkar, Dangna Li, Zhe Liu, Bo Wang, Eun Chang, Fuchun Peng, Ozlem Kalinli, Michael L. Seltzer:
Neural-FST Class Language Model for End-to-End Speech Recognition. CoRR abs/2201.11867 (2022) - [i23]Jay Mahadeokar, Yangyang Shi, Ke Li, Duc Le, Jiedan Zhu, Vikas Chandra, Ozlem Kalinli, Michael L. Seltzer:
Streaming parallel transducer beam search with fast-slow cascaded encoders. CoRR abs/2203.15773 (2022) - [i22]Junteng Jia, Jay Mahadeokar, Weiyi Zheng, Yuan Shangguan, Ozlem Kalinli, Frank Seide:
Federated Domain Adaptation for ASR with Full Self-Supervision. CoRR abs/2203.15966 (2022) - [i21]Duc Le, Akshat Shrivastava, Paden Tomasello, Suyoun Kim, Aleksandr Livshits, Ozlem Kalinli, Michael L. Seltzer:
Deliberation Model for On-Device Spoken Language Understanding. CoRR abs/2204.01893 (2022) - [i20]Chunxi Liu, Yuan Shangguan, Haichuan Yang, Yangyang Shi, Raghuraman Krishnamoorthi, Ozlem Kalinli:
Learning a Dual-Mode Speech Recognition Model via Self-Pruning. CoRR abs/2207.11906 (2022) - [i19]Mu Yang, Andros Tjandra, Chunxi Liu, David Zhang, Duc Le, John H. L. Hansen, Ozlem Kalinli:
Learning ASR pathways: A sparse multilingual ASR model. CoRR abs/2209.05735 (2022) - [i18]Desh Raj, Junteng Jia, Jay Mahadeokar, Chunyang Wu, Niko Moritz, Xiaohui Zhang, Ozlem Kalinli:
Anchored Speech Recognition with Neural Transducers. CoRR abs/2210.11588 (2022) - [i17]Suyoun Kim, Ke Li, Lucas Kabela, Rongqing Huang, Jiedan Zhu, Ozlem Kalinli, Duc Le:
Joint Audio/Text Training for Transformer Rescorer of Streaming Speech Recognition. CoRR abs/2211.00174 (2022) - [i16]Duc Le, Frank Seide, Yuhao Wang, Yang Li, Kjell Schubert, Ozlem Kalinli, Michael L. Seltzer:
Factorized Blank Thresholding for Improved Runtime Efficiency of Neural Transducers. CoRR abs/2211.00896 (2022) - [i15]Andros Tjandra, Nayan Singhal, David Zhang, Ozlem Kalinli, Abdelrahman Mohamed, Duc Le, Michael L. Seltzer:
Massively Multilingual ASR on 70 Languages: Tokenization, Architecture, and Generalization Capabilities. CoRR abs/2211.05756 (2022) - 2021
- [c23]Ting-Yao Hu, Ashish Shrivastava, Jen-Hao Rick Chang, Hema Koppula, Stefan Braun, Kyuyeon Hwang, Ozlem Kalinli, Oncel Tuzel:
SapAugment: Learning A Sample Adaptive Policy for Data Augmentation. ICASSP 2021: 4040-4044 - [c22]Chunyang Wu, Zhiping Xiu, Yangyang Shi, Ozlem Kalinli, Christian Fuegen, Thilo Köhler, Qing He:
Transformer-Based Acoustic Modeling for Streaming Speech Synthesis. Interspeech 2021: 146-150 - [c21]Duc Le, Mahaveer Jain, Gil Keren, Suyoun Kim, Yangyang Shi, Jay Mahadeokar, Julian Chan, Yuan Shangguan, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Michael L. Seltzer:
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion. Interspeech 2021: 1772-1776 - [c20]Suyoun Kim, Abhinav Arora, Duc Le, Ching-Feng Yeh, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding. Interspeech 2021: 1977-1981 - [c19]Yangyang Shi, Varun Nagaraja, Chunyang Wu, Jay Mahadeokar, Duc Le, Rohit Prabhavalkar, Alex Xiao, Ching-Feng Yeh, Julian Chan, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Dynamic Encoder Transducer: A Flexible Solution for Trading Off Accuracy for Latency. Interspeech 2021: 2042-2046 - [c18]Jay Mahadeokar, Yangyang Shi, Yuan Shangguan, Chunyang Wu, Alex Xiao, Hang Su, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios. Interspeech 2021: 2107-2111 - [c17]Yuan Shangguan, Rohit Prabhavalkar, Hang Su, Jay Mahadeokar, Yangyang Shi, Jiatong Zhou, Chunyang Wu, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Dissecting User-Perceived Latency of On-Device E2E Speech Recognition. Interspeech 2021: 4553-4557 - [c16]Varun Nagaraja, Yangyang Shi, Ganesh Venkatesh, Ozlem Kalinli, Michael L. Seltzer, Vikas Chandra:
Collaborative Training of Acoustic Encoders for Speech Recognition. Interspeech 2021: 4573-4577 - [i14]Suyoun Kim, Abhinav Arora, Duc Le, Ching-Feng Yeh, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding. CoRR abs/2104.02138 (2021) - [i13]Yangyang Shi, Varun Nagaraja, Chunyang Wu, Jay Mahadeokar, Duc Le, Rohit Prabhavalkar, Alex Xiao, Ching-Feng Yeh, Julian Chan, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency. CoRR abs/2104.02176 (2021) - [i12]Duc Le, Mahaveer Jain, Gil Keren, Suyoun Kim, Yangyang Shi, Jay Mahadeokar, Julian Chan, Yuan Shangguan, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Michael L. Seltzer:
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion. CoRR abs/2104.02194 (2021) - [i11]Yuan Shangguan, Rohit Prabhavalkar, Hang Su, Jay Mahadeokar, Yangyang Shi, Jiatong Zhou, Chunyang Wu, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Dissecting User-Perceived Latency of On-Device E2E Speech Recognition. CoRR abs/2104.02207 (2021) - [i10]Jay Mahadeokar, Yangyang Shi, Yuan Shangguan, Chunyang Wu, Alex Xiao, Hang Su, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Flexi-Transducer: Optimizing Latency, Accuracy and Compute forMulti-Domain On-Device Scenarios. CoRR abs/2104.02232 (2021) - [i9]Varun Nagaraja, Yangyang Shi, Ganesh Venkatesh, Ozlem Kalinli, Michael L. Seltzer, Vikas Chandra:
Collaborative Training of Acoustic Encoders for Speech Recognition. CoRR abs/2106.08960 (2021) - [i8]Dilin Wang, Yuan Shangguan, Haichuan Yang, Pierce Chuang, Jiatong Zhou, Meng Li, Ganesh Venkatesh, Ozlem Kalinli, Vikas Chandra:
Noisy Training Improves E2E ASR for the Edge. CoRR abs/2107.04677 (2021) - [i7]Dawei Liang, Yangyang Shi, Yun Wang, Nayan Singhal, Alex Xiao, Jonathan Shaw, Edison Thomaz, Ozlem Kalinli, Mike Seltzer:
Transferring Voice Knowledge for Acoustic Event Detection: An Empirical Study. CoRR abs/2110.03174 (2021) - [i6]Yangyang Shi, Chunyang Wu, Dilin Wang, Alex Xiao, Jay Mahadeokar, Xiaohui Zhang, Chunxi Liu, Ke Li, Yuan Shangguan, Varun Nagaraja, Ozlem Kalinli, Mike Seltzer:
Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution. CoRR abs/2110.05241 (2021) - [i5]Suyoun Kim, Duc Le, Weiyi Zheng, Tarun Singh, Abhinav Arora, Xiaoyu Zhai, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric. CoRR abs/2110.05376 (2021) - [i4]Haichuan Yang, Yuan Shangguan, Dilin Wang, Meng Li, Pierce Chuang, Xiaohui Zhang, Ganesh Venkatesh, Ozlem Kalinli, Vikas Chandra:
Omni-sparsity DNN: Fast Sparsity Optimization for On-Device Streaming E2E ASR via Supernet. CoRR abs/2110.08352 (2021) - [i3]Alex Xiao, Weiyi Zheng, Gil Keren, Duc Le, Frank Zhang, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Abdelrahman Mohamed:
Scaling ASR Improves Zero and Few Shot Learning. CoRR abs/2111.05948 (2021) - 2020
- [i2]Ting-Yao Hu, Ashish Shrivastava, Jen-Hao Rick Chang, Hema Koppula, Stefan Braun, Kyuyeon Hwang, Ozlem Kalinli, Oncel Tuzel:
SapAugment: Learning A Sample Adaptive Policy for Data Augmentation. CoRR abs/2011.01156 (2020)
2010 – 2019
- 2019
- [c15]Ozlem Kalinli, Gautam Bhattacharya, Chao Weng:
Parametric Cepstral Mean Normalization for Robust Speech Recognition. ICASSP 2019: 6735-6739 - [c14]Gautam Mantena, Ozlem Kalinli, Ossama Abdel-Hamid, Don McAllaster:
Bandwidth Embeddings for Mixed-Bandwidth Speech Recognition. INTERSPEECH 2019: 3203-3207 - [i1]Gautam Mantena, Ozlem Kalinli, Ossama Abdel-Hamid, Don McAllaster:
Bandwidth Embeddings for Mixed-bandwidth Speech Recognition. CoRR abs/1909.02667 (2019) - 2016
- [c13]Ozlem Kalinli:
Analysis of Multi-Lingual Emotion Recognition Using Auditory Attention Features. INTERSPEECH 2016: 3613-3617 - 2015
- [c12]Mahnoosh Mehrabani, Ozlem Kalinli, Ruxin Chen:
Emotion clustering based on probabilistic linear discriminant analysis. INTERSPEECH 2015: 1314-1318 - 2013
- [c11]Ozlem Kalinli:
Combination of auditory attention features with phone posteriors for better automatic phoneme segmentation. INTERSPEECH 2013: 2302-2305 - 2012
- [c10]Ozlem Kalinli:
Automatic Phoneme Segmentation Using Auditory Attention Features. INTERSPEECH 2012: 2270-2273 - 2011
- [c9]Ozlem Kalinli:
Tone and pitch accent classification using auditory attention cues. ICASSP 2011: 5208-5211 - [c8]Ozlem Kalinli:
Syllable Segmentation of Continuous Speech Using Auditory Attention Cues. INTERSPEECH 2011: 425-428 - 2010
- [j2]Ozlem Kalinli, Michael L. Seltzer, Jasha Droppo, Alex Acero:
Noise Adaptive Training for Robust Automatic Speech Recognition. IEEE Trans. Speech Audio Process. 18(8): 1889-1901 (2010)
2000 – 2009
- 2009
- [j1]Ozlem Kalinli, Shrikanth S. Narayanan:
Prominence Detection Using Auditory Attention Cues and Task-Dependent High Level Information. IEEE Trans. Speech Audio Process. 17(5): 1009-1024 (2009) - [c7]Ozlem Kalinli, Michael L. Seltzer, Alex Acero:
Noise adaptive training using a vector taylor series approach for noise robust automatic speech recognition. ICASSP 2009: 3825-3828 - [c6]Ozlem Kalinli, Shrikanth S. Narayanan:
Continuous speech recognition using attention shift decoding with soft decision. INTERSPEECH 2009: 1927-1930 - [c5]Ozlem Kalinli, Shiva Sundaram, Shrikanth S. Narayanan:
Saliency-driven unstructured acoustic scene classification using latent perceptual indexing. MMSP 2009: 1-6 - 2008
- [c4]Ozlem Kalinli, Shrikanth S. Narayanan:
A top-down auditory attention model for learning task dependent influences on prominence detection in speech. ICASSP 2008: 3981-3984 - [c3]Ozlem Kalinli, Shrikanth S. Narayanan:
Combining task-dependent information with auditory attention cues for prominence detection in speech. INTERSPEECH 2008: 1064-1067 - 2007
- [c2]Ozlem Kalinli, Shrikanth S. Narayanan:
Early auditory processing inspired features for robust automatic speech recognition. EUSIPCO 2007: 2385-2389 - [c1]Ozlem Kalinli, Shrikanth S. Narayanan:
A saliency-based auditory attention model with applications to unsupervised prominent syllable detection in speech. INTERSPEECH 2007: 1941-1944
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-22 20:17 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint