default search action

combined dblp search
author search
venue search
publication search

ask others

Krishna C. Puvvada

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BurchiPBGT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BurchiPBGT24
Maxime Burchi, Krishna C. Puvvada, Jagadeesh Balam, Boris Ginsburg, Radu Timofte:
Multilingual Audio-Visual Speech Recognition with Hybrid CTC/RNN-T Fast Conformer. ICASSP 2024: 10211-10215
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PuvvadaKDBG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PuvvadaKDBG24
Krishna C. Puvvada, Nithin Rao Koluguri, Kunal Dhawan, Jagadeesh Balam, Boris Ginsburg:
Discrete Audio Representation as an Alternative to Mel-Spectrograms for Speaker and Speech Recognition. ICASSP 2024: 12111-12115
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenHAHPLGBG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenHAHPLGBG24
Zhehuai Chen, He Huang, Andrei Andrusenko, Oleksii Hrinchuk, Krishna C. Puvvada, Jason Li, Subhankar Ghosh, Jagadeesh Balam, Boris Ginsburg:
SALM: Speech-Augmented Language Model with in-Context Learning for Speech Recognition and Translation. ICASSP 2024: 13521-13525
[i12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-12983
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-12983
Maxime Burchi, Krishna C. Puvvada, Jagadeesh Balam, Boris Ginsburg, Radu Timofte:
Multilingual Audio-Visual Speech Recognition with Hybrid CTC/RNN-T Fast Conformer. CoRR abs/2405.12983 (2024)
[i11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-19674
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-19674
Krishna C. Puvvada, Piotr Zelasko, He Huang, Oleksii Hrinchuk, Nithin Rao Koluguri, Kunal Dhawan, Somshubra Majumdar, Elena Rastorgueva, Zhehuai Chen, Vitaly Lavrukhin, Jagadeesh Balam, Boris Ginsburg:
Less is More: Accurate Speech Recognition & Translation without Web-Scale Data. CoRR abs/2406.19674 (2024)
[i10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-19954
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-19954
Zhehuai Chen, He Huang, Oleksii Hrinchuk, Krishna C. Puvvada, Nithin Rao Koluguri, Piotr Zelasko, Jagadeesh Balam, Boris Ginsburg:
BESTOW: Efficient and Streamable Speech Language Model with the Best of Two Worlds in GPT and T5. CoRR abs/2406.19954 (2024)
[i9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-13106
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-13106
He Huang, Taejin Park, Kunal Dhawan, Ivan Medennikov, Krishna C. Puvvada, Nithin Rao Koluguri, Weiqing Wang, Jagadeesh Balam, Boris Ginsburg:
NEST: Self-supervised Fast Conformer as All-purpose Seasoning to Speech Processing Tasks. CoRR abs/2408.13106 (2024)
[i8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-01438
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-01438
Weiqing Wang, Kunal Dhawan, Taejin Park, Krishna C. Puvvada, Ivan Medennikov, Somshubra Majumdar, He Huang, Jagadeesh Balam, Boris Ginsburg:
Resource-Efficient Adaptation of Speech Foundation Models for Multi-Speaker ASR. CoRR abs/2409.01438 (2024)
[i7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-06656
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-06656
Taejin Park, Ivan Medennikov, Kunal Dhawan, Weiqing Wang, He Huang, Nithin Rao Koluguri, Krishna C. Puvvada, Jagadeesh Balam, Boris Ginsburg:
Sortformer: Seamless Integration of Speaker Diarization and ASR by Bridging Timestamps and Tokens. CoRR abs/2409.06656 (2024)
2023
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/RekeshKKMNHHPKBG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/RekeshKKMNHHPKBG23
Dima Rekesh, Nithin Rao Koluguri, Samuel Kriman, Somshubra Majumdar, Vahid Noroozi, He Huang, Oleksii Hrinchuk, Krishna C. Puvvada, Ankur Kumar, Jagadeesh Balam, Boris Ginsburg:
Fast Conformer With Linearly Scalable Attention For Efficient Speech Recognition. ASRU 2023: 1-8
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BartleyJPKG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BartleyJPKG23
Travis M. Bartley, Fei Jia, Krishna C. Puvvada, Samuel Kriman, Boris Ginsburg:
Accidental Learners: Spoken Language Identification in Multilingual Self-Supervised Models. ICASSP 2023: 1-5
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangPLG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangPLG23
Yang Zhang, Krishna C. Puvvada, Vitaly Lavrukhin, Boris Ginsburg:
Conformer-Based Target-Speaker Automatic Speech Recognition For Single-Channel Audio. ICASSP 2023: 1-5
[i6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-05218
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-05218
Yang Zhang, Krishna C. Puvvada, Vitaly Lavrukhin, Boris Ginsburg:
Conformer-based Target-Speaker Automatic Speech Recognition for Single-Channel Audio. CoRR abs/2308.05218 (2023)
[i5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-10922
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-10922
Krishna C. Puvvada, Nithin Rao Koluguri, Kunal Dhawan, Jagadeesh Balam, Boris Ginsburg:
Discrete Audio Representation as an Alternative to Mel-Spectrograms for Speaker and Speech Recognition. CoRR abs/2309.10922 (2023)
[i4]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-09424
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-09424
Zhehuai Chen, He Huang, Andrei Andrusenko, Oleksii Hrinchuk, Krishna C. Puvvada, Jason Li, Subhankar Ghosh, Jagadeesh Balam, Boris Ginsburg:
SALM: Speech-augmented Language Model with In-context Learning for Speech Recognition and Translation. CoRR abs/2310.09424 (2023)
[i3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-12378
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-12378
Taejin Park, He Huang, Ante Jukic, Kunal Dhawan, Krishna C. Puvvada, Nithin Rao Koluguri, Nikolay Karpov, Aleksandr Laptev, Jagadeesh Balam, Boris Ginsburg:
The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System. CoRR abs/2310.12378 (2023)
2022
[i2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-05103
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-05103
Travis M. Bartley, Fei Jia, Krishna C. Puvvada, Samuel Kriman, Boris Ginsburg:
Accidental Learners: Spoken Language Identification in Multilingual Self-Supervised Models. CoRR abs/2211.05103 (2022)
2021
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuangPSW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuangPSW21
Hsin-Ping Huang, Krishna C. Puvvada, Ming Sun, Chao Wang:
Unsupervised and Semi-Supervised Few-Shot Acoustic Event Classification. ICASSP 2021: 331-335
2020
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShiSPKMW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShiSPKMW20
Bowen Shi, Ming Sun, Krishna C. Puvvada, Chieh-Chi Kao, Spyros Matsoukas, Chao Wang:
Few-Shot Acoustic Event Detection Via Meta Learning. ICASSP 2020: 76-80
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2002-09143
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-09143
Bowen Shi, Ming Sun, Krishna C. Puvvada, Chieh-Chi Kao, Spyros Matsoukas, Chao Wang:
Few-shot acoustic event detection via meta-learning. CoRR abs/2002.09143 (2020)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.