default search action
Shruti Palaskar
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [i15]Shruti Palaskar, Oggi Rudovic, Sameer Dharur, Florian Pesce, Gautam Krishna, Aswin Sivaraman, Jack Berkowitz, Ahmed Hussen Abdelaziz, Saurabh Adya, Ahmed H. Tewfik:
Multimodal Large Language Models with Fusion Low Rank Adaptation for Device Directed Speech Detection. CoRR abs/2406.09617 (2024) - 2022
- [c12]Shruti Palaskar, Akshita Bhagia, Yonatan Bisk, Florian Metze, Alan W. Black, Ana Marasovic:
On Advances in Text Generation from Images Beyond Captioning: A Case Study in Self-Rationalization. EMNLP (Findings) 2022: 2644-2657 - [c11]Roshan Sharma, Shruti Palaskar, Alan W. Black, Florian Metze:
End-to-End Speech Summarization Using Restricted Self-Attention. ICASSP 2022: 8072-8076 - [i14]Shruti Palaskar, Akshita Bhagia, Yonatan Bisk, Florian Metze, Alan W. Black, Ana Marasovic:
On Advances in Text Generation from Images Beyond Captioning: A Case Study in Self-Rationalization. CoRR abs/2205.11686 (2022) - 2021
- [c10]Amanda Cardoso Duarte, Shruti Palaskar, Lucas Ventura, Deepti Ghadiyaram, Kenneth DeHaan, Florian Metze, Jordi Torres, Xavier Giró-i-Nieto:
How2Sign: A Large-Scale Multimodal Dataset for Continuous American Sign Language. CVPR 2021: 2735-2744 - [c9]Shruti Palaskar, Ruslan Salakhutdinov, Alan W. Black, Florian Metze:
Multimodal Speech Summarization Through Semantic Concept Learning. Interspeech 2021: 791-795 - [i13]Roshan Sharma, Shruti Palaskar, Alan W. Black, Florian Metze:
Speech Summarization using Restricted Self-Attention. CoRR abs/2110.06263 (2021) - 2020
- [j3]Shruti Palaskar, Ramon Sanabria, Florian Metze:
Transfer learning for multimodal dialog. Comput. Speech Lang. 64: 101093 (2020) - [j2]Lucia Specia, Loïc Barrault, Ozan Caglayan, Amanda Cardoso Duarte, Desmond Elliott, Spandana Gella, Nils Holzenberger, Chiraag Lala, Sun Jae Lee, Jindrich Libovický, Pranava Madhyastha, Florian Metze, Karl Mulligan, Alissa Ostapenko, Shruti Palaskar, Ramon Sanabria, Josiah Wang, Raman Arora:
Grounded Sequence to Sequence Transduction. IEEE J. Sel. Top. Signal Process. 14(3): 577-591 (2020) - [j1]Odette Scharenborg, Lucas Ondel, Shruti Palaskar, Philip Arthur, Francesco Ciannella, Mingxing Du, Elin Larsen, Danny Merkx, Rachid Riad, Liming Wang, Emmanuel Dupoux, Laurent Besacier, Alan W. Black, Mark Hasegawa-Johnson, Florian Metze, Graham Neubig, Sebastian Stüker, Pierre Godard, Markus Müller:
Speech Technology for Unwritten Languages. IEEE ACM Trans. Audio Speech Lang. Process. 28: 964-975 (2020) - [c8]Anirudh Mani, Shruti Palaskar, Nimshi Venkat Meripo, Sandeep Konam, Florian Metze:
ASR Error Correction and Domain Adaptation Using Machine Translation. ICASSP 2020: 6344-6348 - [i12]Anirudh Mani, Shruti Palaskar, Nimshi Venkat Meripo, Sandeep Konam, Florian Metze:
ASR Error Correction and Domain Adaptation Using Machine Translation. CoRR abs/2003.07692 (2020) - [i11]Amanda Cardoso Duarte, Shruti Palaskar, Deepti Ghadiyaram, Kenneth DeHaan, Florian Metze, Jordi Torres, Xavier Giró-i-Nieto:
How2Sign: A Large-scale Multimodal Dataset for Continuous American Sign Language. CoRR abs/2008.08143 (2020)
2010 – 2019
- 2019
- [c7]Shruti Palaskar, Jindrich Libovický, Spandana Gella, Florian Metze:
Multimodal Abstractive Summarization for How2 Videos. ACL (1) 2019: 6587-6596 - [c6]Shruti Palaskar, Vikas Raunak, Florian Metze:
Learned in Speech Recognition: Contextual Acoustic Word Embeddings. ICASSP 2019: 6530-6534 - [c5]Nils Holzenberger, Shruti Palaskar, Pranava Madhyastha, Florian Metze, Raman Arora:
Learning from Multiview Correlations in Open-domain Videos. ICASSP 2019: 8628-8632 - [c4]Ozan Caglayan, Ramon Sanabria, Shruti Palaskar, Loïc Barrault, Florian Metze:
Multimodal Grounding for Sequence-to-sequence Speech Recognition. ICASSP 2019: 8648-8652 - [i10]Shruti Palaskar, Vikas Raunak, Florian Metze:
Learned In Speech Recognition: Contextual Acoustic Word Embeddings. CoRR abs/1902.06833 (2019) - [i9]Shruti Palaskar, Jindrich Libovický, Spandana Gella, Florian Metze:
Multimodal Abstractive Summarization for How2 Videos. CoRR abs/1906.07901 (2019) - 2018
- [c3]Odette Scharenborg, Laurent Besacier, Alan W. Black, Mark Hasegawa-Johnson, Florian Metze, Graham Neubig, Sebastian Stüker, Pierre Godard, Markus Müller, Lucas Ondel, Shruti Palaskar, Philip Arthur, Francesco Ciannella, Mingxing Du, Elin Larsen, Danny Merkx, Rachid Riad, Liming Wang, Emmanuel Dupoux:
Linguistic Unit Discovery from Multi-Modal Inputs in Unwritten Languages: Summary of the "Speaking Rosetta" JSALT 2017 Workshop. ICASSP 2018: 4979-4983 - [c2]Shruti Palaskar, Ramon Sanabria, Florian Metze:
End-to-end Multimodal Speech Recognition. ICASSP 2018: 5774-5778 - [c1]Shruti Palaskar, Florian Metze:
Acoustic-to-Word Recognition with Sequence-to-Sequence Models. SLT 2018: 397-404 - [i8]Eduard H. Hovy, Taylor Berg-Kirkpatrick, Jaime G. Carbonell, Hans Chalupsky, Anatole Gershman, Alexander G. Hauptmann, Florian Metze, Teruko Mitamura, Aditi Chaudhary, Xianyang Chen, Bernie Po-Yao Huang, Hector Zhengzhong Liu, Xuezhe Ma, Shruti Palaskar, Dheeraj Rajagopal, Maria Ryskina, Ramon Sanabria:
OPERA: Operations-oriented Probabilistic Extraction, Reasoning, and Analysis. TAC 2018 - [i7]Odette Scharenborg, Laurent Besacier, Alan W. Black, Mark Hasegawa-Johnson, Florian Metze, Graham Neubig, Sebastian Stüker, Pierre Godard, Markus Müller, Lucas Ondel, Shruti Palaskar, Philip Arthur, Francesco Ciannella, Mingxing Du, Elin Larsen, Danny Merkx, Rachid Riad, Liming Wang, Emmanuel Dupoux:
Linguistic unit discovery from multi-modal inputs in unwritten languages: Summary of the "Speaking Rosetta" JSALT 2017 Workshop. CoRR abs/1802.05092 (2018) - [i6]Shruti Palaskar, Ramon Sanabria, Florian Metze:
End-to-End Multimodal Speech Recognition. CoRR abs/1804.09713 (2018) - [i5]Shruti Palaskar, Florian Metze:
Acoustic-to-Word Recognition with Sequence-to-Sequence Models. CoRR abs/1807.09597 (2018) - [i4]Ramon Sanabria, Ozan Caglayan, Shruti Palaskar, Desmond Elliott, Loïc Barrault, Lucia Specia, Florian Metze:
How2: A Large-scale Dataset for Multimodal Language Understanding. CoRR abs/1811.00347 (2018) - [i3]Ozan Caglayan, Ramon Sanabria, Shruti Palaskar, Loïc Barrault, Florian Metze:
Multimodal Grounding for Sequence-to-Sequence Speech Recognition. CoRR abs/1811.03865 (2018) - [i2]Nils Holzenberger, Shruti Palaskar, Pranava Madhyastha, Florian Metze, Raman Arora:
Learning from Multiview Correlations in Open-Domain Videos. CoRR abs/1811.08890 (2018) - 2017
- [i1]Yohan Jo, Lisa Lee, Shruti Palaskar:
Combining LSTM and Latent Topic Modeling for Mortality Prediction. CoRR abs/1709.02842 (2017)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 22:06 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint