default search action
Sachin Kumar 0009
Person information
- affiliation: Carnegie Mellon University, Pittsburg, PA, USA
Other persons with the same name
- Sachin Kumar — disambiguation page
- Sachin Kumar 0001 — South Ural State University, Chelyabinsk, Russian Federation
- Sachin Kumar 0002 — Ajay Kumar Garg Engineering College, Ghaziabad, India
- Sachin Kumar 0003 — Kyungpook National University, Daegu, South Korea
- Sachin Kumar 0004 — University of Delhi, India
- Sachin Kumar 0005 — University of Delhi, Cluster Innovation Centre, Delhi, India
- Sachin Kumar 0006 — Nanyang Technological University, Singapore
- Sachin Kumar 0007 — University of Maryland, College Park, MD, USA
- Sachin Kumar 0008 — Indian Institute of Technology, Kharagpur, India
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c22]Luca Soldaini, Rodney Kinney, Akshita Bhagia, Dustin Schwenk, David Atkinson, Russell Authur, Ben Bogin, Khyathi Raghavi Chandu, Jennifer Dumas, Yanai Elazar, Valentin Hofmann, Ananya Harsh Jha, Sachin Kumar, Li Lucy, Xinxi Lyu, Nathan Lambert, Ian Magnusson, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Abhilasha Ravichander, Kyle Richardson, Zejiang Shen, Emma Strubell, Nishant Subramani, Oyvind Tafjord, Pete Walsh, Luke Zettlemoyer, Noah A. Smith, Hannaneh Hajishirzi, Iz Beltagy, Dirk Groeneveld, Jesse Dodge, Kyle Lo:
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research. ACL (1) 2024: 15725-15788 - [c21]Sachin Kumar, Chan Young Park, Yulia Tsvetkov:
Gen-Z: Generative Zero-Shot Text Classification with Contextualized Label Descriptions. ICLR 2024 - [c20]Yuhan Liu, Shangbin Feng, Xiaochuang Han, Vidhisha Balachandran, Chan Young Park, Sachin Kumar, Yulia Tsvetkov:
P³Sum: Preserving Author's Perspective in News Summarization with Diffusion Language Models. NAACL-HLT 2024: 2154-2173 - [c19]Xiaochuang Han, Sachin Kumar, Yulia Tsvetkov, Marjan Ghazvininejad:
David helps Goliath: Inference-Time Collaboration Between Small Specialized and Large General Diffusion LMs. NAACL-HLT 2024: 8385-8400 - [i26]Luca Soldaini, Rodney Kinney, Akshita Bhagia, Dustin Schwenk, David Atkinson, Russell Authur, Ben Bogin, Khyathi Raghavi Chandu, Jennifer Dumas, Yanai Elazar, Valentin Hofmann, Ananya Harsh Jha, Sachin Kumar, Li Lucy, Xinxi Lyu, Nathan Lambert, Ian Magnusson, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Abhilasha Ravichander, Kyle Richardson, Zejiang Shen, Emma Strubell, Nishant Subramani, Oyvind Tafjord, Pete Walsh, Luke Zettlemoyer, Noah A. Smith, Hannaneh Hajishirzi, Iz Beltagy, Dirk Groeneveld, Jesse Dodge, Kyle Lo:
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research. CoRR abs/2402.00159 (2024) - [i25]Nathan Lambert, Valentina Pyatkin, Jacob Morrison, LJ Miranda, Bill Yuchen Lin, Khyathi Raghavi Chandu, Nouha Dziri, Sachin Kumar, Tom Zick, Yejin Choi, Noah A. Smith, Hannaneh Hajishirzi:
RewardBench: Evaluating Reward Models for Language Modeling. CoRR abs/2403.13787 (2024) - [i24]Liwei Jiang, Kavel Rao, Seungju Han, Allyson Ettinger, Faeze Brahman, Sachin Kumar, Niloofar Mireshghallah, Ximing Lu, Maarten Sap, Yejin Choi, Nouha Dziri:
WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models. CoRR abs/2406.18510 (2024) - [i23]Orevaoghene Ahia, Sachin Kumar, Hila Gonen, Valentin Hoffman, Tomasz Limisiewicz, Yulia Tsvetkov, Noah A. Smith:
MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization. CoRR abs/2407.08818 (2024) - [i22]Faeze Brahman, Sachin Kumar, Vidhisha Balachandran, Pradeep Dasigi, Valentina Pyatkin, Abhilasha Ravichander, Sarah Wiegreffe, Nouha Dziri, Khyathi Raghavi Chandu, Jack Hessel, Yulia Tsvetkov, Noah A. Smith, Yejin Choi, Hannaneh Hajishirzi:
The Art of Saying No: Contextual Noncompliance in Language Models. CoRR abs/2407.12043 (2024) - [i21]Sachin Kumar, Chan Young Park, Yulia Tsvetkov, Noah A. Smith, Hannaneh Hajishirzi:
ComPO: Community Preferences for Language Model Personalization. CoRR abs/2410.16027 (2024) - [i20]Lester James V. Miranda, Yizhong Wang, Yanai Elazar, Sachin Kumar, Valentina Pyatkin, Faeze Brahman, Noah A. Smith, Hannaneh Hajishirzi, Pradeep Dasigi:
Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback. CoRR abs/2410.19133 (2024) - 2023
- [c18]Xiaochuang Han, Sachin Kumar, Yulia Tsvetkov:
SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control. ACL (1) 2023: 11575-11596 - [c17]Tianxing He, Jingyu Zhang, Tianle Wang, Sachin Kumar, Kyunghyun Cho, James R. Glass, Yulia Tsvetkov:
On the Blind Spots of Model-Based Evaluation Metrics for Text Generation. ACL (1) 2023: 12067-12097 - [c16]Melanie Sclar, Sachin Kumar, Peter West, Alane Suhr, Yejin Choi, Yulia Tsvetkov:
Minding Language Models' (Lack of) Theory of Mind: A Plug-and-Play Multi-Character Belief Tracker. ACL (1) 2023: 13960-13980 - [c15]Sachin Kumar, Vidhisha Balachandran, Lucille Njoo, Antonios Anastasopoulos, Yulia Tsvetkov:
Language Generation Models Can Cause Harm: So What Can We Do About It? An Actionable Survey. EACL 2023: 3291-3313 - [c14]Orevaoghene Ahia, Sachin Kumar, Hila Gonen, Jungo Kasai, David R. Mortensen, Noah A. Smith, Yulia Tsvetkov:
Do All Languages Cost the Same? Tokenization in the Era of Commercial Language Models. EMNLP 2023: 9904-9923 - [i19]Leon Derczynski, Hannah Rose Kirk, Vidhisha Balachandran, Sachin Kumar, Yulia Tsvetkov, Mark R. Leiser, Saif Mohammad:
Assessing Language Model Deployment with Risk Cards. CoRR abs/2303.18190 (2023) - [i18]Orevaoghene Ahia, Sachin Kumar, Hila Gonen, Jungo Kasai, David R. Mortensen, Noah A. Smith, Yulia Tsvetkov:
Do All Languages Cost the Same? Tokenization in the Era of Commercial Language Models. CoRR abs/2305.13707 (2023) - [i17]Xiaochuang Han, Sachin Kumar, Yulia Tsvetkov, Marjan Ghazvininejad:
SSD-2: Scaling and Inference-time Fusion of Diffusion Language Models. CoRR abs/2305.14771 (2023) - [i16]Melanie Sclar, Sachin Kumar, Peter West, Alane Suhr, Yejin Choi, Yulia Tsvetkov:
Minding Language Models' (Lack of) Theory of Mind: A Plug-and-Play Multi-Character Belief Tracker. CoRR abs/2306.00924 (2023) - [i15]Sachin Kumar, Chan Young Park, Yulia Tsvetkov:
Gen-Z: Generative Zero-Shot Text Classification with Contextualized Label Descriptions. CoRR abs/2311.07115 (2023) - [i14]Yuhan Liu, Shangbin Feng, Xiaochuang Han, Vidhisha Balachandran, Chan Young Park, Sachin Kumar, Yulia Tsvetkov:
What Constitutes a Faithful Summary? Preserving Author Perspectives in News Summarization. CoRR abs/2311.09741 (2023) - 2022
- [c13]Sachin Kumar, Biswajit Paria, Yulia Tsvetkov:
Gradient-based Constrained Sampling from Language Models. EMNLP 2022: 2251-2277 - [c12]Melanie Sclar, Peter West, Sachin Kumar, Yulia Tsvetkov, Yejin Choi:
Referee: Reference-Free Sentence Summarization with Sharper Controllability through Symbolic Knowledge Distillation. EMNLP 2022: 9649-9668 - [c11]Riccardo Tommasini, Senjuti Basu Roy, Xuan Wang, Hongwei Wang, Heng Ji, Jiawei Han, Preslav Nakov, Giovanni Da San Martino, Firoj Alam, Markus Schedl, Elisabeth Lex, Akash Bharadwaj, Graham Cormode, Milan Dojchinovski, Jan Forberg, Johannes Frey, Pieter Bonte, Marco Balduini, Matteo Belcao, Emanuele Della Valle, Junliang Yu, Hongzhi Yin, Tong Chen, Haochen Liu, Yiqi Wang, Wenqi Fan, Xiaorui Liu, Jamell Dacon, Lingjuan Lye, Jiliang Tang, Aristides Gionis, Stefan Neumann, Bruno Ordozgoiti, Simon Razniewski, Hiba Arnaout, Shrestha Ghosh, Fabian M. Suchanek, Lingfei Wu, Yu Chen, Yunyao Li, Bang Liu, Filip Ilievski, Daniel Garijo, Hans Chalupsky, Pedro A. Szekely, Ilias Kanellos, Dimitris Sacharidis, Thanasis Vergoulis, Nurendra Choudhary, Nikhil Rao, Karthik Subbian, Srinivasan H. Sengamedu, Chandan K. Reddy, Friedhelm Victor, Bernhard Haslhofer, George Katsogiannis-Meimarakis, Georgia Koutrika, Shengmin Jin, Danai Koutra, Reza Zafarani, Yulia Tsvetkov, Vidhisha Balachandran, Sachin Kumar, Xiangyu Zhao, Bo Chen, Huifeng Guo, Yejing Wang, Ruiming Tang, Yang Zhang, Wenjie Wang, Peng Wu, Fuli Feng, Xiangnan He:
Accepted Tutorials at The Web Conference 2022. WWW (Companion Volume) 2022: 391-399 - [i13]Sachin Kumar, Biswajit Paria, Yulia Tsvetkov:
Constrained Sampling from Language Models via Langevin Dynamics in Embedding Spaces. CoRR abs/2205.12558 (2022) - [i12]Sachin Kumar, Vidhisha Balachandran, Lucille Njoo, Antonios Anastasopoulos, Yulia Tsvetkov:
Language Generation Models Can Cause Harm: So What Can We Do About It? An Actionable Survey. CoRR abs/2210.07700 (2022) - [i11]Melanie Sclar, Peter West, Sachin Kumar, Yulia Tsvetkov, Yejin Choi:
Referee: Reference-Free Sentence Summarization with Sharper Controllability through Symbolic Knowledge Distillation. CoRR abs/2210.13800 (2022) - [i10]Xiaochuang Han, Sachin Kumar, Yulia Tsvetkov:
SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control. CoRR abs/2210.17432 (2022) - [i9]Tianxing He, Jingyu Zhang, Tianle Wang, Sachin Kumar, Kyunghyun Cho, James R. Glass, Yulia Tsvetkov:
On the Blind Spots of Model-Based Evaluation Metrics for Text Generation. CoRR abs/2212.10020 (2022) - 2021
- [c10]Sachin Kumar, Antonios Anastasopoulos, Shuly Wintner, Yulia Tsvetkov:
Machine Translation into Low-resource Language Varieties. ACL/IJCNLP (2) 2021: 110-121 - [c9]Sachin Kumar, Eric Malmi, Aliaksei Severyn, Yulia Tsvetkov:
Controlled Text Generation as Continuous Optimization with Multiple Constraints. NeurIPS 2021: 14542-14554 - [i8]Lidia Kidane, Sachin Kumar, Yulia Tsvetkov:
An Exploration of Data Augmentation Techniques for Improving English to Tigrinya Translation. AfricaNLP 2021 - [i7]Sachin Kumar, Antonios Anastasopoulos, Shuly Wintner, Yulia Tsvetkov:
Machine Translation into Low-resource Language Varieties. CoRR abs/2106.06797 (2021) - [i6]Sachin Kumar, Eric Malmi, Aliaksei Severyn, Yulia Tsvetkov:
Controlled Text Generation as Continuous Optimization with Multiple Constraints. CoRR abs/2108.01850 (2021) - [i5]Monisha Jegadeesan, Sachin Kumar, John Wieting, Yulia Tsvetkov:
Improving the Diversity of Unsupervised Paraphrasing with Embedding Outputs. CoRR abs/2110.13231 (2021) - 2020
- [c8]Zi-Yi Dou, Sachin Kumar, Yulia Tsvetkov:
A Deep Reinforced Model for Zero-Shot Cross-Lingual Summarization with Bilingual Semantic Similarity Rewards. NGT@ACL 2020: 60-68 - [c7]Sachin Kumar, Yulia Tsvetkov:
End-to-End Differentiable GANs for Text Generation. ICBINB@NeurIPS 2020: 118-128 - [c6]Tanya Chowdhury, Sachin Kumar, Tanmoy Chakraborty:
Neural Abstractive Summarization with Structural Attention. IJCAI 2020: 3716-3722 - [i4]Tanya Chowdhury, Sachin Kumar, Tanmoy Chakraborty:
Neural Abstractive Summarization with Structural Attention. CoRR abs/2004.09739 (2020) - [i3]Zi-Yi Dou, Sachin Kumar, Yulia Tsvetkov:
A Deep Reinforced Model for Zero-Shot Cross-Lingual Summarization with Bilingual Semantic Similarity Rewards. CoRR abs/2006.15454 (2020)
2010 – 2019
- 2019
- [c5]Gayatri Bhat, Sachin Kumar, Yulia Tsvetkov:
A Margin-based Loss with Synthetic Negative Samples for Continuous-output Machine Translation. NGT@EMNLP-IJCNLP 2019: 199-205 - [c4]Sachin Kumar, Shuly Wintner, Noah A. Smith, Yulia Tsvetkov:
Topics to Avoid: Demoting Latent Confounds in Text Classification. EMNLP/IJCNLP (1) 2019: 4151-4161 - [c3]Sachin Kumar, Yulia Tsvetkov:
Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs. ICLR (Poster) 2019 - [i2]Sachin Kumar, Shuly Wintner, Noah A. Smith, Yulia Tsvetkov:
Topics to Avoid: Demoting Latent Confounds in Text Classification. CoRR abs/1909.00453 (2019) - 2018
- [c2]Shreshtha Mundra, Sachin Kumar, Manjira Sinha, Sandya Mannarswamy:
Mining & Summarizing E-petitions for Enhanced Understanding of Public Opinion. CIKM 2018: 1695-1698 - [i1]Sachin Kumar, Yulia Tsvetkov:
Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs. CoRR abs/1812.04616 (2018) - 2017
- [c1]Sachin Kumar, Soumen Chakrabarti, Shourya Roy:
Earth Mover's Distance Pooling over Siamese LSTMs for Automatic Short Answer Grading. IJCAI 2017: 2046-2052
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-17 20:58 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint