default search action
EMNLP 2024: Miami, FL, USA
- Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen:
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024, Miami, FL, USA, November 12-16, 2024. Association for Computational Linguistics 2024, ISBN 979-8-89176-164-3 - Frontmatter.
- Juhwan Choi, Yeonghwa Kim, Seunguk Yu, Jungmin Yun, Youngbin Kim:
UniGen: Universal Domain Generalization for Sentiment Classification via Zero-shot Dataset Generation. 1-14 - Juhwan Choi, Jungmin Yun, Kyohoon Jin, Youngbin Kim:
Multi-News+: Cost-efficient Dataset Cleansing via LLM-based Data Annotation. 15-29 - Joonho Yang, Seunghyun Yoon, Byeongjeong Kim, Hwanhee Lee:
FIZZ: Factual Inconsistency Detection by Zoom-in Summary and Zoom-out Document. 30-45 - Rimon Melamed, Lucas H. McCabe, Tanay Wakhare, Yejin Kim, H. Howie Huang, Enric Boix-Adserà:
Prompts have evil twins. 46-74 - Vaishali Pal, Evangelos Kanoulas, Andrew Yates, Maarten de Rijke:
Table Question Answering for Low-resourced Indic Languages. 75-92 - Roopal Garg, Andrea Burns, Burcu Karagol Ayan, Yonatan Bitton, Ceslee Montgomery, Yasumasa Onoe, Andrew Bunner, Ranjay Krishna, Jason Baldridge, Radu Soricut:
ImageInWords: Unlocking Hyper-Detailed Image Descriptions. 93-127 - Yihuai Lan, Zhiqiang Hu, Lei Wang, Yang Wang, Deheng Ye, Peilin Zhao, Ee-Peng Lim, Hui Xiong, Hao Wang:
LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay. 128-145 - Xiangyu Zhang, Hexin Liu, Kaishuai Xu, Qiquan Zhang, Daijiao Liu, Beena Ahmed, Julien Epps:
When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection. 146-158 - Xiangyu Zhang, Daijiao Liu, Hexin Liu, Qiquan Zhang, Hanyu Meng, Leibny Paola García-Perera, EngSiong Chng, Lina Yao:
Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model. 159-171 - Sanne Hoeken, Sina Zarrieß, Özge Alaçam:
Hateful Word in Context Classification. 172-186 - Özge Alaçam, Sanne Hoeken, Sina Zarrieß:
Eyes Don't Lie: Subjective Hate Annotation and Detection with Gaze. 187-205 - Eli Schwartz, Leshem Choshen, Joseph Shtok, Sivan Doveh, Leonid Karlinsky, Assaf Arbelle:
NumeroLogic: Number Encoding for Enhanced LLMs' Numerical Reasoning. 206-212 - Shaz Furniturewala, Surgan Jandial, Abhinav Java, Pragyan Banerjee, Simra Shahid, Sumit Bhatia, Kokil Jaidka:
"Thinking" Fair and Slow: On the Efficacy of Structured Prompts for Debiasing Language Models. 213-227 - Wendi Zhou, Tianyi Li, Pavlos Vougiouklis, Mark Steedman, Jeff Z. Pan:
A Usage-centric Take on Intent Understanding in E-Commerce. 228-236 - Oded Ovadia, Menachem Brief, Moshik Mishaeli, Oren Elisha:
Fine-Tuning or Retrieval? Comparing Knowledge Injection in LLMs. 237-250 - Amir Taubenfeld, Yaniv Dover, Roi Reichart, Ariel Goldstein:
Systematic Biases in LLM Simulations of Debates. 251-267 - Katherine Atwell, Danielle Bragg, Malihe Alikhani:
Studying and Mitigating Biases in Sign Language Understanding Models. 268-283 - Xinmeng Huang, Shuo Li, Mengxin Yu, Matteo Sesia, Hamed Hassani, Insup Lee, Osbert Bastani, Edgar Dobriban:
Uncertainty in Language Models: Assessment through Rank-Calibration. 284-312 - Junjie Ye, Yilong Wu, Songyang Gao, Caishuang Huang, Sixian Li, Guanyu Li, Xiaoran Fan, Qi Zhang, Tao Gui, Xuanjing Huang:
RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning. 313-333 - Fangkai Jiao, Chengwei Qin, Zhengyuan Liu, Nancy Chen, Shafiq Joty:
Learning Planning-based Reasoning by Trajectories Collection and Process Reward Synthesizing. 334-350 - Santiago Cuervo, Ricard Marxer:
Scaling Properties of Speech Language Models. 351-361 - Rajkumar Pujari, Chengfei Wu, Dan Goldwasser:
"We Demand Justice!": Towards Social Context Grounding of Political Texts. 362-372 - Rabindra Nath Nandi, Suman Kalyan Maity, Brian Uzzi, Sourav Medya:
An Experimental Analysis on Evaluating Patent Citations. 373-387 - Dawei Zhu, Pinzhen Chen, Miaoran Zhang, Barry Haddow, Xiaoyu Shen, Dietrich Klakow:
Fine-Tuning Large Language Models to Translate: Will a Touch of Noisy Data in Misaligned Languages Suffice? 388-409 - Le Yan, Zhen Qin, Honglei Zhuang, Rolf Jagerman, Xuanhui Wang, Michael Bendersky, Harrie Oosterhuis:
Consolidating Ranking and Relevance Predictions of Large Language Models through Post-Processing. 410-423 - Tong Zhang, Chen Huang, Yang Deng, Hongru Liang, Jia Liu, Zujie Wen, Wenqiang Lei, Tat-Seng Chua:
Strength Lies in Differences! Improving Strategy Planning for Non-collaborative Dialogues via Diversified User Simulation. 424-444 - Saiful Islam Salim, Rubin Yuchan Yang, Alexander Cooper, Suryashree Ray, Saumya Debray, Sazzadur Rahaman:
Impeding LLM-assisted Cheating in Introductory Programming Assignments via Adversarial Perturbation. 445-463 - Yuan Ge, Yilun Liu, Chi Hu, Weibin Meng, Shimin Tao, Xiaofeng Zhao, Mahong Xia, Zhang Li, Boxing Chen, Hao Yang, Bei Li, Tong Xiao, JingBo Zhu:
Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation. 464-478 - Abhilasha Sancheti, Haozhe An, Rachel Rudinger:
On the Influence of Gender and Race in Romantic Relationship Prediction from Large Language Models. 479-494 - Maureen de Seyssel, Antony D'Avirro, Adina Williams, Emmanuel Dupoux:
EmphAssess : a Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models. 495-507 - Xiaoxiao Ma, Yuchen Zhang, Kaize Ding, Jian Yang, Jia Wu, Hao Fan:
On Fake News Detection with LLM Enhanced Semantics Mining. 508-521 - Branislav Pecher, Ivan Srba, Mária Bieliková:
On Sensitivity of Learning with Limited Labelled Data to the Effects of Randomness: Impact of Interactions and Systematic Choices. 522-556 - Zekun Li, Baolin Peng, Pengcheng He, Xifeng Yan:
Evaluating the Instruction-Following Robustness of Large Language Models to Prompt Injection. 557-568 - Valentin Barrière, Sebastian Cifuentes:
A Study of Nationality Bias in Names and Perplexity using Off-the-Shelf Affect-related Tweet Classifiers. 569-579 - Yong Lin, Hangyu Lin, Wei Xiong, Shizhe Diao, Jianmeng Liu, Jipeng Zhang, Rui Pan, Haoxiang Wang, Wenbin Hu, Hanning Zhang, Hanze Dong, Renjie Pi, Han Zhao, Nan Jiang, Heng Ji, Yuan Yao, Tong Zhang:
Mitigating the Alignment Tax of RLHF. 580-606 - Meng Li, Haoran Jin, Ruixuan Huang, Zhihao Xu, Defu Lian, Zijia Lin, Di Zhang, Xiting Wang:
Evaluating Readability and Faithfulness of Concept-based Explanations. 607-625 - Zhengyuan Liu, Stella Xin Yin, Geyu Lin, Nancy Chen:
Personality-aware Student Simulation for Conversational Intelligent Tutoring Systems. 626-642 - Dayuan Fu, Biqing Qi, Yihuai Gao, Che Jiang, Guanting Dong, Bowen Zhou:
MSI-Agent: Incorporating Multi-Scale Insight into Embodied Agents for Superior Planning and Decision-Making. 643-659 - Min-Hsuan Yeh, Ruyuan Wan, Ting-Hao Huang:
CoCoLoFa: A Dataset of News Comments with Common Logical Fallacies Written by LLM-Assisted Crowds. 660-677 - Craig W. Schmidt, Varshini Reddy, Haoran Zhang, Alec Alameddine, Omri Uzan, Yuval Pinter, Chris Tanner:
Tokenization Is More Than Compression. 678-702 - Ninareh Mehrabi, Palash Goyal, Christophe Dupuy, Qian Hu, Shalini Ghosh, Richard S. Zemel, Kai-Wei Chang, Aram Galstyan, Rahul Gupta:
FLIRT: Feedback Loop In-context Red Teaming. 703-718 - Lingjun Zhao, Khanh Nguyen, Hal Daumé III:
Successfully Guiding Humans with Imperfect Instructions by Highlighting Potential Errors and Suggesting Corrections. 719-736 - Haoyuan Wu, Haisheng Zheng, Zhuolun He, Bei Yu:
Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks. 737-749 - Shihao Cai, Keqin Bao, Hangyu Guo, Jizhi Zhang, Jun Song, Bo Zheng:
GeoGPT4V: Towards Geometric Multi-modal Large Language Models with Geometric Image Generation. 750-766 - Thong Nguyen, Shubham Chatterjee, Sean MacAvaney, Iain Mackie, Jeff Dalton, Andrew Yates:
DyVo: Dynamic Vocabularies for Learned Sparse Retrieval with Entities. 767-783 - Zihan Wang, Deli Chen, Damai Dai, Runxin Xu, Zhuoshu Li, Yu Wu:
Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models. 784-801 - Dawei Zhu, Liang Wang, Nan Yang, Yifan Song, Wenhao Wu, Furu Wei, Sujian Li:
LongEmbed: Extending Embedding Models for Long Context Retrieval. 802-816 - Xiangyang Liu, Junliang He, Xipeng Qiu:
Making Large Language Models Better Reasoners with Orchestrated Streaming Experiences. 817-838 - Xianlong Luo, Meng Yang, Yihao Wang:
Overcome Noise and Bias: Segmentation-Aided Multi-Granularity Denoising and Debiasing for Enhanced Quarduples Extraction in Dialogue. 839-856 - Dongjun Lim, Yun-Gyung Cheong:
Integrating Plutchik's Theory with Mixture of Experts for Enhancing Emotion Classification. 857-867 - Chao Liang, Wei Xiang, Bang Wang:
In-context Contrastive Learning for Event Causality Identification. 868-881 - Anna Wegmann, Tijs A. van den Broek, Dong Nguyen:
What's Mine becomes Yours: Defining, Annotating and Detecting Context-Dependent Paraphrases in News Interview Dialogs. 882-912 - Kanishka Misra, Kyle Mahowald:
Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNs. 913-929 - Zhen Tan, Dawei Li, Song Wang, Alimohammad Beigi, Bohan Jiang, Amrita Bhattacharjee, Mansooreh Karami, Jundong Li, Lu Cheng, Huan Liu:
Large Language Models for Data Annotation and Synthesis: A Survey. 930-957 - Hongyuan Lu, Haoran Yang, Haoyang Huang, Dongdong Zhang, Wai Lam, Furu Wei:
Chain-of-Dictionary Prompting Elicits Translation in Large Language Models. 958-976 - Yifan Yang, Kai Zhen, Ershad Banijamali, Athanasios Mouchtaris, Zheng Zhang:
AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tuning. 977-995 - Haoyu Wang, Tianci Liu, Ruirui Li, Monica Xiao Cheng, Tuo Zhao, Jing Gao:
RoseLoRA: Row and Column-wise Sparse Low-rank Adaptation of Pre-trained Language Model for Knowledge Editing and Fine-tuning. 996-1008 - Haoyu Wang, Ruirui Li, Haoming Jiang, Jinjin Tian, Zhengyang Wang, Chen Luo, Xianfeng Tang, Monica Xiao Cheng, Tuo Zhao, Jing Gao:
BlendFilter: Advancing Retrieval-Augmented Large Language Models via Query Generation Blending and Knowledge Filtering. 1009-1025 - Jocelyn Shen, Joel Mire, Hae Park, Cynthia Breazeal, Maarten Sap:
HEART-felt Narratives: Tracing Empathy and Narrative Style in Personal Stories with LLMs. 1026-1046 - Junru Lu, Jiazheng Li, Siyu An, Meng Zhao, Yulan He, Di Yin, Xing Sun:
Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence. 1047-1067 - Tianyi Hu, Maria Maistro, Daniel Hershcovich:
Bridging Cultures in the Kitchen: A Framework and Benchmark for Cross-Cultural Recipe Retrieval. 1068-1080 - Peng Xia, Kangyu Zhu, Haoran Li, Hongtu Zhu, Yun Li, Gang Li, Linjun Zhang, Huaxiu Yao:
RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models. 1081-1093 - Yuan Li, Bingqiao Luo, Qian Wang, Nuo Chen, Xu Liu, Bingsheng He:
CryptoTrade: A Reflective LLM-based Agent to Guide Zero-shot Cryptocurrency Trading. 1094-1106 - Qingxiu Dong, Lei Li, Damai Dai, Ce Zheng, Jingyuan Ma, Rui Li, Heming Xia, Jingjing Xu, Zhiyong Wu, Baobao Chang, Xu Sun, Zhifang Sui:
A Survey on In-context Learning. 1107-1128 - Hangdi Xing, Changxu Cheng, Feiyu Gao, Zirui Shao, Zhi Yu, Jiajun Bu, Qi Zheng, Cong Yao:
DocHieNet: A Large and Diverse Dataset for Document Hierarchy Parsing. 1129-1142 - Ziyang Luo, Xin Li, Hongzhan Lin, Jing Ma, Lidong Bing:
AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge Distillation for Large Language Models in Code Generation. 1143-1166 - Shangyu Xing, Fei Zhao, Zhen Wu, Tuo An, Weihao Chen, Chunhui Li, Jianbing Zhang, Xinyu Dai:
EFUF: Efficient Fine-Grained Unlearning Framework for Mitigating Hallucinations in Multimodal Large Language Models. 1167-1181 - Sungbin Shin, Wonpyo Park, Jaeho Lee, Namhoon Lee:
Rethinking Pruning Large Language Models: Benefits and Pitfalls of Reconstruction Error Minimization. 1182-1191 - Roman Koshkin, Katsuhito Sudoh, Satoshi Nakamura:
LLMs Are Zero-Shot Context-Aware Simultaneous Translators. 1192-1207 - Yiqiao Jin, Qinlin Zhao, Yiyang Wang, Hao Chen, Kaijie Zhu, Yijia Xiao, Jindong Wang:
AgentReview: Exploring Peer Review Dynamics with LLM Agents. 1208-1226 - Kelong Mao, Chenlong Deng, Haonan Chen, Fengran Mo, Zheng Liu, Tetsuya Sakai, Zhicheng Dou:
ChatRetriever: Adapting Large Language Models for Generalized and Robust Conversational Dense Retrieval. 1227-1240 - Han Zhou, Xingchen Wan, Yinhong Liu, Nigel Collier, Ivan Vulic, Anna Korhonen:
Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments. 1241-1252 - Chenlong Deng, Kelong Mao, Zhicheng Dou:
Learning Interpretable Legal Case Retrieval via Knowledge-Guided Case Reformulation. 1253-1265 - Peng Wang, Xiaobin Wang, Chao Lou, Shengyu Mao, Pengjun Xie, Yong Jiang:
Effective Demonstration Annotation for In-Context Learning via Language Model-Based Determinantal Point Process. 1266-1280 - Yuhui Zhang, Brandon McKinzie, Zhe Gan, Vaishaal Shankar, Alexander Toshev:
Pre-trained Language Models Do Not Help Auto-regressive Text-to-Image Generation. 1281-1287 - Ashima Suvarna, Xiao Liu, Tanmay Parekh, Kai-Wei Chang, Nanyun Peng:
QUDSELECT: Selective Decoding for Questions Under Discussion Parsing. 1288-1299 - Peng Chen, Xiao-Yu Guo, Yuan-Fang Li, Xiaowang Zhang, Zhiyong Feng:
Mitigating Language Bias of LMMs in Social Intelligence Understanding with Virtual Counterfactual Calibration. 1300-1310 - Zihang Liu, Yuanzhe Hu, Tianyu Pang, Yefan Zhou, Pu Ren, Yaoqing Yang:
Model Balancing Helps Low-data Training and Fine-tuning. 1311-1331 - Zhaofeng Wu, Ananth Balashankar, Yoon Kim, Jacob Eisenstein, Ahmad Beirami:
Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment. 1332-1353 - Kun Luo, Minghao Qin, Zheng Liu, Shitao Xiao, Jun Zhao, Kang Liu:
Large Language Models as Foundations for Next-Gen Dense Retrieval: A Comprehensive Empirical Assessment. 1354-1365 - Zhongwu Chen, Long Bai, Zixuan Li, Zhen Huang, Xiaolong Jin, Yong Dou:
A New Pipeline for Knowledge Graph Reasoning Enhanced by Large Language Models Without Fine-Tuning. 1366-1381 - Zhiyuan Chen, Shiqi Shen, Guangyao Shen, Gong Zhi, Xu Chen, Yankai Lin:
Towards Tool Use Alignment of Large Language Models. 1382-1400 - Ranchi Zhao, Zhen Leng Thai, Yifan Zhang, Shengding Hu, Jie Zhou, Yunqi Ba, Jie Cai, Zhiyuan Liu, Maosong Sun:
DecorateLM: Data Engineering through Corpus Rating, Tagging, and Editing with Language Models. 1401-1418 - Yung-Sung Chuang, Linlu Qiu, Cheng-Yu Hsieh, Ranjay Krishna, Yoon Kim, James R. Glass:
Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps. 1419-1436 - Yiju Guo, Ganqu Cui, Lifan Yuan, Ning Ding, Zexu Sun, Bowen Sun, Huimin Chen, Ruobing Xie, Jie Zhou, Yankai Lin, Zhiyuan Liu, Maosong Sun:
Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment. 1437-1454 - Yongsen Zheng, Ruilin Xu, Guohua Wang, Liang Lin, Kwok-Yan Lam:
Mitigating Matthew Effect: Multi-Hypergraph Boosted Multi-Interest Self-Supervised Learning for Conversational Recommendation. 1455-1466 - Haoran Li, Qiang Gao, Hongmei Wu, Li Huang:
Advancing Event Causality Identification via Heuristic Semantic Dependency Inquiry Network. 1467-1478 - Wenjian Ding, Yao Zhang, Jun Wang, Adam Jatowt, Zhenglu Yang:
Exploring Union and Intersection of Visual Regions for Generating Questions, Answers, and Distractors. 1479-1489 - Xiangyu Zhao, Yuehan Zhang, Wenlong Zhang, Xiao-Ming Wu:
UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation. 1490-1507 - Hayden S. Helm, Brandon Duderstadt, Youngser Park, Carey E. Priebe:
Tracking the perspectives of interacting language models. 1508-1519 - Zhengxuan Zhang, Yin Wu, Yuyu Luo, Nan Tang:
MAR: Matching-Augmented Reasoning for Enhancing Visual-based Entity Question Answering. 1520-1530 - Zhe Yang, Yichang Zhang, Tianyu Liu, Jian Yang, Junyang Lin, Chang Zhou, Zhifang Sui:
Can Large Language Models Always Solve Easy Problems if They Can Solve Harder Ones? 1531-1555 - Weimin Xiong, Yifan Song, Xiutian Zhao, Wenhao Wu, Xun Wang, Ke Wang, Cheng Li, Wei Peng, Sujian Li:
Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement. 1556-1572 - Joseph Marvin Imperial, Gail Forey, Harish Tayyar Madabushi:
Standardize: Aligning Language Models with Expert-Defined Standards for Content Generation. 1573-1594 - Zhihao Zhang, Sophia Yat Mei Lee, Junshuang Wu, Dong Zhang, Shoushan Li, Erik Cambria, Guodong Zhou:
Cross-domain NER with Generated Task-Oriented Knowledge: An Empirical Study from Information Density Perspective. 1595-1609 - Zhen Tan, Chengshuai Zhao, Raha Moraffah, Yifan Li, Song Wang, Jundong Li, Tianlong Chen, Huan Liu:
Glue pizza and eat rocks - Exploiting Vulnerabilities in Retrieval-Augmented Generative Models. 1610-1626 - Yuxuan Wang, Xiaoyuan Liu:
Predicate Debiasing in Vision-Language Models Integration for Scene Graph Generation Enhancement. 1627-1639 - Xiaoze Liu, Ting Sun, Tianyang Xu, Feijie Wu, Cunxiang Wang, Xiaoqian Wang, Jing Gao:
SHIELD: Evaluation and Defense Strategies for Copyright Compliance in LLM Text Generation. 1640-1670 - Jiayuan Rao, Haoning Wu, Chang Liu, Yanfeng Wang, Weidi Xie:
MatchTime: Towards Automatic Soccer Game Commentary Generation. 1671-1685 - Zheng Zhan, Yushu Wu, Zhenglun Kong, Changdi Yang, Yifan Gong, Xuan Shen, Xue Lin, Pu Zhao, Yanzhi Wang:
Rethinking Token Reduction for State Space Models. 1686-1697 - Chang Zong, Yuchen Yan, Weiming Lu, Jian Shao, Yongfeng Huang, Heng Chang, Yueting Zhuang:
Triad: A Framework Leveraging a Multi-Role LLM-based Agent to Solve Knowledge Base Question Answering. 1698-1710 - Yuyan Zhou, Liang Song, Bingning Wang, Weipeng Chen:
MetaGPT: Merging Large Language Models Using Model Exclusive Task Arithmetic. 1711-1724 - Haoyu Wang, Fengze Liu, Jiayao Zhang, Dan Roth, Kyle Richardson:
Event Causality Identification with Synthetic Control. 1725-1737 - Chang Ma, Haiteng Zhao, Lin Zheng, Jiayi Xin, Qintong Li, Lijun Wu, Zhihong Deng, Yang Lu, Qi Liu, Sheng Wang, Lingpeng Kong:
Retrieved Sequence Augmentation for Protein Representation Learning. 1738-1767 - Fan Yuan, Chi Qin, Xiaogang Xu, Piji Li:
HELPD: Mitigating Hallucination of LVLMs by Hierarchical Feedback Learning with Vision-enhanced Penalty Decoding. 1768-1785 - Chengzu Li, Caiqi Zhang, Han Zhou, Nigel Collier, Anna Korhonen, Ivan Vulic:
TopViewRS: Vision-Language Models as Top-View Spatial Reasoners. 1786-1807 - Yibo Wang, Xiangjue Dong, James Caverlee, Philip S. Yu:
DA³: A Distribution-Aware Adversarial Attack against Language Models. 1808-1825 - Xingxuan Li, Yutong Li, Lin Qiu, Shafiq Joty, Lidong Bing:
Evaluating Psychological Safety of Large Language Models. 1826-1843 - Zhuowei Chen, Lianxi Wang, Yuben Wu, Xinfeng Liao, Yujia Tian, Junyang Zhong:
An Effective Deployment of Diffusion LM for Data Augmentation in Low-Resource Sentiment Classification. 1844-1856 - Dongze Hao, Qunbo Wang, Longteng Guo, Jie Jiang, Jing Liu:
Self-Bootstrapped Visual-Language Model for Knowledge Selection and Question Answering. 1857-1868 - Libo Zhao, Jing Li, Ziqian Zeng:
PsFuture: A Pseudo-Future-based Zero-Shot Adaptive Policy for Simultaneous Machine Translation. 1869-1881 - Liang Zhang, Anwen Hu, Haiyang Xu, Ming Yan, Yichen Xu, Qin Jin, Ji Zhang, Fei Huang:
TinyChart: Efficient Chart Understanding with Program-of-Thoughts Learning and Visual Token Merging. 1882-1898 - Caiqi Zhang, Zhijiang Guo, Andreas Vlachos:
Do We Need Language-Specific Fact-Checking Models? The Case of Chinese. 1899-1914 - Zhiyuan Li, Dongnan Liu, Chaoyi Zhang, Heng Wang, Tengfei Xue, Weidong Cai:
Enhancing Advanced Visual Reasoning Ability of Large Language Models. 1915-1929 - Zecheng Tang, Keyan Zhou, Juntao Li, Yuyang Ding, Pinzheng Wang, Yan Bowen, Renjie Hua, Min Zhang:
CMD: a framework for Context-aware Model self-Detoxification. 1930-1949 - Xiaomeng Hu, Yiming Zhang, Ru Peng, Haozhe Zhang, Chenwei Wu, Gang Chen, Junbo Zhao:
Embedding and Gradient Say Wrong: A White-Box Method for Hallucination Detection. 1950-1959 - Yu Zhang, Ziyue Jiang, Ruiqi Li, Changhao Pan, Jinzheng He, Rongjie Huang, Chuxin Wang, Zhou Zhao:
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control. 1960-1975 - Junlin Li, Bo Peng, Yu-Yin Hsu, Chu-Ren Huang:
Be Helpful but Don't Talk too Much - Enhancing Helpfulness in Conversations through Relevance in Multi-Turn Emotional Support. 1976-1988 - Hyuhng Joon Kim, Youna Kim, Cheonbok Park, Junyeob Kim, Choonghyun Park, Kang Min Yoo, Sang-goo Lee, Taeuk Kim:
Aligning Language Models to Explicitly Handle Ambiguity. 1989-2007 - Daiqing Qi, Handong Zhao, Zijun Wei, Sheng Li:
Tag-grounded Visual Instruction Tuning with Retrieval Augmentation. 2008-2026 - Xuanchang Zhang, Zhuosheng Zhang, Hai Zhao:
GLaPE: Gold Label-agnostic Prompt Evaluation for Large Language Models. 2027-2039 - Runze Xia, Congchi Yin, Piji Li:
Decoding the Echoes of Vision from fMRI: Memory Disentangling for Past Semantic Information. 2040-2052 - Rui Li, Qi Liu, Liyang He, Zheng Zhang, Hao Zhang, Shengyu Ye, Junyu Lu, Zhenya Huang:
Optimizing Code Retrieval: High-Quality and Scalable Dataset Annotation through Large Language Models. 2053-2065 - Yongjin Yang, Jongwoo Ko, Se-Young Yun:
Towards Difficulty-Agnostic Efficient Transfer Learning for Vision-Language Models. 2066-2085 - Mingqian He, Yongliang Shen, Wenqi Zhang, Zeqi Tan, Weiming Lu:
Advancing Process Verification for Large Language Models via Tree-Based Preference Learning. 2086-2099 - Yu Lin, Qizhi Zhang, Quanwei Cai, Jue Hong, Wu Ye, Huiqi Liu, Bing Duan:
An Inversion Attack Against Obfuscated Embedding Matrix in Language Model Inference. 2100-2104 - Xuan He, Dongfu Jiang, Ge Zhang, Max Ku, Achint Soni, Sherman Siu, Haonan Chen, Abhranil Chandra, Ziyan Jiang, Aaran Arulraj, Kai Wang, Quy Duc Do, Yuansheng Ni, Bohan Lyu, Yaswanth Narsupalli, Rongqi Fan, Zhiheng Lyu, Bill Yuchen Lin, Wenhu Chen:
VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation. 2105-2123 - Yuxuan Wan, Wenxuan Wang, Yiliu Yang, Youliang Yuan, Jen-tse Huang, Pinjia He, Wenxiang Jiao, Michael R. Lyu:
LogicAsker: Evaluating and Improving the Logical Reasoning Ability of Large Language Models. 2124-2155 - Xiaoyang Yi, Yuru Bao, Jian Zhang, Yifang Qin, Faxin Lin:
Integrating Structural Semantic Knowledge for Enhanced Information Extraction Pre-training. 2156-2171 - Tianyuan Zou, Yang Liu, Peng Li, Jianqing Zhang, Jingjing Liu, Ya-Qin Zhang:
FuseGen: PLM Fusion for Data-generation based Zero-shot Learning. 2172-2190 - Cheng-Kuang Wu, Zhi Rui Tam, Chao-Chung Wu, Chieh-Yen Lin, Hung-yi Lee, Yun-Nung Chen:
I Need Help! Evaluating LLM's Ability to Ask for Users' Support: A Case Study on Text-to-SQL Generation. 2191-2199 - Michael Wiegand, Josef Ruppenhofer:
Oddballs and Misfits: Detecting Implicit Abuse in Which Identity Groups are Depicted as Deviating from the Norm. 2200-2218 - Hyungjun Yoon, Biniyam Aschalew Tolera, Taesik Gong, Kimin Lee, Sung-Ju Lee:
By My Eyes: Grounding Multimodal Large Language Models with Sensor Data via Visual Prompting. 2219-2241 - Seungwoo Son, Wonpyo Park, Woohyun Han, Kyuyeun Kim, Jaeho Lee:
Prefixing Attention Sinks can Mitigate Activation Outliers for Large Language Model Quantization. 2242-2252 - Fengran Mo, Abbas Ghaddar, Kelong Mao, Mehdi Rezagholizadeh, Boxing Chen, Qun Liu, Jian-Yun Nie:
CHIQ: Contextual History Enhancement for Improving Query Rewriting in Conversational Search. 2253-2268 - Jianzhao Huang, Hongzhan Lin, Ziyan Liu, Ziyang Luo, Guang Chen, Jing Ma:
Towards Low-Resource Harmful Meme Detection with LMM Agents. 2269-2293 - Zhe Hu, Yixiao Ren, Jing Li, Yu Yin:
VIVA: A Benchmark for Vision-Grounded Decision-Making with Human Values. 2294-2311 - Wentao Shi, Mengqi Yuan, Junkang Wu, Qifan Wang, Fuli Feng:
Direct Multi-Turn Preference Optimization for Language Agents. 2312-2324 - Leonardo Ranaldi, André Freitas:
Self-Refine Instruction-Tuning for Aligning Reasoning in Language Models. 2325-2347 - Huihan Li, Yuting Ning, Zeyi Liao, Siyuan Wang, Xiang Li, Ximing Lu, Wenting Zhao, Faeze Brahman, Yejin Choi, Xiang Ren:
In Search of the Long-Tail: Systematic Generation of Long-Tail Inferential Knowledge via Logical Rule Guided Search. 2348-2370 - Wenhao Huang, Zhouhong Gu, Chenghao Peng, Jiaqing Liang, Zhixu Li, Yanghua Xiao, Liqian Wen, Zulong Chen:
AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation. 2371-2389 - Shahar Katz, Yonatan Belinkov, Mor Geva, Lior Wolf:
Backward Lens: Projecting Language Model Gradients into the Vocabulary Space. 2390-2422 - Jiwan Chung, Sungjae Lee, Minseo Kim, Seungju Han, Ashkan Yousefpour, Jack Hessel, Youngjae Yu:
Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding. 2423-2451 - Jiwan Chung, Seungwon Lim, Jaehyun Jeon, Seungbeen Lee, Youngjae Yu:
Can visual language models resolve textual ambiguity with visual cues? Let visual puns tell you! 2452-2469 - Chunzhen Jin, Eliot Huang, Heng Chang, Yaqi Wang, Peng Cao, Osmar R. Zaïane:
Reusing Transferable Weight Increments for Low-resource Style Generation. 2470-2488 - Cheng-Han Chiang, Wei-Chih Chen, Chun-Yi Kuan, Chienchou Yang, Hung-yi Lee:
Large Language Model as an Assignment Evaluator: Insights, Feedback, and Challenges in a 1000+ Student Course. 2489-2513 - Neeladri Bhuiya, Viktor Schlegel, Stefan Winkler:
Seemingly Plausible Distractors in Multi-Hop Reasoning: Are Large Language Models Attentive Readers? 2514-2528 - Daixuan Cheng, Yuxian Gu, Shaohan Huang, Junyu Bi, Minlie Huang, Furu Wei:
Instruction Pre-Training: Language Models are Supervised Multitask Learners. 2529-2550 - Renzhi Wang, Piji Li:
LEMoE: Advanced Mixture of Experts Adaptor for Lifelong Model Editing of Large Language Models. 2551-2575 - Qiyuan Zhang, Fuyuan Lyu, Xue Liu, Chen Ma:
Collaborative Performance Prediction for Large Language Models. 2576-2596 - Yuqi Chen, Sixuan Li, Ying Li, Mohammad Atari:
Surveying the Dead Minds: Historical-Psychological Text Analysis with Contextualized Construct Representation (CCR) for Classical Chinese. 2597-2615 - Fanqi Wan, Xinting Huang, Leyang Cui, Xiaojun Quan, Wei Bi, Shuming Shi:
Knowledge Verification to Nip Hallucination in the Bud. 2616-2633 - Timo Pierre Schrader, Lukas Lange, Simon Razniewski, Annemarie Friedrich:
QUITE: Quantifying Uncertainty in Natural Language Text in Bayesian Reasoning Scenarios. 2634-2652 - Gregor Geigle, Radu Timofte, Goran Glavas:
African or European Swallow? Benchmarking Large Vision-Language Models for Fine-Grained Object Classification. 2653-2669 - Hongbang Yuan, Pengfei Cao, Zhuoran Jin, Yubo Chen, Daojian Zeng, Kang Liu, Jun Zhao:
Whispers that Shake Foundations: Analyzing and Mitigating False Premise Hallucinations in Large Language Models. 2670-2683 - Bastien Liétard, Pascal Denis, Mikaela Keller:
To Word Senses and Beyond: Inducing Concepts with Contextualized Language Models. 2684-2696 - Hao Wang, Hao Li, Minlie Huang, Lei Sha:
ASETF: A Novel Method for Jailbreak Attack on LLMs through Translate Suffix Embeddings. 2697-2711 - Xiutian Zhao, Ke Wang, Wei Peng:
An Electoral Approach to Diversify LLM-based Multi-Agent Collective Decision-Making. 2712-2727 - Gregor Geigle, Radu Timofte, Goran Glavas:
Does Object Grounding Really Reduce Hallucination of Large Vision-Language Models? 2728-2742 - Zhenyu Liu, Dongfang Li, Xinshuo Hu, Xinping Zhao, Yibin Chen, Baotian Hu, Min Zhang:
Take Off the Training Wheels! Progressive In-Context Learning for Effective Alignment. 2743-2757 - Yufei Ma, Zihan Liang, Huangyu Dai, Ben Chen, Dehong Gao, Zhuoran Ran, Zihan Wang, Linbo Jin, Wen Jiang, Guannan Zhang, Xiaoyan Cai, Libin Yang:
MoDULA: Mixture of Domain-Specific and Universal LoRA for Multi-Task Learning. 2758-2770 - Pinyi Zhang, Jingyang Chen, Junchen Shen, Zijie Zhai, Ping Li, Jie Zhang, Kai Zhang:
Message Passing on Semantic-Anchor-Graphs for Fine-grained Emotion Representation Learning and Classification. 2771-2783 - Yuqing Zhang, Baoyi He, Yihan Chen, Hangqi Li, Han Yue, Shengyu Zhang, Huaiyong Dou, Junchi Yan, Zemin Liu, Yongquan Zhang, Fei Wu:
PhiloGPT: A Philology-Oriented Large Language Model for Ancient Chinese Manuscripts with Dunhuang as Case Study. 2784-2801 - Quan Liu, Zhenhong Zhou, Longzhu He, Yi Liu, Wei Zhang, Sen Su:
Alignment-Enhanced Decoding: Defending Jailbreaks via Token-Level Adaptive Refining of Probability Distributions. 2802-2816 - Qiao Sun, Liujia Yang, Minghao Ma, Nanyang Ye, Qinying Gu:
MiniConGTS: A Near Ultimate Minimalist Contrastive Grid Tagging Scheme for Aspect Sentiment Triplet Extraction. 2817-2834 - Alessio Miaschi, Felice Dell'Orletta, Giulia Venturi:
Evaluating Large Language Models via Linguistic Profiling. 2835-2848 - Tyler Loakman, Yucheng Li, Chenghua Lin:
With Ears to See and Eyes to Hear: Sound Symbolism Experiments with Multimodal Large Language Models. 2849-2867 - Jiajie Zhang, Shulin Cao, Linmei Hu, Ling Feng, Lei Hou, Juanzi Li:
KB-Plugin: A Plug-and-play Framework for Large Language Models to Induce Programs over Low-resourced Knowledge Bases. 2868-2882 - Momose Oyama, Hiroaki Yamagiwa, Hidetoshi Shimodaira:
Understanding Higher-Order Correlations Among Semantic Components in Embeddings. 2883-2899 - Zhihong Zhu, Kefan Shen, Zhaorun Chen, Yunyan Zhang, Yuyan Chen, Xiaoqi Jiao, Zhongwei Wan, Shaorong Xie, Wei Liu, Xian Wu, Yefeng Zheng:
DGLF: A Dual Graph-based Learning Framework for Multi-modal Sarcasm Detection. 2900-2912 - Royi Rassin, Yaron Fairstein, Oren Kalinsky, Guy Kushilevitz, Nachshon Cohen, Alexander Libov, Yoav Goldberg:
Evaluating D-MERIT of Partial-annotation on Information Retrieval. 2913-2932 - Xin Quan, Marco Valentino, Louise A. Dennis, André Freitas:
Verification and Refinement of Natural Language Explanations through LLM-Symbolic Theorem Proving. 2933-2958 - Mozhi Zhang, Mianqiu Huang, Rundong Shi, Linsen Guo, Chong Peng, Peng Yan, Yaqian Zhou, Xipeng Qiu:
Calibrating the Confidence of Large Language Models by Eliciting Fidelity. 2959-2979 - Yanjun Chen, Dawei Zhu, Yirong Sun, Xinghao Chen, Wei Zhang, Xiaoyu Shen:
The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Language Models. 2980-2989 - Adrian Cosma, Stefan Ruseti, Mihai Dascalu, Cornelia Caragea:
How Hard is this Test Set? NLI Characterization by Exploiting Training Dynamics. 2990-3001 - Gaetan Latouche, Marc-André Carbonneau, Benjamin Swanson:
Zero-shot Cross-Lingual Transfer for Synthetic Data Generation in Grammatical Error Detection. 3002-3016 - Lukas Edman, Helmut Schmid, Alexander Fraser:
CUTE: Measuring LLMs' Understanding of Their Tokens. 3017-3026 - Xinping Zhao, Dongfang Li, Yan Zhong, Boren Hu, Yibin Chen, Baotian Hu, Min Zhang:
SEER: Self-Aligned Evidence Extraction for Retrieval-Augmented Generation. 3027-3041 - Andreas Opedal, Eleanor Chodroff, Ryan Cotterell, Ethan Wilcox:
On the Role of Context in Reading Time Prediction. 3042-3058 - Yuhang He, Jihai Zhang, Jianzhu Bao, Fangquan Lin, Cheng Yang, Bing Qin, Ruifeng Xu, Wotao Yin:
BC-Prover: Backward Chaining Prover for Formal Theorem Proving. 3059-3077 - Marius Mosbach, Vagrant Gautam, Tomás Vergara Browne, Dietrich Klakow, Mor Geva:
From Insights to Actions: The Impact of Interpretability and Analysis Research on NLP. 3078-3105 - Yekun Chai, Qingyi Liu, Jingwu Xiao, Shuohuan Wang, Yu Sun, Hua Wu:
Autoregressive Pre-Training on Pixels and Texts. 3106-3125 - Yekun Chai, Qingyi Liu, Shuohuan Wang, Yu Sun, Qiwei Peng, Hua Wu:
On Training Data Influence of GPT Models. 3126-3150 - Arjun Subramonian, Vagrant Gautam, Dietrich Klakow, Zeerak Talat:
Understanding "Democratization" in NLP and ML Research. 3151-3166 - Sungnyun Kim, Haofu Liao, Srikar Appalaraju, Peng Tang, Zhuowen Tu, Ravi Kumar Satzoda, R. Manmatha, Vijay Mahadevan, Stefano Soatto:
DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models. 3167-3193 - Seonjeong Hwang, Yunsu Kim, Gary Geunbae Lee:
Cross-lingual Transfer for Automatic Question Generation by Learning Interrogative Structures in Target Languages. 3194-3208 - Ruihang Li, Yixuan Wei, Miaosen Zhang, Nenghai Yu, Han Hu, Houwen Peng:
ScalingFilter: Assessing Data Quality through Inverse Utilization of Scaling Laws. 3209-3222 - Qiyu Wu, Masaaki Nagata, Zhongtao Miao, Yoshimasa Tsuruoka:
Word Alignment as Preference for Machine Translation. 3223-3239 - Yaxin Fan, Peifeng Li, Qiaoming Zhu:
Improving Multi-party Dialogue Generation via Topic and Rhetorical Coherence. 3240-3253 - Jinghan He, Haiyun Guo, Kuan Zhu, Zihan Zhao, Ming Tang, Jinqiao Wang:
SEEKR: Selective Attention-Guided Knowledge Retention for Continual Learning of Large Language Models. 3254-3266 - Zeping Yu, Sophia Ananiadou:
Neuron-Level Knowledge Attribution in Large Language Models. 3267-3280 - Zeping Yu, Sophia Ananiadou:
How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric Learning. 3281-3292 - Zeping Yu, Sophia Ananiadou:
Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis. 3293-3306 - Kushal Tatariya, Vladimir Araujo, Thomas Bauwens, Miryam de Lhoneux:
Pixology: Probing the Linguistic and Visual Capabilities of Pixel-based Language Models. 3307-3320 - Wei Fan, Haoran Li, Zheye Deng, Weiqi Wang, Yangqiu Song:
GoldCoin: Grounding Large Language Models in Privacy Laws via Contextual Integrity Theory. 3321-3343 - Ali Al-Laith, Daniel Hershcovich, Jens Bjerring-Hansen, Jakob Parby, Alexander Conroy, Timothy Tangherlini:
Noise, Novels, Numbers. A Framework for Detecting and Categorizing Noise in Danish and Norwegian Literature. 3344-3354 - Saleh Ashkboos, Ilia Markov, Elias Frantar, Tingxuan Zhong, Xincheng Wang, Jie Ren, Torsten Hoefler, Dan Alistarh:
QUIK: Towards End-to-end 4-Bit Inference on Generative Large Language Models. 3355-3371 - Omer Shubi, Yoav Meiri, Cfir Avraham Hadar, Yevgeni Berzak:
Fine-Grained Prediction of Reading Comprehension from Eye Movements. 3372-3391 - Ziyuan Zhuang, Zhiyang Zhang, Sitao Cheng, Fangkai Yang, Jia Liu, Shujian Huang, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang:
EfficientRAG: Efficient Retriever for Multi-Hop Question Answering. 3392-3411 - Sumuk Shashidhar, Abhinav Chinta, Vaibhav Sahai, Dilek Hakanni-Tür:
Unsupervised Human Preference Learning. 3412-3445 - Helena Bonaldi, Greta Damo, Nicolás Benjamín Ocampo, Elena Cabrio, Serena Villata, Marco Guerini:
Is Safer Better? The Impact of Guardrails on the Argumentative Strength of LLMs in Hate Speech Countering. 3446-3463 - Byung-Doh Oh, William Schuler:
Leading Whitespaces of Language Models' Subword Vocabulary Pose a Confound for Calculating Word Probabilities. 3464-3472 - Hanzhuo Tan, Qi Luo, Jing Li, Yuqun Zhang:
LLM4Decompile: Decompiling Binary Code with Large Language Models. 3473-3487 - Jihao Gu, Zelin Wang, Yibo Zhang, Ziji Zhang, Ping Gong:
From Bottom to Top: Extending the Potential of Parameter Efficient Fine-Tuning. 3488-3500 - Yike Wu, Yi Huang, Nan Hu, Yuncheng Hua, Guilin Qi, Jiaoyan Chen, Jeff Z. Pan:
CoTKR: Chain-of-Thought Enhanced Knowledge Rewriting for Complex Knowledge Graph Question Answering. 3501-3520 - Wenlong Fei, Xiaohua Wang, Min Hu, Qingyu Zhang, Hongbo Li:
MTLS: Making Texts into Linguistic Symbols. 3521-3535 - Yifan Chen, Kuntao Li, Weixing Mai, Qiaofeng Wu, Yun Xue, Fenghuan Li:
D2R: Dual-Branch Dynamic Routing Network for Multimodal Sentiment Detection. 3536-3547 - Chang Tian, Matthew B. Blaschko, Wenpeng Yin, Mingzhe Xing, Yinliang Yue, Marie-Francine Moens:
A Generic Method for Fine-grained Category Discovery in Natural Language Texts. 3548-3566 - Yang Trista Cao, Lovely-Frances Domingo, Sarah A. Gilbert, Michelle L. Mazurek, Katie Shilton, Hal Daumé III:
Toxicity Detection is NOT all you Need: Measuring the Gaps to Supporting Volunteer Content Moderators through a User-Centric Method. 3567-3587 - Jiayin Wang, Fengran Mo, Weizhi Ma, Peijie Sun, Min Zhang, Jian-Yun Nie:
A User-Centric Multi-Intent Benchmark for Evaluating Large Language Models. 3588-3612 - Qian Yang, Weixiang Yan, Aishwarya Agrawal:
Decompose and Compare Consistency: Measuring VLMs' Answer Reliability via Task-Decomposition Consistency Comparison. 3613-3627 - Lang Cao:
Learn to Refuse: Making Large Language Models More Controllable and Reliable through Knowledge Scope Limitation and Refusal Mechanism. 3628-3646 - Bocheng Zou, Mu Cai, Jianrui Zhang, Yong Jae Lee:
VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation. 3647-3659 - Shenbin Qian, Archchana Sindhujan, Minnie Kabra, Diptesh Kanojia, Constantin Orasan, Tharindu Ranasinghe, Frédéric Blain:
What do Large Language Models Need for Machine Translation Evaluation? 3660-3674 - Flavio Palo, Prateek Singhi, Bilal Fadlallah:
Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale. 3675-3687 - Debela Gemechu, Chris Reed:
External Knowledge-Driven Argument Mining: Leveraging Attention-Enhanced Multi-Network Models. 3688-3709 - Maaz Bin Musa, Steven M. Winston, Garrison Allen, Jacob Schiller, Kevin Moore, Sean Quick, Johnathan Melvin, Padmini Srinivasan, Mihailis Diamantis, Rishab Nithyanand:
C3PA: An Open Dataset of Expert-Annotated and Regulation-Aware Privacy Policies to Enable Scalable Regulatory Compliance Audits. 3710-3722 - Taowen Wang, Yiyang Liu, James Liang, Junhan Zhao, Yiming Cui, Yuning Mao, Shaoliang Nie, Jiahao Liu, Fuli Feng, Zenglin Xu, Cheng Han, Lifu Huang, Qifan Wang, Dongfang Liu:
M²PT: Multimodal Prompt Tuning for Zero-shot Instruction Learning. 3723-3740 - Letian Peng, Yi Gu, Chengyu Dong, Zihan Wang, Jingbo Shang:
Text Grafting: Near-Distribution Weak Supervision for Minority Classes in Text Classification. 3741-3752 - Letian Peng, Zilong Wang, Jingbo Shang:
Incubating Text Classifiers Following User Instruction with Nothing but LLM. 3753-3766 - Ruilin Luo, Liyuan Wang, Binghuai Lin, Zicheng Lin, Yujiu Yang:
PTD-SQL: Partitioning and Targeted Drilling with LLMs in Text-to-SQL. 3767-3799 - Wesley H. Holliday, Matthew Mandelkern, Cedegao Zhang:
Conditional and Modal Reasoning in Large Language Models. 3800-3821 - Lei Huang, Xiaocheng Feng, Weitao Ma, Liang Zhao, Yuchun Fan, Weihong Zhong, Dongliang Xu, Qing Yang, Hongtao Liu, Bing Qin:
Advancing Large Language Model Attribution through Self-Improving. 3822-3836 - Ziqi Liang, Haoxiang Shi, Hanhui Chen:
AlignCap: Aligning Speech Emotion Captioning to Human Preferences. 3837-3846 - Yihuai Hong, Aldo Lipani:
Interpretability-based Tailored Knowledge Editing in Transformers. 3847-3858 - Yongchao Chen, Jacob Arkin, Yilun Hao, Yang Zhang, Nicholas Roy, Chuchu Fan:
PRompt Optimization in Multi-Step Tasks (PROMST): Integrating Human Feedback and Heuristic-based Sampling. 3859-3920 - Chen Cai, Zheng Wang, Jianjun Gao, Wenyang Liu, Ye Lu, Runzhong Zhang, Kim-Hui Yap:
Empowering Large Language Model for Continual Video Question Answering with Collaborative Prompting. 3921-3932 - Yihuai Hong, Yuelin Zou, Lijie Hu, Ziqian Zeng, Di Wang, Haiqin Yang:
Dissecting Fine-Tuning Unlearning in Large Language Models. 3933-3941 - Zhengxuan Wu, Yuhao Zhang, Peng Qi, Yumo Xu, Rujun Han, Yian Zhang, Jifan Chen, Bonan Min, Zhiheng Huang:
Dancing in Chains: Reconciling Instruction Following and Faithfulness in Language Models. 3942-3965 - Renato Lui Geh, Honghua Zhang, Kareem Ahmed, Benjie Wang, Guy Van den Broeck:
Where is the signal in tokenization space? 3966-3979 - Tianhao Huang, Tao Yang, Ivan Habernal, Lijie Hu, Di Wang:
Private Language Models via Truncated Laplacian Mechanism. 3980-3993 - Daniela Gottesman, Mor Geva:
Estimating Knowledge in Large Language Models Without Generating a Single Token. 3994-4019 - Lan Zhang, Xin Quan, André Freitas:
Consistent Autoformalization for Constructing Mathematical Libraries. 4020-4033 - Yufei Tao, Adam Hiatt, Erik Haake, Antonie J. Jetter, Ameeta Agrawal:
When Context Leads but Parametric Memory Follows in Large Language Models. 4034-4058 - Aditya Yedetore, Najoung Kim:
Semantic Training Signals Promote Hierarchical Syntactic Generalization in Transformers. 4059-4073 - Tyler A. Chang, Catherine Arnett, Zhuowen Tu, Ben Bergen:
When Is Multilinguality a Curse? Language Modeling for 250 High- and Low-Resource Languages. 4074-4096 - Jiajun Xi, Yinong He, Jianing Yang, Yinpei Dai, Joyce Chai:
Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use. 4097-4114 - Kevin Robinson, Sneha Kudugunta, Romina Stella, Sunipa Dev, Jasmijn Bastings:
MiTTenS: A Dataset for Evaluating Gender Mistranslation. 4115-4124 - Shangbin Feng, Weijia Shi, Yike Wang, Wenxuan Ding, Orevaoghene Ahia, Shuyue Stella Li, Vidhisha Balachandran, Sunayana Sitaram, Yulia Tsvetkov:
Teaching LLMs to Abstain across Languages via Multilingual Feedback. 4125-4150 - Shangbin Feng, Taylor Sorensen, Yuhan Liu, Jillian Fisher, Chan Young Park, Yejin Choi, Yulia Tsvetkov:
Modular Pluralism: Pluralistic Alignment via Multi-LLM Collaboration. 4151-4171 - Jillian Fisher, Skyler Hallinan, Ximing Lu, Mitchell L. Gordon, Zaïd Harchaoui, Yejin Choi:
StyleRemix: Interpretable Authorship Obfuscation via Distillation and Perturbation of Style Elements. 4172-4206 - Wenting Zhao, Ge Gao, Claire Cardie, Alexander M. Rush:
I Could've Asked That: Reformulating Unanswerable Questions. 4207-4220 - Robert Morabito, Sangmitra Madhusudan, Tyler McDonald, Ali Emami:
STOP! Benchmarking Large Language Models with Sensitivity Testing on Offensive Progressions. 4221-4243 - Yujin Potter, Shiyang Lai, Junsol Kim, James Evans, Dawn Song:
Hidden Persuaders: LLMs' Political Leaning and Their Influence on Voters. 4244-4275 - Jinghan Jia, Yihua Zhang, Yimeng Zhang, Jiancheng Liu, Bharat Runwal, James Diffenderfer, Bhavya Kailkhura, Sijia Liu:
SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning. 4276-4292 - Yebowen Hu, Kaiqiang Song, Sangwoo Cho, Xiaoyang Wang, Wenlin Yao, Hassan Foroosh, Dong Yu, Fei Liu:
When Reasoning Meets Information Aggregation: A Case Study with Sports Narratives. 4293-4308 - Vu Trong Kim, Michael Krumdick, Varshini Reddy, Franck Dernoncourt, Viet Dac Lai:
An Analysis of Multilingual FActScore. 4309-4333 - Seungone Kim, Juyoung Suk, Shayne Longpre, Bill Yuchen Lin, Jamin Shin, Sean Welleck, Graham Neubig, Moontae Lee, Kyungjae Lee, Minjoon Seo:
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models. 4334-4353 - Rujun Han, Yuhao Zhang, Peng Qi, Yumo Xu, Jenyuan Wang, Lan Liu, William Yang Wang, Bonan Min, Vittorio Castelli:
RAG-QA Arena: Evaluating Domain Robustness for Long-form Retrieval Augmented Question Answering. 4354-4374 - Shengyao Zhuang, Xueguang Ma, Bevan Koopman, Jimmy Lin, Guido Zuccon:
PromptReps: Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval. 4375-4391 - Orevaoghene Ahia, Anuoluwapo Aremu, Diana Abagyan, Hila Gonen, David Ifeoluwa Adelani, Daud Abolade, Noah A. Smith, Yulia Tsvetkov:
Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects. 4392-4409 - Ju-Seung Byun, Jiyun Chun, Jihyung Kil, Andrew Perrault:
ARES: Alternating Reinforcement Learning and Supervised Fine-Tuning for Enhanced Multi-Modal Chain-of-Thought Reasoning Through Diverse AI Feedback. 4410-4430 - Rongting Zhang, Martín Bertrán, Aaron Roth:
Order of Magnitude Speedups for LLM Membership Inference. 4431-4443 - Yuwei Fang, Willi Menapace, Aliaksandr Siarohin, Tsai-Shien Chen, Kuan-Chieh Wang, Ivan Skorokhodov, Graham Neubig, Sergey Tulyakov:
VIMI: Grounding Video Generation through Multi-modal Instruction. 4444-4456 - Haiyang Wang, Yuchen Pan, Xin Song, Xuechen Zhao, Minghao Hu, Bin Zhou:
F²RL: Factuality and Faithfulness Reinforcement Learning Framework for Claim-Guided Evidence-Supported Counterspeech Generation. 4457-4470 - Chang Yang, Peng Zhang, Hui Gao, Jing Zhang:
Deciphering Rumors: A Multi-Task Learning Approach with Intent-aware Hierarchical Contrastive Learning. 4471-4483 - Qixuan Zhang, Zhifeng Wang, Dylan Zhang, Wenjia Niu, Sabrina B. Caldwell, Tom Gedeon, Yang Liu, Zhenyue Qin:
Visual Prompting in LLMs for Enhancing Emotion Recognition. 4484-4499 - Pengcheng Li, Xulong Zhang, Jing Xiao, Jianzong Wang:
IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding. 4500-4511 - Che-Wei Tsai, Yen-Hao Huang, Tsu-Keng Liao, Didier Estrada, Retnani Latifah, Yi-Shin Chen:
Leveraging Conflicts in Social Media Posts: Unintended Offense Dataset. 4512-4522 - Lingzi Hong, Pengcheng Luo, Eduardo Blanco, Xiaoying Song:
Outcome-Constrained Large Language Models for Countering Hate Speech. 4523-4536 - Changbing Yang, Garrett Nicolai, Miikka Silfverberg:
Multiple Sources are Better Than One: Incorporating External Knowledge in Low-Resource Glossing. 4537-4552 - Ao Wang, Xinghao Yang, Chen Li, Baodi Liu, Weifeng Liu:
Adaptive Immune-based Sound-Shape Code Substitution for Adversarial Chinese Text Attacks. 4553-4565 - Yangyang Zhao, Ben Niu, Mehdi Dastani, Shihan Wang:
Bootstrapped Policy Learning for Task-oriented Dialogue through Goal Shaping. 4566-4580 - Huachuan Qiu, Lizhi Ma, Zhenzhong Lan:
PsyGUARD: An Automated System for Suicide Detection and Risk Assessment in Psychological Counseling. 4581-4607 - Jiacong Wang, Bohong Wu, Haiyong Jiang, Xun Zhou, Xin Xiao, Haoyuan Guo, Jun Xiao:
World to Code: Multi-modal Data Generation via Self-Instructed Compositional Captioning and Filtering. 4608-4623 - Jing Jin, Houfeng Wang, Hao Zhang, Xiaoguang Li, Zhijiang Guo:
DVD: Dynamic Contrastive Decoding for Knowledge Amplification in Multi-Document Question Answering. 4624-4637 - Long Li, Xuzheng He, Haozhe Wang, Linlin Wang, Liang He:
How Do Humans Write Code? Large Models Do It the Same Way Too. 4638-4649 - Yufei Xiang, Yiqun Shen, Yeqin Zhang, Cam-Tu Nguyen:
Retrospex: Language Agent Meets Offline Reinforcement Learning Critic. 4650-4666 - Xinyu Liu, Runsong Zhao, Pengcheng Huang, Chunyang Xiao, Bei Li, Jingang Wang, Tong Xiao, JingBo Zhu:
Forgetting Curve: A Reliable Method for Evaluating Memorization Capability for Long-Context Models. 4667-4682 - Yuanjie Lyu, Zihan Niu, Zheyong Xie, Chao Zhang, Tong Xu, Yang Wang, Enhong Chen:
Retrieve-Plan-Generation: An Iterative Planning and Answering Framework for Knowledge-Intensive LLM Generation. 4683-4702 - Renhao Li, Minghuan Tan, Derek F. Wong, Min Yang:
CoEvol: Constructing Better Responses for Instruction Finetuning through Multi-Agent Cooperation. 4703-4721 - Bowen Jiang, Yangxinyu Xie, Zhuoqun Hao, Xiaomeng Wang, Tanwi Mallick, Weijie Su, Camillo J. Taylor, Dan Roth:
A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners. 4722-4756 - Yicheng Gao, Gonghan Xu, Zhe Wang, Arman Cohan:
Bayesian Calibration of Win Rate Estimation with LLM Evaluators. 4757-4769 - Shuo Yin, Weihao You, Zhilong Ji, Guoqiang Zhong, Jinfeng Bai:
MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning. 4770-4785 - Weijun Li, Qiongkai Xu, Mark Dras:
Seeing the Forest through the Trees: Data Leakage from Partial Transformer Gradients. 4786-4798 - Tiancheng Gu, Kaicheng Yang, Xiang An, Ziyong Feng, Dongnan Liu, Weidong Cai, Jiankang Deng:
RWKV-CLIP: A Robust Vision-Language Representation Learner. 4799-4812 - Mir Tafseer Nayeem, Davood Rafiei:
KidLM: Advancing Language Models for Children - Early Insights and Future Directions. 4813-4836 - Josh Barua, Sanjay Subramanian, Kayo Yin, Alane Suhr:
Using Language Models to Disambiguate Lexical Choices in Translation. 4837-4848 - Zhuoyan Li, Chen Liang, Jing Peng, Ming Yin:
How Does the Disclosure of AI Assistance Affect the Perceptions of Writing? 4849-4868 - Joakim Edin, Maria Maistro, Lars Maaløe, Lasse Borgholt, Jakob D. Havtorn, Tuukka Ruotsalo:
An Unsupervised Approach to Achieve Supervised-Level Explainability in Healthcare Records. 4869-4890 - Zheng Wang, Zhongyang Li, Zeren Jiang, Dandan Tu, Wei Shi:
Crafting Personalized Agents through Retrieval-Augmented Generation on Editable Memory Graphs. 4891-4906 - Jiateng Liu, Pengfei Yu, Yuji Zhang, Sha Li, Zixuan Zhang, Ruhi Sarikaya, Kevin Small, Heng Ji:
EVEDIT: Event-based Knowledge Editing for Deterministic Knowledge Propagation. 4907-4926 - Tatsuya Aoyama, Nathan Schneider:
Modeling Nonnative Sentence Processing with L2 Language Models. 4927-4940 - Chuanqi Cheng, Jian Guan, Wei Wu, Rui Yan:
From the Least to the Most: Building a Plug-and-Play Visual Reasoner via Data Synthesis. 4941-4957 - Shadi Iskander, Sofia Tolmach, Ori Shapira, Nachshon Cohen, Zohar Karnin:
Quality Matters: Evaluating Synthetic Data for Tool-Using LLMs. 4958-4976 - Yuang Li, Min Zhang, Mengxin Ren, Xiaosong Qiao, Miaomiao Ma, Daimeng Wei, Hao Yang:
Cross-Domain Audio Deepfake Detection: Dataset and Analysis. 4977-4983 - Ting Liu, Zunnan Xu, Yue Hu, Liangtao Shi, Zhiqiang Wang, Quanjun Yin:
MaPPER: Multimodal Prior-guided Parameter Efficient Tuning for Referring Expression Comprehension. 4984-4994 - Miyoung Ko, Sue Hyun Park, Joonsuk Park, Minjoon Seo:
Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Utilization. 4995-5027 - Yichong Huang, Baohang Li, Xiaocheng Feng, Wenshuai Huo, Chengpeng Fu, Ting Liu, Bing Qin:
Aligning Translation-Specific Understanding to General Understanding in Large Language Models. 5028-5041 - Mohamad Ballout, Anne Dedert, Nohayr Abdelmoneim, Ulf Krumnack, Gunther Heidemann, Kai-Uwe Kühnberger:
FOOL ME IF YOU CAN! An Adversarial Dataset to Investigate the Robustness of LMs in Word Sense Disambiguation. 5042-5059 - Jaewoo Lee, Boyang Li, Sung Ju Hwang:
Concept-skill Transferability-based Data Selection for Large Vision-Language Models. 5060-5080 - Jiangshu Du, Yibo Wang, Wenting Zhao, Zhongfen Deng, Shuaiqi Liu, Renze Lou, Henry Peng Zou, Pranav Narayanan Venkit, Nan Zhang, Mukund Srinath, Haoran Zhang, Vipul Gupta, Yinghui Li, Tao Li, Fei Wang, Qin Liu, Tianlin Liu, Pengzhi Gao, Congying Xia, Chen Xing, Cheng Jiayang, Zhaowei Wang, Ying Su, Raj Sanjay Shah, Ruohao Guo, Jing Gu, Haoran Li, Kangda Wei, Zihao Wang, Lu Cheng, Surangika Ranathunga, Meng Fang, Jie Fu, Fei Liu, Ruihong Huang, Eduardo Blanco, Yixin Cao, Rui Zhang, Philip S. Yu, Wenpeng Yin:
LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing. 5081-5099 - Mark Dredze, Genta Indra Winata, Prabhanjan Kambadur, Shijie Wu, Ozan Irsoy, Steven Lu, Vadim Dabravolski, David S. Rosenberg, Sebastian Gehrmann:
Academics Can Contribute to Domain-Specialized Language Models. 5100-5110 - Keonwoong Noh, Seokjin Oh, Woohwan Jung:
Beyond Reference: Evaluating High Quality Translations Better than Human References. 5111-5127 - Pengwei Zhan, Zhen Xu, Qian Tan, Jie Song, Ru Xie:
Unveiling the Lexical Sensitivity of LLMs: Combinatorial Optimization for Prompt Enhancement. 5128-5154 - Holy Lovenia, Rahmad Mahendra, Salsabil Maulana Akbar, Lester James V. Miranda, Jennifer Santoso, Elyanah Aco, Akhdan Fadhilah, Jonibek Mansurov, Joseph Marvin Imperial, Onno Kampman, Joel Ruben Antony Moniz, Muhammad Ravi Shulthan Habibi, Frederikus Hudi, Jann Montalan, Ryan Hadiwijaya, Joanito Agili Lopo, William Nixon, Börje Karlsson, James Jaya, Ryandito Diandaru, Yuze Gao, Patrick Amadeus Irawan, Bin Wang, Jan Christian Blaise Cruz, Chenxi Whitehouse, Ivan Halim Parmonangan, Maria Khelli, Wenyu Zhang, Lucky Susanto, Reynard Adha Ryanda, Sonny Lazuardi Hermawan, Dan John Velasco, Muhammad Dehan Al Kautsar, Willy Fitra Hendria, Yasmin Moslem, Noah Flynn, Muhammad Farid Adilazuarda, Haochen Li, Johanes Lee, R. Damanhuri, Shuo Sun, Muhammad Reza Qorib, Amirbek Djanibekov, Wei Qi Leong, Quyet V. Do, Niklas Muennighoff, Tanrada Pansuwan, Ilham Firdausi Putra, Yan Xu, Ngee Tai Chia, Ayu Purwarianti, Sebastian Ruder, William-Chandra Tjhi, Peerat Limkonchotiwat, Alham Fikri Aji, Sedrick Keh, Genta Indra Winata, Ruochen Zhang, Fajri Koto, Zheng Xin Yong, Samuel Cahyawijaya:
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages. 5155-5203 - Po-Chun Chen, Sheng-Lun Wei, Hen-Hsen Huang, Hsin-Hsi Chen:
Induct-Learn: Short Phrase Prompting with Instruction Induction. 5204-5231 - Shi Mingcong, Chunjiang Zhu, Detian Zhang, Shiting Wen, Qing Li:
Multi-Granularity History and Entity Similarity Learning for Temporal Knowledge Graph Reasoning. 5232-5243 - Caiqi Zhang, Fangyu Liu, Marco Basaldella, Nigel Collier:
LUQ: Long-text Uncertainty Quantification for LLMs. 5244-5262 - Weichao Zhang, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Yixing Fan, Xueqi Cheng:
Pretraining Data Detection for Large Language Models: A Divergence-based Calibration Method. 5263-5274 - Damien Sileo:
Scaling Synthetic Logical Reasoning Datasets with Context-Sensitive Declarative Grammars. 5275-5283 - Maxime Poli, Emmanuel Chemla, Emmanuel Dupoux:
Improving Spoken Language Modeling with Phoneme Classification: A Simple Fine-tuning Approach. 5284-5292 - Jiaying Zheng, Hainan Zhang, Lingxiang Wang, Wangjie Qiu, Hong-Wei Zheng, Zhi Ming Zheng:
Safely Learning with Private Data: A Federated Learning Framework for Large Language Model. 5293-5306 - Jiahuan Li, Yiqing Cao, Shujian Huang, Jiajun Chen:
Formality is Favored: Unraveling the Learning Preferences of Large Language Models on Data with Conflicting Knowledge. 5307-5320 - Yang Luo, Zangwei Zheng, Zirui Zhu, Yang You:
How Does the Textual Information Affect the Retrieval of Multimodal In-Context Learning? 5321-5335 - Shirley Anugrah Hayati, Minhwa Lee, Dheeraj Rajagopal, Dongyeop Kang:
How Far Can We Extract Diverse Perspectives from Large Language Models? 5336-5366 - Kiran Purohit, Venktesh V, Raghuram Devalla, Krishna Yerragorla, Sourangshu Bhattacharya, Avishek Anand:
EXPLORA: Efficient Exemplar Subset Selection for Complex Reasoning. 5367-5388 - Lexin Zhou, Youmna Farag, Andreas Vlachos:
An LLM Feature-based Framework for Dialogue Constructiveness Assessment. 5389-5409 - Zhanpeng Chen, Zhihong Zhu, Wanshi Xu, Xianwei Zhuang, Yuexian Zou:
Relevance Is a Guiding Light: Relevance-aware Adaptive Learning for End-to-end Task-oriented Dialogue System. 5410-5420 - Sergio Burdisso, Srikanth R. Madikeri, Petr Motlícek:
Dialog2Flow: Pre-training Soft-Contrastive Action-Driven Sentence Embeddings for Automatic Dialog Flow Extraction. 5421-5440 - Raphael Tang, Crystina Zhang, Lixinyu Xu, Yao Lu, Wenyan Li, Pontus Stenetorp, Jimmy Lin, Ferhan Ture:
Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation. 5441-5454 - Ilias Chalkidis:
Investigating LLMs as Voting Assistants via Contextual Augmentation: A Case Study on the European Parliament Elections 2024. 5455-5467 - Mayi Xu, Yongqi Li, Ke Sun, Tieyun Qian:
Adaption-of-Thought: Learning Question Difficulty Improves Large Language Models for Reasoning. 5468-5495 - Shengda Fan, Yanting Wang, Shasha Mo, Jianwei Niu:
LogicST: A Logical Self-Training Framework for Document-Level Relation Extraction with Incomplete Annotations. 5496-5510 - Qiwei Peng, Anders Søgaard:
Concept Space Alignment in Multilingual LLMs. 5511-5526 - Chenhan Yuan, Fei Huang, Ru Peng, Keming Lu, Bowen Yu, Chang Zhou, Jingren Zhou:
Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model. 5527-5542 - Peng Liu, Lemei Zhang, Terje Nissen Farup, Even W. Lauvrak, Jon Espen Ingvaldsen, Simen Eide, Jon Atle Gulla, Zhirong Yang:
NLEBench+NorGLM: A Comprehensive Empirical Analysis and Benchmark Dataset for Generative Language Models in Norwegian. 5543-5560 - Yifan Wang, Vera Demberg:
RSA-Control: A Pragmatics-Grounded Lightweight Controllable Text Generation Framework. 5561-5582 - Siqi Wang, Zhengyu Chen, Bei Li, Keqing He, Min Zhang, Jingang Wang:
Scaling Laws Across Model Architectures: A Comparative Analysis of Dense and MoE Models in Large Language Models. 5583-5595 - Vishal Vivek Saley, Rocktim Jyoti Das, Dinesh Raghu, Mausam:
Synergizing In-context Learning with Hints for End-to-end Task-oriented Dialog Systems. 5596-5612 - Yuhao Wang, Ruiyang Ren, Junyi Li, Xin Zhao, Jing Liu, Ji-Rong Wen:
REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering. 5613-5626 - Minzheng Wang, Longze Chen, Fu Cheng, Shengyi Liao, Xinghua Zhang, Bingli Wu, Haiyang Yu, Nan Xu, Lei Zhang, Run Luo, Yunshui Li, Min Yang, Fei Huang, Yongbin Li:
Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA. 5627-5646 - Monorama Swain, Anna Zee, Anders Søgaard:
On Mitigating Performance Disparities in Multilingual Speech Recognition. 5647-5655 - Stephen Meisenbacher, Florian Matthes:
Thinking Outside of the Differential Privacy Box: A Case Study in Text Privatization with Language Model Prompting. 5656-5665 - Junyan Lin, Haoran Chen, Dawei Zhu, Xiaoyu Shen:
To Preserve or To Compress: An In-Depth Study of Connector Selection in Multimodal Large Language Models. 5666-5680 - Esther Ploeger, Wessel Poelman, Miryam de Lhoneux, Johannes Bjerva:
What is "Typological Diversity" in NLP? 5681-5700 - Xiaobo Guo, Neil Potnis, Melody Yu, Nabeel Gillani, Soroush Vosoughi:
The Computational Anatomy of Humility: Modeling Intellectual Humility in Online Public Discourse. 5701-5723 - Georgi Shopov, Stefan Gerdjikov:
Consistent Bidirectional Language Modelling: Expressive Power and Representational Conciseness. 5724-5768 - Shravan Nayak, Kanishk Jain, Rabiul Awal, Siva Reddy, Sjoerd van Steenkiste, Lisa Anne Hendricks, Karolina Stanczak, Aishwarya Agrawal:
Benchmarking Vision Language Models for Cultural Understanding. 5769-5790 - Olga Iakovenko, Thomas Hain:
Methods of Automatic Matrix Language Determination for Code-Switched Speech. 5791-5800 - Jaewook Lee, Yeajin Jang, Hongjin Kim, Woojin Lee, Harksoo Kim:
Analyzing Key Factors Influencing Emotion Prediction Performance of VLLMs in Conversational Contexts. 5801-5816 - Jerry Huang, Prasanna Parthasarathi, Mehdi Rezagholizadeh, Sarath Chandar:
Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models. 5817-5830 - Tao Feng, Yicheng Li, Chenglin Li, Hao Chen, Fei Yu, Yin Zhang:
Teaching Small Language Models Reasoning through Counterfactual Distillation. 5831-5842 - Meet Doshi, Raj Dabre, Pushpak Bhattacharyya:
Pretraining Language Models Using Translationese. 5843-5862 - Kyle Buettner, Adriana Kovashka:
Quantifying the Gaps Between Translation and Native Perception in Training for Multimodal, Multilingual Retrieval. 5863-5870 - Qixi Lu, Endong Xun, Gongbo Tang:
MTA4DPR: Multi-Teaching-Assistants Based Iterative Knowledge Distillation for Dense Passage Retrieval. 5871-5883 - Aida Kostikova, Dominik Beese, Benjamin Paassen, Ole Pütz, Gregor Wiedemann, Steffen Eger:
Fine-Grained Detection of Solidarity for Women and Migrants in 155 Years of German Parliamentary Debates. 5884-5907 - Yu Bai, Xiyuan Zou, Heyan Huang, Sanxing Chen, Marc-Antoine Rondeau, Yang Gao, Jackie C. K. Cheung:
CItruS: Chunked Instruction-aware State Eviction for Long Sequence Modeling. 5908-5930 - Hans Ole Hatzel, Chris Biemann:
Story Embeddings - Narrative-Focused Representations of Fictional Stories. 5931-5943 - Kunting Li, Yong Hu, Liang He, Fandong Meng, Jie Zhou:
C-LLM: Learn to Check Chinese Spelling Errors Character by Character. 5944-5957 - Wenqiao Zhu, Chao Xu, Lulu Wang, Jun Wu:
PSC: Extending Context Window of Large Language Models via Phase Shift Calibration. 5958-5970 - Bin Lin, Yang Ye, Bin Zhu, Jiaxi Cui, Munan Ning, Peng Jin, Li Yuan:
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection. 5971-5984 - Tianyang Xu, Shujin Wu, Shizhe Diao, Xiaoze Liu, Xingyao Wang, Yangyi Chen, Jing Gao:
SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales. 5985-5998 - Richard Diehl Martinez, Zebulon Goriely, Andrew Caines, Paula Buttery, Lisa Beinborn:
Mitigating Frequency Bias and Anisotropy in Language Model Pre-Training with Syntactic Smoothing. 5999-6011 - Yunze Xiao, Yujia Hu, Kenny T. W. Choo, Roy Ka-Wei Lee:
ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Perturbations. 6012-6025 - Siyu Yuan, Cheng Jiayang, Lin Qiu, Deqing Yang:
Boosting Scientific Concepts Understanding: Can Analogy from Teacher Models Empower Student Models? 6026-6036 - Jirui Qi, Gabriele Sarti, Raquel Fernández, Arianna Bisazza:
Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation. 6037-6053 - Gabriele Prato, Jerry Huang, Prasanna Parthasarathi, Shagun Sodhani, Sarath Chandar:
Do Large Language Models Know How Much They Know? 6054-6070 - Somin Wadhwa, Silvio Amir, Byron C. Wallace:
Investigating Mysteries of CoT-Augmented Distillation. 6071-6086 - Zhiwen You, Kanyao Han, Haotian Zhu, Bertram Ludäscher, Jana Diesner:
SciPrompt: Knowledge-augmented Prompting for Fine-grained Categorization of Scientific Topics. 6087-6104 - Samyadeep Basu, Shell Xu Hu, Maziar Sanjabi, Daniela Massiceti, Soheil Feizi:
Distilling Knowledge from Text-to-Image Generative Models Improves Visio-Linguistic Reasoning in CLIP. 6105-6113 - Somin Wadhwa, Adit Krishnan, Runhui Wang, Byron C. Wallace, Luyang Kong:
Learning from Natural Language Explanations for Generalizable Entity Matching. 6114-6129 - Zhuohang Li, Jiaxin Zhang, Chao Yan, Kamalika Das, Kumar Sricharan, Murat Kantarcioglu, Bradley A. Malin:
Do You Know What You Are Talking About? Characterizing Query-Knowledge Relevance For Reliable Retrieval Augmented Generation. 6130-6151 - Jen-tse Huang, Wenxiang Jiao, Man Ho Lam, Eric John Li, Wenxuan Wang, Michael R. Lyu:
On the Reliability of Psychological Scales on Large Language Models. 6152-6173 - Abhishek Arora, Emily Silcock, Melissa Dell, Leander Heldring:
Contrastive Entity Coreference and Disambiguation for Historical Texts. 6174-6186 - Jeonghwan Kim, Heng Ji:
Finer: Investigating and Enhancing Fine-Grained Visual Concept Recognition in Large Vision Language Models. 6187-6207 - Sumit Asthana, Hannah Rashkin, Elizabeth Clark, Fantine Huot, Mirella Lapata:
Evaluating LLMs for Targeted Concept Simplification for Domain-Specific Texts. 6208-6226 - Lei Li, Zhihui Xie, Mukai Li, Shunian Chen, Peiyi Wang, Liang Chen, Yazheng Yang, Benyou Wang, Lingpeng Kong, Qi Liu:
VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment. 6227-6246 - Peiwen Yuan, Shaoxiong Feng, Yiwei Li, Xinglin Wang, Yueqi Zhang, Chuyi Tan, Boyuan Pan, Heda Wang, Yao Hu, Kan Li:
Focused Large Language Models are Stable Many-Shot Learners. 6247-6261 - Garrett Tanzer, Maximus Shengelia, Ken Harrenstien, David Uthus:
Reconsidering Sentence-Level Sign Language Translation. 6262-6287 - Sreyan Ghosh, Sonal Kumar, Ashish Seth, Chandra Kiran Reddy Evuru, Utkarsh Tyagi, S. Sakshi, Oriol Nieto, Ramani Duraiswami, Dinesh Manocha:
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities. 6288-6313 - Liviu P. Dinu, Ana Sabina Uban, Alina Maria Cristea, Ioan-Bogdan Iordache, Teodor-George Marchitan, Simona Georgescu, Laurentiu Zoicas:
Verba volant, scripta volant? Don't worry! There are computational solutions for protoword reconstruction. 6314-6326 - Victoria R. Li, Yida Chen, Naomi Saphra:
ChatGPT Doesn't Trust Chargers Fans: Guardrail Sensitivity in Context. 6327-6345 - Nitish Joshi, Javier Rando, Abulhair Saparov, Najoung Kim, He He:
Personas as a Way to Model Truthfulness in Language Models. 6346-6359 - Marko Sterbentz, Cameron Barrie, Shubham Shahi, Abhratanu Dutta, Donna Hooshmand, Harper Pack, Kristian J. Hammond:
Satyrn: A Platform for Analytics Augmented Generation. 6360-6385 - Ashish Seth, Ramaneswaran Selvakumar, S. Sakshi, Sonal Kumar, Sreyan Ghosh, Dinesh Manocha:
EH-MAM: Easy-to-Hard Masked Acoustic Modeling for Self-Supervised Speech Representation Learning. 6386-6400 - Qi Zhao, Haotian Fu, Chen Sun, George Konidaris:
EPO: Hierarchical LLM Agents with Environment Preference Optimization. 6401-6415 - Chantal Shaib, Yanai Elazar, Junyi Jessy Li, Byron C. Wallace:
Detection and Measurement of Syntactic Templates in Generated Text. 6416-6431 - Xinyu Pi, Mingyuan Wu, Jize Jiang, Haozhen Zheng, Beitong Tian, ChengXiang Zhai, Klara Nahrstedt, Zhiting Hu:
UOUO: Uncontextualized Uncommon Objects for Measuring Knowledge Horizons of Vision Language Models. 6432-6441 - Dominik Wagner, Seanie Lee, Ilja Baumann, Philipp Seeberger, Korbinian Riedhammer, Tobias Bocklet:
Optimized Speculative Sampling for GPU Hardware Accelerators. 6442-6458 - Zhaoxuan Tan, Zheyuan Liu, Meng Jiang:
Personalized Pieces: Efficient Personalized Large Language Models through Collaborative Efforts. 6459-6475 - Zhaoxuan Tan, Qingkai Zeng, Yijun Tian, Zheyuan Liu, Bing Yin, Meng Jiang:
Democratizing Large Language Models via Personalized Parameter-Efficient Fine-tuning. 6476-6491 - Xueguang Ma, Sheng-Chieh Lin, Minghan Li, Wenhu Chen, Jimmy Lin:
Unifying Multimodal Retrieval via Document Screenshot Embedding. 6492-6505 - Shaomu Tan, Di Wu, Christof Monz:
Neuron Specialization: Leveraging Intrinsic Task Modularity for Multilingual Machine Translation. 6506-6527 - Pranav Narayanan Venkit, Tatiana Chakravorti, Vipul Gupta, Heidi Biggs, Mukund Srinath, Koustava Goswami, Sarah Rajtmajer, Shomir Wilson:
An Audit on the Perspectives and Challenges of Hallucinations in NLP. 6528-6548 - Deniz Bayazit, Negar Foroutan, Zeming Chen, Gail Weiss, Antoine Bosselut:
Discovering Knowledge-Critical Subnetworks in Pretrained Language Models. 6549-6583 - Junjie Chu, Zeyang Sha, Michael Backes, Yang Zhang:
Reconstruct Your Previous Conversations! Comprehensively Investigating Privacy Leakage Risks in Conversations with GPT Models. 6584-6600 - Armin Toroghi, Willis Guo, Mohammad Mahdi Abdollah Pour, Scott Sanner:
Right for Right Reasons: Large Language Models for Verifiable Commonsense Knowledge Graph Question Answering. 6601-6633 - Armin Toroghi, Willis Guo, Ali Pesaranghader, Scott Sanner:
Verifiable, Debuggable, and Repairable Commonsense Logical Reasoning via LLM-based Theory Resolution. 6634-6652 - Kelly Marchisio, Wei-Yin Ko, Alexandre Berard, Théo Dehaze, Sebastian Ruder:
Understanding and Mitigating Language Confusion in LLMs. 6653-6677 - Gaël Gendron, Bao Trung Nguyen, Alex Yuxuan Peng, Michael J. Witbrock, Gillian Dobbie:
Can Large Language Models Learn Independent Causal Mechanisms? 6678-6701 - Sarfaroz Yunusov, Hamza Sidat, Ali Emami:
MirrorStories: Reflecting Diversity through Personalized Narrative Generation with Large Language Models. 6702-6717 - Ziyi Liu, Abhishek Anand, Pei Zhou, Jen-tse Huang, Jieyu Zhao:
InterIntent: Investigating Social Intelligence of LLMs via Intention Understanding in an Interactive Game Context. 6718-6746 - Farhan Samir, Chan Young Park, Anjalie Field, Vered Shwartz, Yulia Tsvetkov:
Locating Information Gaps and Narrative Inconsistencies Across Languages: A Case Study of LGBT People Portrayals on Wikipedia. 6747-6762 - Mehar Bhatia, Sahithya Ravi, Aditya Chinchure, Eunjeong Hwang, Vered Shwartz:
From Local Concepts to Universals: Evaluating the Multicultural Understanding of Vision-Language Models. 6763-6782 - Karin de Langis, Ryan Koo, Dongyeop Kang:
Dynamic Multi-Reward Weighting for Multi-Style Controllable Generation. 6783-6800 - Jiahao Huo, Yibo Yan, Boren Hu, Yutao Yue, Xuming Hu:
MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model. 6801-6816 - Haolun Wu, Ye Yuan, Liana Mikaelyan, Alexander Meulemans, Xue Liu, James Hensman, Bhaskar Mitra:
Learning to Extract Structured Entities Using Language Models. 6817-6834 - Adian Liusie, Vatsal Raina, Yassir Fathullah, Mark J. F. Gales:
Efficient LLM Comparative Assessment: A Product of Experts Framework for Pairwise Comparisons. 6835-6855 - Shira Wein, Juri Opitz:
A Survey of AMR Applications. 6856-6875 - Yiwu Zhong, Zi-Yuan Hu, Michael R. Lyu, Liwei Wang:
Beyond Embeddings: The Promise of Visual Table in Visual Reasoning. 6876-6911 - Shahla Farzana, Ivana Lucero, Vivian Villegas, Vera C. Kaelin, Mary A. Khetani, Natalie Parde:
CareCorpus+: Expanding and Augmenting Caregiver Strategy Data to Support Pediatric Rehabilitation. 6912-6927 - Guanchu Wang, Yu-Neng Chuang, Ruixiang Tang, Shaochen Zhong, Jiayi Yuan, Hongye Jin, Zirui Liu, Vipin Chaudhary, Shuai Xu, James Caverlee, Xia Ben Hu:
Taylor Unswift: Secured Weight Release for Large Language Models via Taylor Expansion. 6928-6941 - Xinying Qian, Ying Zhang, Yu Zhao, Baohang Zhou, Xuhui Sui, Li Zhang, Kehui Song:
TimeR⁴ : Time-aware Retrieval-Augmented Large Language Models for Temporal Knowledge Graph Question Answering. 6942-6952 - Xiangkun Hu, Dongyu Ru, Lin Qiu, Qipeng Guo, Tianhang Zhang, Yang Xu, Yun Luo, Pengfei Liu, Yue Zhang, Zheng Zhang:
Knowledge-Centric Hallucination Detection. 6953-6975 - Yongyu Mu, Peinan Feng, Zhiquan Cao, Yuzhang Wu, Bei Li, Chenglong Wang, Tong Xiao, Kai Song, Tongran Liu, Chunliang Zhang, JingBo Zhu:
Revealing the Parallel Multilingual Learning within Large Language Models. 6976-6997 - Weihao Zeng, Can Xu, Yingxiu Zhao, Jian-Guang Lou, Weizhu Chen:
Automatic Instruction Evolving for Large Language Models. 6998-7018 - Shuqian Sheng, Yi Xu, Tianhang Zhang, Zanwei Shen, Luoyi Fu, Jiaxin Ding, Lei Zhou, Xiaoying Gan, Xinbing Wang, Chenghu Zhou:
RepEval: Effective Text Evaluation with LLM Representation. 7019-7033 - Yuxin He, Buzhou Tang, Xiaoling Wang:
Generative Models for Automatic Medical Decision Rule Extraction from Text. 7034-7048 - Thong Nguyen, Zhiyuan Hu, Xiaobao Wu, Cong-Duy Nguyen, See-Kiong Ng, Anh Tuan Luu:
Encoding and Controlling Global Semantics for Long-form Video Question Answering. 7049-7066 - Yuping Lin, Pengfei He, Han Xu, Yue Xing, Makoto Yamada, Hui Liu, Jiliang Tang:
Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis. 7067-7085 - Cheng Gao, Chaojun Xiao, Zhenghao Liu, Huimin Chen, Zhiyuan Liu, Maosong Sun:
Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs. 7086-7100 - Ran Song, Shizhu He, Shuting Jiang, Yantuan Xian, Shengxiang Gao, Kang Liu, Zhengtao Yu:
Does Large Language Model Contain Task-Specific Neurons? 7101-7113 - Philipp Mondorf, Barbara Plank:
Liar, Liar, Logical Mire: A Benchmark for Suppositional Reasoning in Large Language Models. 7114-7137 - Hongfu Liu, Hengguan Huang, Ye Wang:
Advancing Test-Time Adaptation in Wild Acoustic Test Settings. 7138-7155 - Yunmo Chen, Tongfei Chen, Harsh Jhamtani, Patrick Xia, Richard Shin, Jason Eisner, Benjamin Van Durme:
Learning to Retrieve Iteratively for In-Context Learning. 7156-7168 - SeongKu Kang, Yunyi Zhang, Pengcheng Jiang, Dongha Lee, Jiawei Han, Hwanjo Yu:
Taxonomy-guided Semantic Indexing for Academic Paper Search. 7169-7184 - Xianzhen Luo, Qingfu Zhu, Zhiming Zhang, Libo Qin, Xuanyu Zhang, Qing Yang, Dongliang Xu, Wanxiang Che:
Python is Not Always the Best Choice: Embracing Multilingual Program of Thoughts. 7185-7212 - Hongfu Liu, Yuxi Xie, Ye Wang, Michael Shieh:
Advancing Adversarial Suffix Transfer Learning on Aligned Large Language Models. 7213-7224 - Zhiyu Cao, Peifeng Li, Yaxin Fan, Qiaoming Zhu:
Incomplete Utterance Rewriting with Editing Operation Guidance and Utterance Augmentation. 7225-7238 - Yiyuan Li, Shichao Sun, Pengfei Liu:
FRoG: Evaluating Fuzzy Reasoning of Generalized Quantifiers in LLMs. 7239-7256 - Dominik Stammbach, Philine Widmer, Eunjung Cho, Caglar Gulcehre, Elliott Ash:
Aligning Large Language Models with Diverse Political Viewpoints. 7257-7267 - Huy Nghiem, John Prindle, Jieyu Zhao, Hal Daumé III:
"You Gotta be a Doctor, Lin" : An Investigation of Name-Based Bias of Large Language Models in Employment Recommendations. 7268-7287 - Yingsheng Wu, Yuxuan Gu, Xiaocheng Feng, Weihong Zhong, Dongliang Xu, Qing Yang, Hongtao Liu, Bing Qin:
Extending Context Window of Large Language Models from a Distributional Perspective. 7288-7301 - Hakyung Sung, Kristopher Kyle:
Leveraging pre-trained language models for linguistic analysis: A case of argument structure constructions. 7302-7314 - Lin Xu, Zhiyuan Hu, Daquan Zhou, Hongyu Ren, Zhen Dong, Kurt Keutzer, See-Kiong Ng, Jiashi Feng:
MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration. 7315-7332 - Zhiyuan He, Huiqiang Jiang, Zilong Wang, Yuqing Yang, Luna Qiu, Lili Qiu:
Position Engineering: Boosting Large Language Models through Positional Information Manipulation. 7333-7345 - Junying Chen, Chi Gui, Ruyi Ouyang, Anningzhe Gao, Shunian Chen, Guiming Chen, Xidong Wang, Zhenyang Cai, Ke Ji, Xiang Wan, Benyou Wang:
Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale. 7346-7370 - Yunjia Qi, Hao Peng, Xiaozhi Wang, Bin Xu, Lei Hou, Juanzi Li:
ADELIE: Aligning Large Language Models on Information Extraction. 7371-7387 - Yifei Wang, Yuheng Chen, Wanting Wen, Yu Sheng, Linjing Li, Daniel Zeng:
Unveiling Factual Recall Behaviors of Large Language Models through Knowledge Neurons. 7388-7402 - Jindrich Libovický, Jindrich Helcl:
Lexically Grounded Subword Segmentation. 7403-7420 - Yuhui Li, Fangyun Wei, Chao Zhang, Hongyang Zhang:
EAGLE-2: Faster Inference of Language Models with Dynamic Draft Trees. 7421-7432 - Hy Nguyen, Xuefei He, Andrew Reeson, Cécile Paris, Josiah Poon, Jonathan K. Kummerfeld:
Do Text-to-Vis Benchmarks Test Real Use of Visualisations? 7433-7441 - Chengyuan Liu, Shihang Wang, Lizhi Qing, Kun Kuang, Yangyang Kang, Changlong Sun, Fei Wu:
Gold Panning in Vocabulary: An Adaptive Method for Vocabulary Expansion of Domain-Specific LLMs. 7442-7459 - Jingyu Hu, Weiru Liu, Mengnan Du:
Strategic Demonstration Selection for Improved Fairness in LLM In-Context Learning. 7460-7475 - Nguyen Dinh, Thanh Dang, Luan Thanh Nguyen, Kiet Van Nguyen:
Multi-Dialect Vietnamese: Task, Dataset, Baseline Models and Challenges. 7476-7498 - Vyas Raina, Adian Liusie, Mark J. F. Gales:
Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on Zero-shot LLM Assessment. 7499-7517 - Zhicong Lu, Li Jin, Peiguang Li, Yu Tian, Linhao Zhang, Sirui Wang, Guangluan Xu, Changyuan Tian, Xunliang Cai:
Rethinking the Reversal Curse of LLMs: a Prescription from Human Knowledge Reversal. 7518-7530 - Chengyuan Liu, Yangyang Kang, Shihang Wang, Lizhi Qing, Fubang Zhao, Chao Wu, Changlong Sun, Kun Kuang, Fei Wu:
More Than Catastrophic Forgetting: Integrating General Capabilities For Domain-Specific LLMs. 7531-7548 - Vyas Raina, Rao Ma, Charles McGhee, Kate M. Knill, Mark J. F. Gales:
Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models. 7549-7565 - Georgios Katsimpras, Georgios Paliouras:
GENRA: Enhancing Zero-shot Retrieval with Rank Aggregation. 7566-7577 - Zichen Chen, Jianda Chen, Ambuj K. Singh, Misha Sra:
XplainLLM: A Knowledge-Augmented Dataset for Reliable Grounded Explanations in LLMs. 7578-7596 - Yuanpin Zhou, Huogen Wang:
Divide and Conquer Radiology Report Generation via Observation Level Fine-grained Pretraining and Prompt Tuning. 7597-7610 - Jiashuo Sun, Jihai Zhang, Yucheng Zhou, Zhaochen Su, Xiaoye Qu, Yu Cheng:
SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information. 7611-7629 - Zhanyue Qin, Haochuan Wang, Deyuan Liu, Ziyang Song, Cunhang Fan, Zhao Lv, Jinlin Wu, Zhen Lei, Zhiying Tu, Dianhui Chu, Xiaoyan Yu, Dianbo Sui:
UNO Arena for Evaluating Sequential Decision-Making Capability of Large Language Models. 7630-7645 - Yu Gu, Yiheng Shu, Hao Yu, Xiao Liu, Yuxiao Dong, Jie Tang, Jayanth Srinivasa, Hugo Latapie, Yu Su:
Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments. 7646-7663 - Yihong Tang, Bo Wang, Dongming Zhao, Jinxiaojia Jinxiaojia, Zhangjijun Zhangjijun, Ruifang He, Yuexian Hou:
MORPHEUS: Modeling Role from Personalized Dialogue History by Exploring and Utilizing Latent Space. 7664-7676 - Wenhao Wang, Xiaoyu Liang, Rui Ye, Jingyi Chai, Siheng Chen, Yanfeng Wang:
KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from Server. 7677-7695 - Xuan Gong, Tianshi Ming, Xinpeng Wang, Zhihua Wei:
DAMRO: Dive into the Attention Mechanism of LVLM to Reduce Object Hallucination. 7696-7712 - Tianyi Men, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, Jun Zhao:
Unlocking the Future: Exploring Look-Ahead Planning Mechanistic Interpretability in Large Language Models. 7713-7724 - Wenzhen Zheng, Wenbo Pan, Xu Xu, Libo Qin, Li Yue, Ming Zhou:
Breaking Language Barriers: Cross-Lingual Continual Pre-Training at Scale. 7725-7738 - Patomporn Payoungkhamdee, Peerat Limkonchotiwat, Jinheon Baek, Potsawee Manakul, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong:
An Empirical Study of Multilingual Reasoning Distillation for Question Answering. 7739-7751 - Gal Yona, Roee Aharoni, Mor Geva:
Can Large Language Models Faithfully Express Their Intrinsic Uncertainty in Words? 7752-7764 - Zorik Gekhman, Gal Yona, Roee Aharoni, Matan Eyal, Amir Feder, Roi Reichart, Jonathan Herzig:
Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations? 7765-7784 - Ming Shan Hee, Aditi Kumaresan, Roy Ka-Wei Lee:
Bridging Modalities: Enhancing Cross-Modality Hate Speech Detection with Few-Shot In-Context Learning. 7785-7799 - Baixuan Xu, Weiqi Wang, Haochen Shi, Wenxuan Ding, Huihao Jing, Tianqing Fang, Jiaxin Bai, Xin Liu, Changlong Yu, Zheng Li, Chen Luo, Qingyu Yin, Bing Yin, Long Chen, Yangqiu Song:
MIND: Multimodal Shopping Intention Distillation from Large Vision-language Models for E-commerce Purchase Understanding. 7800-7815 - Cheng Jiayang, Chunkit Chan, Qianqian Zhuang, Lin Qiu, Tianhang Zhang, Tengxiao Liu, Yangqiu Song, Yue Zhang, Pengfei Liu, Zheng Zhang:
ECON: On the Detection and Resolution of Evidence Conflicts. 7816-7844 - Jonathan Tonglet, Marie-Francine Moens, Iryna Gurevych:
"Image, Tell me your story!" Predicting the original meta-context of visual misinformation. 7845-7864 - Zhili Shen, Pavlos Vougiouklis, Chenxin Diao, Kaustubh Vyas, Yuanyi Ji, Jeff Z. Pan:
Improving Retrieval-augmented Text-to-SQL with AST-based Ranking and Schema Pruning. 7865-7879 - Taiqiang Wu, Jiahao Wang, Zhe Zhao, Ngai Wong:
Mixture-of-Subspaces in Low-Rank Adaptation. 7880-7899 - Ishaan Watts, Varun Gumma, Aditya Yadavalli, Vivek Seshadri, Manohar Swaminathan, Sunayana Sitaram:
PARIKSHA: A Large-Scale Investigation of Human-LLM Evaluator Agreement on Multilingual and Multi-Cultural Data. 7900-7932 - Zhiwei Fei, Xiaoyu Shen, Dawei Zhu, Fengzhe Zhou, Zhuo Han, Alan Huang, Songyang Zhang, Kai Chen, Zhixin Yin, Zongwen Shen, Jidong Ge, Vincent Ng:
LawBench: Benchmarking Legal Knowledge of Large Language Models. 7933-7962 - Furkan Sahinuç, Thy Thy Tran, Yulia Grishina, Yufang Hou, Bei Chen, Iryna Gurevych:
Efficient Performance Tracking: Leveraging Large Language Models for Automated Construction of Scientific Leaderboards. 7963-7977 - Adrian Bulat, Yassine Ouali, Ricardo Guerrero, Brais Martínez, Georgios Tzimiropoulos:
Efficient Vision-Language pre-training via domain-specific learning for human activities. 7978-8000 - Wenbo Li, Guohao Li, Zhibin Lan, Xue Xu, Wanru Zhuang, Jiachen Liu, Xinyan Xiao, Jinsong Su:
Empowering Backbone Models for Visual Text Generation with Input Granularity Control and Glyph-Aware Training. 8001-8014 - Xinfeng Yuan, Siyu Yuan, Yuhan Cui, Tianhe Lin, Xintao Wang, Rui Xu, Jiangjie Chen, Deqing Yang:
Evaluating Character Understanding of Large Language Models via Character Profiling from Fictional Works. 8015-8036 - Shimao Zhang, Changjiang Gao, Wenhao Zhu, Jiajun Chen, Xin Huang, Xue Han, Junlan Feng, Chao Deng, Shujian Huang:
Getting More from Less: Large Language Models are Good Spontaneous Multilingual Learners. 8037-8051 - Hao Sun, Jiayi Wu, Hengyi Cai, Xiaochi Wei, Yue Feng, Bo Wang, Shuaiqiang Wang, Yan Zhang, Dawei Yin:
AdaSwitch: Adaptive Switching between Small and Large Agents for Effective Cloud-Local Collaborative Learning. 8052-8062 - Zi Gong, Hang Yu, Cong Liao, Bingchang Liu, Chaoyu Chen, Jianguo Li:
CoBa: Convergence Balancer for Multitask Finetuning of Large Language Models. 8063-8077 - Fei Wang, Wenxuan Zhou, James Y. Huang, Nan Xu, Sheng Zhang, Hoifung Poon, Muhao Chen:
mDPO: Conditional Preference Optimization for Multimodal Large Language Models. 8078-8088 - Fei Wang, Ninareh Mehrabi, Palash Goyal, Rahul Gupta, Kai-Wei Chang, Aram Galstyan:
Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language Models. 8089-8100 - Kaj Bostrom, Harsh Jhamtani, Hao Fang, Sam Thomson, Richard Shin, Patrick Xia, Benjamin Van Durme, Jason Eisner, Jacob Andreas:
Language-to-Code Translation with a Single Labeled Example. 8101-8112 - Jan Buchmann, Xiao Liu, Iryna Gurevych:
Attribute or Abstain: Large Language Models as Long Document Assistants. 8113-8140 - Xiaochen Wang, Jiaqi Wang, Houping Xiao, Jinghui Chen, Fenglong Ma:
FEDKIM: Adaptive Federated Knowledge Injection into Medical Foundation Models. 8141-8154 - Hao Sun, Yong Jiang, Bo Wang, Yingyan Hou, Yan Zhang, Pengjun Xie, Fei Huang:
Retrieved In-Context Principles from Previous Mistakes. 8155-8169 - Haozhe Chen, Run Chen, Julia Hirschberg:
EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control. 8170-8180 - Yifei Liu, Jicheng Wen, Yang Wang, Shengyu Ye, Li Lyna Zhang, Ting Cao, Cheng Li, Mao Yang:
VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models. 8181-8196 - Clemente Pasti, Talu Karagöz, Franz Nowak, Anej Svete, Reda Boumasmoud, Ryan Cotterell:
An L* Algorithm for Deterministic Weighted Regular Languages. 8197-8210 - Hao Sun, Hengyi Cai, Bo Wang, Yingyan Hou, Xiaochi Wei, Shuaiqiang Wang, Yan Zhang, Dawei Yin:
Towards Verifiable Text Generation with Evolving Memory and Self-Reflection. 8211-8227 - Pritish Sahu, Karan Sikka, Ajay Divakaran:
Pelican: Correcting Hallucination in Vision-LLMs via Claim Decomposition and Program of Thought Verification. 8228-8248 - Yusuke Hirota, Jerone Theodore Alexander Andrews, Dora Zhao, Orestis Papakyriakopoulos, Apostolos Modas, Yuta Nakashima, Alice Xiang:
Resampled Datasets Are Not Enough: Mitigating Societal Bias Beyond Single Attributes. 8249-8267 - Di Cao, Yong Liao, Xiuwei Shang:
RealVul: Can We Detect Vulnerabilities in Web Applications with LLM? 8268-8282 - Brendan King, Jeffrey Flanigan:
Unsupervised End-to-End Task-Oriented Dialogue with LLMs: The Power of the Noisy Channel. 8283-8300 - Guiming Chen, Shunian Chen, Ziche Liu, Feng Jiang, Benyou Wang:
Humans or LLMs as the Judge? A Study on Judgement Bias. 8301-8327 - Wenxuan Zhou, Ravi Agrawal, Shujian Zhang, Sathish Reddy Indurthi, Sanqiang Zhao, Kaiqiang Song, Silei Xu, Chenguang Zhu:
WPO: Enhancing RLHF with Weighted Preference Optimization. 8328-8340 - Rongwu Xu, Zi'an Zhou, Tianwei Zhang, Zehan Qi, Su Yao, Ke Xu, Wei Xu, Han Qiu:
Walking in Others' Shoes: How Perspective-Taking Guides Large Language Models in Reducing Toxicity and Bias. 8341-8368 - Priyanshu Gupta, Shashank Kirtania, Ananya Singha, Sumit Gulwani, Arjun Radhakrishna, Gustavo Soares, Sherry Shi:
MetaReflection: Learning Instructions for Language Agents using Past Reflections. 8369-8385 - Nico Daheim, Jakub Macina, Manu Kapur, Iryna Gurevych, Mrinmaya Sachan:
Stepwise Verification and Remediation of Student Reasoning Errors with Large Language Model Tutors. 8386-8411 - Yiran Wang, Masao Utiyama:
On Eliciting Syntax from Language Models via Hashing. 8412-8427 - Zetian Ouyang, Yishuai Qiu, Linlin Wang, Gerard de Melo, Ya Zhang, Yanfeng Wang, Liang He:
CliMedBench: A Large-Scale Chinese Benchmark for Evaluating Medical Large Language Models in Clinical Scenarios. 8428-8438 - Heng Yang, Ke Li:
The Best Defense is Attack: Repairing Semantics in Textual Adversarial Examples. 8439-8457 - Pretam Ray, Jivnesh Sandhan, Amrith Krishna, Pawan Goyal:
CSSL: Contrastive Self-Supervised Learning for Dependency Parsing on Relatively Free Word Ordered and Morphologically Rich Low Resource Languages. 8458-8466 - Catarina G. Belém, Markelle Kelly, Mark Steyvers, Sameer Singh, Padhraic Smyth:
Perceptions of Linguistic Uncertainty by Language Models and Humans. 8467-8502 - Haw-Shiuan Chang, Nanyun Peng, Mohit Bansal, Anil Ramakrishna, Tagyoung Chung:
Explaining and Improving Contrastive Decoding by Extrapolating the Probabilities of a Huge and Hypothetical LM. 8503-8526 - Xiaoyu Dong, Yujie Feng, Zexin Lu, Guangyuan Shi, Xiao-Ming Wu:
Zero-shot Cross-domain Dialogue State Tracking via Context-aware Auto-prompting and Instruction-following Contrastive Decoding. 8527-8540 - Rongwu Xu, Zehan Qi, Zhijiang Guo, Cunxiang Wang, Hongru Wang, Yue Zhang, Wei Xu:
Knowledge Conflicts for LLMs: A Survey. 8541-8565 - Saadia Gabriel, Liang Lyu, James Siderius, Marzyeh Ghassemi, Jacob Andreas, Asuman E. Ozdaglar:
MisinfoEval: Generative AI in the Era of "Alternative Facts". 8566-8578 - Benjamin Irving, Annika Schoene:
MEANT: Multimodal Encoder for Antecedent Information. 8579-8600 - Chufan Shi, Haoran Yang, Deng Cai, Zhisong Zhang, Yifan Wang, Yujiu Yang, Wai Lam:
A Thorough Examination of Decoding Methods in the Era of LLMs. 8601-8629 - Revanth Gangi Reddy, Omar Attia, Yunyao Li, Heng Ji, Saloni Potdar:
AGRaME: Any-Granularity Ranking with Multi-Vector Embeddings. 8630-8641 - Revanth Gangi Reddy, JaeHyeok Doo, Yifei Xu, Md. Arafat Sultan, Deevya Swain, Avirup Sil, Heng Ji:
FIRST: Faster Improved Listwise Reranking with Single Token Decoding. 8642-8652 - Hongjin Kim, Jai-Eun Kim, Harksoo Kim:
Exploring Nested Named Entity Recognition with Large Language Models: Methods, Challenges, and Insights. 8653-8670 - Roy Xie, Junlin Wang, Ruomin Huang, Minxing Zhang, Rong Ge, Jian Pei, Neil Gong, Bhuwan Dhingra:
ReCaLL: Membership Inference via Relative Conditional Log-Likelihoods. 8671-8689 - Karina Halevy, Anna Sotnikova, Badr AlKhamissi, Syrielle Montariol, Antoine Bosselut:
"Flex Tape Can't Fix That": Bias and Misinformation in Edited Language Models. 8690-8707 - Yujian Liu, Yang Zhang, Tommi S. Jaakkola, Shiyu Chang:
Revisiting Who's Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective. 8708-8731 - Xiao Yu, Qingyang Wu, Yu Li, Zhou Yu:
LIONs: An Empirically Optimized Approach to Align Language Models. 8732-8753 - Haochen Zhang, Yuyang Dong, Chuan Xiao, Masafumi Oyamada:
Jellyfish: Instruction-Tuning Local Large Language Models for Data Preprocessing. 8754-8782 - Yu Zhang, Xiusi Chen, Bowen Jin, Sheng Wang, Shuiwang Ji, Wei Wang, Jiawei Han:
A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery. 8783-8817 - Liyan Tang, Philippe Laban, Greg Durrett:
MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents. 8818-8847 - John Wu, David Wu, Jimeng Sun:
Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning. 8848-8871 - Bodun Hu, Le Xu, Jeongyoon Moon, Neeraja J. Yadwadkar, Aditya Akella:
MOSEL: Inference Serving Using Dynamic Modality Selection. 8872-8886 - Palak Jain, Livio Baldini Soares, Tom Kwiatkowski:
From RAG to Riches: Retrieval Interlaced with Sequence Generation. 8887-8904 - Hsuan Su, Hua Farn, Fan-Yun Sun, Shang-Tse Chen, Hung-yi Lee:
Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition. 8905-8915 - Jaehyung Kim, Dongyoung Kim, Yiming Yang:
Learning to Correct for QA Reasoning with Black-box LLMs. 8916-8937 - Ori Yoran, Samuel Joseph Amouyal, Chaitanya Malaviya, Ben Bogin, Ofir Press, Jonathan Berant:
AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks? 8938-8968 - Yapei Chang, Kalpesh Krishna, Amir Houmansadr, John Wieting, Mohit Iyyer:
PostMark: A Robust Blackbox Watermark for Large Language Models. 8969-8987 - Xiaoyu Shen, Rexhina Blloshmi, Dawei Zhu, Jiahuan Pei, Wei Zhang:
Assessing "Implicit" Retrieval Robustness of Large Language Models. 8988-9003 - Suyash Fulay, William Brannon, Shrestha Mohanty, Cassandra Overney, Elinor Poole-Dayan, Deb Roy, Jad Kabbara:
On the Relationship between Truth and Political Bias in Language Models. 9004-9018 - Karan Taneja, Ashok K. Goel:
Can Active Label Correction Improve LLM-based Modular AI Systems? 9019-9031 - Andrea Vallebueno, Cassandra Handan-Nader, Christopher D. Manning, Daniel E. Ho:
Statistical Uncertainty in Word Embeddings: GloVe-V. 9032-9047 - Rajiv Movva, Pang Wei Koh, Emma Pierson:
Annotation alignment: Comparing LLM and human annotations of conversational safety. 9048-9062 - Nigel Fernandez, Alexander Scarlatos, Wanyong Feng, Simon Woodhead, Andrew S. Lan:
DiVERT: Distractor Generation with Variational Errors Represented as Text for Math Multiple-choice Questions. 9063-9081 - Yixin Wan, Di Wu, Haoran Wang, Kai-Wei Chang:
The Factuality Tax of Diversity-Intervened Text-to-Image Generation: Benchmark and Fact-Augmented Intervention. 9082-9100 - Yuetai Li, Zhangchen Xu, Fengqing Jiang, Luyao Niu, Dinuka Sahabandu, Bhaskar Ramasubramanian, Radha Poovendran:
CleanGen: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models. 9101-9118 - Meng Cao, Lei Shu, Lei Yu, Yun Zhu, Nevan Wichers, Yinxiao Liu, Lei Meng:
Enhancing Reinforcement Learning with Dense Rewards from Language Model Critic. 9119-9138 - Layla Bouzoubaa, Elham Aghakhani, Rezvaneh Rezapour:
Words Matter: Reducing Stigma in Online Conversations about Substance Use with Large Language Models. 9139-9156 - Dingyang Chen, Qi Zhang, Yinglun Zhu:
Efficient Sequential Decision Making with Large Language Models. 9157-9170 - Zifan Jiang, Gerard Sant, Amit Moryossef, Mathias Müller, Rico Sennrich, Sarah Ebling:
SignCLIP: Connecting Text and Sign Language by Contrastive Learning. 9171-9193 - Yue Guo, Tal August, Gondy Leroy, Trevor Cohen, Lucy Lu Wang:
APPLS: Evaluating Evaluation Metrics for Plain Language Summarization. 9194-9211 - Nathaniel Weir, Ryan Thomas, Randolph D'Amore, Kellie Hill, Benjamin Van Durme, Harsh Jhamtani:
Ontologically Faithful Generation of Non-Player Character Dialogues. 9212-9242 - Luísa Shimabucoro, Sebastian Ruder, Julia Kreutzer, Marzieh Fadaee, Sara Hooker:
LLM See, LLM Do: Leveraging Active Inheritance to Target Non-Differentiable Objectives. 9243-9267 - Ekaterina Taktasheva, Maxim Bazhukov, Kirill Koncha, Alena Fenogenova, Ekaterina Artemova, Vladislav Mikhailov:
RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs. 9268-9299 - Zheye Deng, Chunkit Chan, Weiqi Wang, Yuxi Sun, Wei Fan, Tianshi Zheng, Yauwai Yim, Yangqiu Song:
Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global Tuple Extraction. 9300-9322 - Kate McCurdy, Paul Soulos, Paul Smolensky, Roland Fernandez, Jianfeng Gao:
Toward Compositional Behavior in Neural Models: A Survey of Current Views. 9323-9339 - Krista Opsahl-Ong, Michael J. Ryan, Josh Purtell, David Broman, Christopher Potts, Matei Zaharia, Omar Khattab:
Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs. 9340-9366 - Samuel Kiegeland, Ethan Wilcox, Afra Amini, David Robert Reich, Ryan Cotterell:
Reverse-Engineering the Reader. 9367-9389 - Di Wu, Jia-Chen Gu, Fan Yin, Nanyun Peng, Kai-Wei Chang:
Synchronous Faithfulness Monitoring for Trustworthy Retrieval-Augmented Generation. 9390-9406 - Kewei Cheng, Nesreen K. Ahmed, Theodore L. Willke, Yizhou Sun:
Structure Guided Prompt: Instructing Large Language Model in Multi-Step Reasoning by Exploring Graph Structure of the Text. 9407-9430 - David Schulte, Felix Hamborg, Alan Akbik:
Less is More: Parameter-Efficient Selection of Intermediate Tasks for Transfer Learning. 9431-9442 - So Lee, Mai Vu:
The effects of distance on NPI illusive effects in BERT. 9443-9457 - Nathaniel Weir, Kate Sanders, Orion Weller, Shreya Sharma, Dongwei Jiang, Zhengping Jiang, Bhavana Dalvi Mishra, Oyvind Tafjord, Peter A. Jansen, Peter Clark, Benjamin Van Durme:
Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic. 9458-9482 - Christabel Acquaye, Haozhe An, Rachel Rudinger:
Susu Box or Piggy Bank: Assessing Cultural Commonsense Knowledge between Ghana and the US. 9483-9502 - Yue Fan, Lei Ding, Ching-Chen Kuo, Shan Jiang, Yang Zhao, Xinze Guan, Jie Yang, Yi Zhang, Xin Wang:
Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding. 9503-9522 - Samuel Pfrommer, Yatong Bai, Tanmay Gautam, Somayeh Sojoudi:
Ranking Manipulation for Conversational Search Engines. 9523-9552 - Adir Rahamim, Naomi Saphra, Sara Kangaslahti, Yonatan Belinkov:
Fast Forwarding Low-Rank Training. 9553-9562 - Riccardo Fogliato, Pratik Patil, Nil-Jana Akpinar, Mathew Monfort:
Precise Model Benchmarking with Only a Few Observations. 9563-9575 - Ian Berlot-Attwell, Kumar Krishna Agrawal, Annabelle Michael Carrell, Yash Sharma, Naomi Saphra:
Attribute Diversity Determines the Systematicity Gap in VQA. 9576-9611 - Benjamin Newman, Yoonjoo Lee, Aakanksha Naik, Pao Siangliulue, Raymond Fok, Juho Kim, Daniel S. Weld, Joseph Chee Chang, Kyle Lo:
ArxivDIGESTables: Synthesizing Scientific Literature into Tables using Language Models. 9612-9631 - Raj Sanjay Shah, Khushi Bhardwaj, Sashank Varma:
Development of Cognitive Intelligence in Pre-trained Language Models. 9632-9657 - Chong Zhang, Yi Tu, Yixi Zhao, Chenshu Yuan, Huan Chen, Yue Zhang, Mingxu Chai, Ya Guo, Huijia Zhu, Qi Zhang, Tao Gui:
Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding. 9658-9678 - Sam Blouir, Jimmy T. H. Smith, Antonios Anastasopoulos, Amarda Shehu:
Birdie: Advancing State Space Language Modeling with Dynamic Mixtures of Training Objectives. 9679-9705 - Pinzhen Chen, Simon Yu, Zhicheng Guo, Barry Haddow:
Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models? 9706-9726 - Sheridan Feucht, David Atkinson, Byron C. Wallace, David Bau:
Token Erasure as a Footprint of Implicit Vocabulary Items in LLMs. 9727-9739 - Chuyi Shang, Amos You, Sanjay Subramanian, Trevor Darrell, Roei Herzig:
TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering. 9740-9766 - Biswesh Mohapatra, Manav Nitin Kapadnis, Laurent Romary, Justine Cassell:
Evaluating the Effectiveness of Large Language Models in Establishing Conversational Grounding. 9767-9781 - Zhepeng Wang, Runxue Bao, Yawen Wu, Jackson Taylor, Cao Xiao, Feng Zheng, Weiwen Jiang, Shangqian Gao, Yanfu Zhang:
Unlocking Memorization in Large Language Models with Dynamic Soft Prompting. 9782-9796 - Reza Esfandiarpoor, Cristina Menghini, Stephen H. Bach:
If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions. 9797-9819 - Bowen Zhang, Harold Soh:
Extract, Define, Canonicalize: An LLM-based Framework for Knowledge Graph Construction. 9820-9836 - Yang Liu, Huang Fang, Yunfeng Cai, Mingming Sun:
MQuinE: a Cure for "Z-paradox" in Knowledge Graph Embedding. 9837-9850 - Anej Svete, Nadav Borenstein, Mike Zhou, Isabelle Augenstein, Ryan Cotterell:
Can Transformers Learn n-gram Language Models? 9851-9867 - Minchan Kwon, Gaeun Kim, Jongsuk Kim, Haeil Lee, Junmo Kim:
StablePrompt : Automatic Prompt Tuning using Reinforcement Learning for Large Language Model. 9868-9884 - Philippe Laban, Alexander R. Fabbri, Caiming Xiong, Chien-Sheng Wu:
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems. 9885-9903 - Xiaoying Wang, Lingling Mu, Jingyi Zhang, Hongfei Xu:
Multi-pass Decoding for Grammatical Error Correction. 9904-9916 - Yucheng Jiang, Yijia Shao, Dekun Ma, Sina J. Semnani, Monica S. Lam:
Into the Unknown Unknowns: Engaged Human Learning through Participation in Language Model Agent Conversations. 9917-9955 - Chenming Tang, Zhixiang Wang, Yunfang Wu:
SCOI: Syntax-augmented Coverage-based In-context Example Selection for Machine Translation. 9956-9971 - Yuxuan Wang, Yueqian Wang, Pengfei Wu, Jianxin Liang, Dongyan Zhao, Yang Liu, Zilong Zheng:
Efficient Temporal Extrapolation of Multimodal Large Language Models with Temporal Grounding Bridge. 9972-9987 - Melanie Subbiah, Faisal Ladhak, Akankshya Mishra, Griffin Adams, Lydia B. Chilton, Kathleen R. McKeown:
STORYSUMM: Evaluating Faithfulness in Story Summarization. 9988-10005 - Haofei Yu, Zhengyang Qi, Lawrence Jang, Russ Salakhutdinov, Louis-Philippe Morency, Paul Pu Liang:
MMoE: Enhancing Multimodal Models with Mixtures of Multimodal Interaction Experts. 10006-10030 - Lu Zhang, Tiancheng Zhao, Heting Ying, Yibo Ma, Kyusong Lee:
OmAgent: A Multi-modal Agent Framework for Complex Video Understanding with Task Divide-and-Conquer. 10031-10045 - Lin Ai, Zheng Hui, Zizhou Liu, Julia Hirschberg:
Enhancing Pre-Trained Generative Language Models with Question Attended Span Extraction on Machine Reading Comprehension. 10046-10063 - Jun Rao, Xuebo Liu, Lian Lian, Shengjun Cheng, Yunjie Liao, Min Zhang:
CommonIT: Commonality-Aware Instruction Tuning for Large Language Models via Data Partitions. 10064-10083 - Yuzhe Gu, Enmao Diao:
ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers. 10084-10096 - Jaeseong Lee, Seung-won Hwang, Wonpyo Park, Mingi Ji:
Breaking ReLU Barrier: Generalized MoEfication for Dense Pretrained Models. 10097-10107 - Yang Xu, Yu Wang, Hao An, Zhichen Liu, Yongyuan Li:
Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood. 10108-10121 - Jiahui Li, Hanlin Zhang, Fengda Zhang, Tai-Wei Chang, Kun Kuang, Long Chen, Jun Zhou:
Optimizing Language Models with Fair and Stable Reward Composition in Reinforcement Learning. 10122-10140 - Xiaohua Feng, Chaochao Chen, Yuyuan Li, Zibin Lin:
Fine-grained Pluggable Gradient Ascent for Knowledge Unlearning in Language Models. 10141-10155 - Changchun Liu, Kai Zhang, Junzhe Jiang, Zirui Liu, Hanqing Tao, Min Gao, Enhong Chen:
ARM: An Alignment-and-Replacement Module for Chinese Spelling Check Based on LLMs. 10156-10168 - Zhongtao Jiang, Yuanzhe Zhang, Kun Luo, Xiaowei Yuan, Jun Zhao, Kang Liu:
On the In-context Generation of Language Models. 10169-10187 - Joe Stacey, Pasquale Minervini, Haim Dubossarsky, Oana-Maria Camburu, Marek Rei:
Atomic Inference for NLI with Generated Facts as Atoms. 10188-10204 - William Chen, Wangyou Zhang, Yifan Peng, Xinjian Li, Jinchuan Tian, Jiatong Shi, Xuankai Chang, Soumi Maiti, Karen Livescu, Shinji Watanabe:
Towards Robust Speech Representation Learning for Thousands of Languages. 10205-10224 - Xuan Ren, Biao Wu, Lingqiao Liu:
I Learn Better If You Speak My Language: Understanding the Superior Performance of Fine-Tuning Large Language Models with LLM-Generated Responses. 10225-10245 - Jiahuan Li, Shujian Huang, Aarron Ching, Xinyu Dai, Jiajun Chen:
PreAlign: Boosting Cross-Lingual Transfer by Early Establishment of Multilingual Alignment. 10246-10257 - Simran Khanuja, Sathyanarayanan Ramamoorthy, Yueqi Song, Graham Neubig:
An image speaks a thousand words, but can everyone listen? On image transcreation for cultural relevance. 10258-10279 - Ting-Yun Chang, Jesse Thomason, Robin Jia:
When Parts Are Greater Than Sums: Individual LLM Components Can Outperform Full Models. 10280-10299 - Jianxing Yu, Shiqi Wang, Han Yin, Zhenlong Sun, Ruobing Xie, Bo Zhang, Yanghui Rao:
Multimodal Clickbait Detection by De-confounding Biases Using Causal Representation Inference. 10300-10317 - Jinsung Yoon, Rajarishi Sinha, Sercan Ömer Arik, Tomas Pfister:
Matryoshka-Adaptor: Unsupervised and Supervised Tuning for Smaller Embedding Dimensions. 10318-10336 - Jianshang Kou, Benfeng Xu, Chiwei Zhu, Zhendong Mao:
KNN-Instruct: Automatic Instruction Construction with K Nearest Neighbor Deduction. 10337-10350 - Zhen Lin, Shubhendu Trivedi, Jimeng Sun:
Contextualized Sequence Likelihood: Enhanced Confidence Scores for Natural Language Generation. 10351-10368 - Fengyu Cai, Xinran Zhao, Tong Chen, Sihao Chen, Hongming Zhang, Iryna Gurevych, Heinz Koeppl:
MixGR: Enhancing Retriever Generalization for Scientific Domain through Complementary Granularity. 10369-10391 - Tuan Nguyen, Thanh Trung Huynh, Minh Hieu Phan, Quoc Viet Hung Nguyen, Phi Le Nguyen:
CARER - ClinicAl Reasoning-Enhanced Representation for Temporal Health Risk Prediction. 10392-10407 - Chuanqi Cheng, Quan Tu, Wei Wu, Shuo Shang, Cunli Mao, Zhengtao Yu, Rui Yan:
"In-Dialogues We Learn": Towards Personalized Dialogue Without Pre-defined Profiles through In-Dialogue Learning. 10408-10422 - Hanqi Yan, Yanzheng Xiang, Guangyi Chen, Yifei Wang, Lin Gui, Yulan He:
Encourage or Inhibit Monosemanticity? Revisit Monosemanticity from a Feature Decorrelation Perspective. 10423-10435 - Xin Liu, Farima Fatahi Bayat, Lu Wang:
Enhancing Language Model Factuality via Activation-Based Confidence Calibration and Guided Decoding. 10436-10448 - Esther Gan, Yiran Zhao, Liying Cheng, Yancan Mao, Anirudh Goyal, Kenji Kawaguchi, Min-Yen Kan, Michael Shieh:
Reasoning Robustness of LLMs to Adversarial Typographical Errors. 10449-10459 - Pengyu Wang, Dong Zhang, Linyang Li, Chenkun Tan, Xinghao Wang, Mozhi Zhang, Ke Ren, Botian Jiang, Xipeng Qiu:
InferAligner: Inference-Time Alignment for Harmlessness through Cross-Model Guidance. 10460-10479 - Bryan Wilie, Samuel Cahyawijaya, Etsuko Ishii, Junxian He, Pascale Fung:
Belief Revision: The Adaptability of Large Language Models Reasoning. 10480-10496 - Ji Liu, Jiaxiang Ren, Ruoming Jin, Zijie Zhang, Yang Zhou, Patrick Valduriez, Dejing Dou:
Fisher Information-based Efficient Curriculum Federated Learning with Large Language Models. 10497-10523 - Minjia Wang, Fangzhou Liu, Xiuxing Li, Bowen Dong, Zhenyu Li, Tengyu Pan, Jianyong Wang:
Bio-RFX: Refining Biomedical Extraction via Advanced Relation Classification and Structural Constraints. 10524-10539 - Keqin Bao, Jizhi Zhang, Yang Zhang, Xinyue Huo, Chong Chen, Fuli Feng:
Decoding Matters: Addressing Amplification Bias and Homogeneity Issue in Recommendations for Large Language Models. 10540-10552 - Nitish Joshi, Abulhair Saparov, Yixin Wang, He He:
LLMs Are Prone to Fallacies in Causal Inference. 10553-10569 - Ryan Louie, Ananjan Nandi, William Fang, Cheng Chang, Emma Brunskill, Diyi Yang:
Roleplay-doh: Enabling Domain-Experts to Create LLM-simulated Patients via Eliciting and Adhering to Principles. 10570-10603 - Andreas Waldis, Joel Birrer, Anne Lauscher, Iryna Gurevych:
The Lou Dataset - Exploring the Impact of Gender-Fair Language in German Text Classification. 10604-10624 - Yu Tong, Ge Chen, Guokai Zheng, Rui Li, Jiang Dazhi:
When Generative Adversarial Networks Meet Sequence Labeling Challenges. 10625-10635 - Sungho Ko, Hyunjin Cho, Hyungjoo Chae, Jinyoung Yeo, Dongha Lee:
Evidence-Focused Fact Summarization for Knowledge-Augmented Zero-Shot Question Answering. 10636-10651 - Hyundong Cho, Nicolaas Paul Jedema, Leonardo F. R. Ribeiro, Karishma Sharma, Pedro A. Szekely, Alessandro Moschitti, Ruben Janssen, Jonathan May:
Speechworthy Instruction-tuned Language Models. 10652-10670 - Jupinder Parmar, Shrimai Prabhumoye, Joseph Jennings, Bo Liu, Aastha Jhunjhunwala, Zhilin Wang, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro:
Data, Data Everywhere: A Guide for Pretraining Dataset Construction. 10671-10695 - Dilara Soylu, Christopher Potts, Omar Khattab:
Fine-Tuning and Prompt Optimization: Two Great Steps that Work Better Together. 10696-10710 - Jing Huang, Diyi Yang, Christopher Potts:
Demystifying Verbatim Memorization in Large Language Models. 10711-10732 - Ayana Niwa, Hayate Iso:
AmbigNLG: Addressing Task Ambiguity in Instruction for NLG. 10733-10752 - Marco Cognetta, Vilém Zouhar, Naoaki Okazaki:
Distributional Properties of Subword Regularization. 10753-10763 - Yajing Yang, Qian Liu, Min-Yen Kan:
DataTales: A Benchmark for Real-World Intelligent Data Narration. 10764-10788 - Euiin Yi, Taehyeon Kim, Hongseok Jeung, Du-Seong Chang, Se-Young Yun:
Towards Fast Multilingual LLM Inference: Speculative Decoding and Specialized Drafters. 10789-10802 - Yangfan Ye, Xiachong Feng, Xiaocheng Feng, Weitao Ma, Libo Qin, Dongliang Xu, Qing Yang, Hongtao Liu, Bing Qin:
GlobeSumm: A Challenging Benchmark Towards Unifying Multi-lingual, Cross-lingual and Multi-document News Summarization. 10803-10821 - Terra Blevins, Tomasz Limisiewicz, Suchin Gururangan, Margaret Li, Hila Gonen, Noah A. Smith, Luke Zettlemoyer:
Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models. 10822-10837 - Wencke Liermann, Jin-Xia Huang, Yohan Lee, Kong Joo Lee:
More Insightful Feedback for Tutoring: Enhancing Generation Mechanisms and Automatic Evaluation. 10838-10851 - Woojin Chung, Jiwoo Hong, Na Min An, James Thorne, Se-Young Yun:
Stable Language Model Pre-training by Reducing Embedding Variability. 10852-10863 - Kavya Manohar, Leena G. Pillai:
What is lost in Normalization? Exploring Pitfalls in Multilingual ASR Model Evaluations. 10864-10869 - Benjamin Schiller, Johannes Daxenberger, Andreas Waldis, Iryna Gurevych:
Diversity Over Size: On the Effect of Sample and Topic Sizes for Topic-Dependent Argument Mining Datasets. 10870-10887 - Seungjong Sun, Eungu Lee, Seo Baek, Seunghyun Hwang, Wonbyung Lee, Dongyan Nan, Bernard J. Jansen, Jang-Hyun Kim:
Kiss up, Kick down: Exploring Behavioral Changes in Multi-modal Large Language Models with Assigned Visual Personas. 10888-10901 - Junda Zhu, Lingyong Yan, Haibo Shi, Dawei Yin, Lei Sha:
ATM: Adversarial Tuning Multi-agent System Makes a Robust Retrieval-Augmented Generator. 10902-10919 - Yanjiang Chen, Kai Zhang, Feng Hu, Xianquan Wang, Ruikang Li, Qi Liu:
Dynamic Multi-granularity Attribution Network for Aspect-based Sentiment Analysis. 10920-10931 - Shahed Masoudian, Markus Frohmann, Navid Rekabsaz, Markus Schedl:
Unlabeled Debiasing in Downstream Tasks via Class-wise Low Variance Regularization. 10932-10938 - Pu Jian, Donglei Yu, Jiajun Zhang:
Large Language Models Know What is Key Visual Entity: An LLM-assisted Multimodal Retrieval for VQA. 10939-10956 - Hao Yang, Lizhen Qu, Ehsan Shareghi, Reza Haf:
Towards Probing Speech-Specific Risks in Large Multimodal Models: A Taxonomy, Benchmark, and Insights. 10957-10973 - Milan Bhan, Jean-Noël Vittaut, Nicolas Chesneau, Marie-Jeanne Lesot:
Self-AMPLIFY: Improving Small Language Models with Self Post Hoc Explanations. 10974-10991 - Wanshi Xu, Xianwei Zhuang, Zhanpeng Chen, Zhihong Zhu, Xuxin Cheng, Yuexian Zou:
What are the Generator Preferences for End-to-end Task-Oriented Dialog System? 10992-11003 - Jan Philip Wahle, Terry Ruas, Yang Xu, Bela Gipp:
Paraphrase Types Elicit Prompt Engineering Capabilities. 11004-11033 - Jingtao Cao, Zhang Zheng, Hongru Wang, Kam-Fai Wong:
VLEU: a Method for Automatic Evaluation for Generalizability of Text-to-Image Models. 11034-11049 - Ronglai Zuo, Fangyun Wei, Brian Mak:
Towards Online Continuous Sign Language Recognition and Translation. 11050-11067 - Yiwei Dai, Hengrui Gu, Ying Wang, Xin Wang:
Mitigate Extrinsic Social Bias in Pre-trained Language Models via Continuous Prompts Adjustment. 11068-11083 - Zongjie Li, Chaozheng Wang, Pingchuan Ma, Daoyuan Wu, Shuai Wang, Cuiyun Gao, Yang Liu:
Split and Merge: Aligning Position Biases in LLM-based Evaluators. 11084-11108 - Sougata Saha, Rohini K. Srihari:
Integrating Argumentation and Hate-Speech-based Techniques for Countering Misinformation. 11109-11124 - Wenda Xu, Jiachen Li, William Yang Wang, Lei Li:
BPO: Staying Close to the Behavior LLM Creates Better Online LLM Alignment. 11125-11139 - Liangying Shao, Liang Zhang, Minlong Peng, Guoqi Ma, Hao Yue, Mingming Sun, Jinsong Su:
One2Set + Large Language Model: Best Partners for Keyphrase Generation. 11140-11153 - Yifei Yuan, Yang Deng, Anders Søgaard, Mohammad Aliannejadi:
Unlocking Markets: A Multilingual Benchmark to Cross-Market Question Answering. 11154-11169 - Jiwoo Hong, Noah Lee, James Thorne:
ORPO: Monolithic Preference Optimization without Reference Model. 11170-11189 - Bowen Chen, Namgi Han, Yusuke Miyao:
A Multi-Perspective Analysis of Memorization in Large Language Models. 11190-11209 - Nicolò Penzo, Maryam Sajedinia, Bruno Lepri, Sara Tonelli, Marco Guerini:
Do LLMs suffer from Multi-Party Hangover? A Diagnostic Approach to Addressee Recognition and Response Selection in Conversations. 11210-11233 - Haritz Puerto, Martin Tutek, Somak Aditya, Xiaodan Zhu, Iryna Gurevych:
Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs. 11234-11258 - Belen Alastruey, Gerard I. Gállego, Marta R. Costa-jussà:
Unveiling the Role of Pretraining in Direct Speech Translation. 11259-11265 - Shasha Guo, Lizi Liao, Jing Zhang, Cuiping Li, Hong Chen:
PCQPR: Proactive Conversational Question Planning with Reflection. 11266-11278 - Xunzhu Tang, Kisub Kim, Yewei Song, Cedric Lothritz, Bei Li, Saad Ezzini, Haoye Tian, Jacques Klein, Tegawendé F. Bissyandé:
CodeAgent: Autonomous Communicative Agents for Code Review. 11279-11313 - Byung-Kwan Lee, Sangyun Chung, Chae Won Kim, Beomchan Park, Yong Man Ro:
TroL: Traversal of Layers for Large Language and Vision Models. 11314-11342 - Shun Wang, Ge Zhang, Han Wu, Tyler Loakman, Wenhao Huang, Chenghua Lin:
MMTE: Corpus and Metrics for Evaluating Machine Translation Quality of Metaphorical Language. 11343-11358 - Olga Zamaraeva, Carlos Gómez-Rodríguez:
Revisiting Supertagging for faster HPSG parsing. 11359-11374 - Lu Dai, Hao Liu, Hui Xiong:
Improve Dense Passage Retrieval with Entailment Tuning. 11375-11387 - Yuxiang Zhang, Jing Chen, Junjie Wang, Yaxin Liu, Cheng Yang, Chufan Shi, Xinyu Zhu, Zihao Lin, Hanwen Wan, Yujiu Yang, Tetsuya Sakai, Tian Feng, Hayato Yamana:
ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models. 11388-11422 - Rodolfo Zevallos, Núria Bel, Mireia Farrús:
TEMA: Token Embeddings Mapping for Enriching Low-Resource Language Models. 11423-11435 - Xuanming Zhang, Anthony Diaz, Zixun Chen, Qingyang Wu, Kun Qian, Erik Voss, Zhou Yu:
DECOR: Improving Coherence in L2 English Writing with a Novel Benchmark for Incoherence Detection, Reasoning, and Rewriting. 11436-11458 - Fatemeh Pesaran Zadeh, Juyeon Kim, Jin-Hwa Kim, Gunhee Kim:
Text2Chart31: Instruction Tuning for Chart Generation with Automatic Feedback. 11459-11480 - Christoph Leiter, Steffen Eger:
PrExMe! Large Scale Prompt Exploration of Open Source LLMs for Machine Translation and Summarization Evaluation. 11481-11506 - Shuai Zhao, Meihuizi Jia, Anh Tuan Luu, Fengjun Pan, Jinming Wen:
Universal Vulnerabilities in Large Language Models: Backdoor Attacks for In-context Learning. 11507-11522 - Francisco Javier Chiyah Garcia, Alessandro Suglia, Arash Eshghi:
Repairs in a Block World: A New Benchmark for Handling User Corrections with Multi-Modal Language Models. 11523-11542 - Xinrong Zhang, Yingfa Chen, Shengding Hu, Xu Han, Zihang Xu, Yuanwei Xu, Weilin Zhao, Maosong Sun, Zhiyuan Liu:
Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models. 11543-11557 - Matthias Lindemann, Alexander Koller, Ivan Titov:
Strengthening Structural Inductive Biases by Pre-training to Perform Syntactic Transformations. 11558-11573 - Panagiotis Giadikiaroglou, Maria Lymperaiou, Giorgos Filandrianos, Giorgos Stamou:
Puzzle Solving using Reasoning of Large Language Models: A Survey. 11574-11591 - Tu Anh Dinh, Carlos Mullov, Leonard Bärmann, Zhaolin Li, Danni Liu, Simon Reiß, Jueun Lee, Nathan Lerzer, Jianfeng Gao, Fabian Peller-Konrad, Tobias Röddiger, Alexander Waibel, Tamim Asfour, Michael Beigl, Rainer Stiefelhagen, Carsten Dachsbacher, Klemens Böhm, Jan Niehues:
SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading. 11592-11610 - Xiaofei Wen, Bangzheng Li, Tenghao Huang, Muhao Chen:
Red Teaming Language Models for Processing Contradictory Dialogues. 11611-11630 - Sander Land, Max Bartolo:
Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models. 11631-11646 - Houman Mehrafarin, Arash Eshghi, Ioannis Konstas:
Reasoning or a Semblance of it? A Diagnostic Study of Transitive Reasoning in LLMs. 11647-11662 - Reto Gubelmann:
Pragmatic Norms Are All You Need - Why The Symbol Grounding Problem Does Not Apply to LLMs. 11663-11678 - Kawshik Sundar, Shubham Toshniwal, Makarand Tapaswi, Vineet Gandhi:
Major Entity Identification: A Generalizable Alternative to Coreference Resolution. 11679-11695 - Xinfeng Wang, Jin Cui, Fumiyo Fukumoto, Yoshimi Suzuki:
Enhancing High-order Interaction Awareness in LLM-based Recommender Model. 11696-11711 - Akshay Paruchuri, Jake Garrison, Shun Liao, John Hernandez, Jacob E. Sunshine, Tim Althoff, Xin Liu, Daniel McDuff:
What Are the Odds? Language Models Are Capable of Probabilistic Reasoning. 11712-11733 - Han Jiang, Junwen Duan, Zhe Qu, Jianxin Wang:
MARE: Multi-Aspect Rationale Extractor on Unsupervised Rationale Extraction. 11734-11745 - Hayder Elesedy, Pedro M. Esperança, Silviu Vlad Oprea, Mete Ozay:
LoRA-Guard: Parameter-Efficient Guardrail Adaptation for Content Moderation of Large Language Models. 11746-11765 - Zhijun Xu, Siyu Yuan, Lingjie Chen, Deqing Yang:
"A good pun is its own reword": Can Large Language Models Understand Puns? 11766-11782 - Weiping Fu, Bifan Wei, Jianxiang Hu, Zhongmin Cai, Jun Liu:
QGEval: Benchmarking Multi-dimensional Evaluation for Question Generation. 11783-11803 - Ana Ezquerro, David Vilares, Carlos Gómez-Rodríguez:
Dependency Graph Parsing as Sequence Labeling. 11804-11828 - Sergei Bogdanov, Alexandre Constantin, Timothée Bernard, Benoît Crabbé, Etienne Bernard:
NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data. 11829-11841 - John Pavlopoulos, Panos Louridas, Panagiotis Filos:
Towards a Greek Proverb Atlas: Computational Spatial Exploration and Attribution of Greek Proverbs. 11842-11854 - Weize Liu, Yinlong Xu, Hongxia Xu, Jintai Chen, Xuming Hu, Jian Wu:
Unraveling Babel: Exploring Multilingual Activation Patterns of LLMs and Their Applications. 11855-11881 - Bowen Zhang, Chunping Li:
Advancing Semantic Textual Similarity Modeling: A Regression Framework with Translated ReLU and Smooth K2 Loss. 11882-11893 - Marc Brinner, Sina Zarrieß:
Rationalizing Transformer Predictions via End-To-End Differentiable Self-Training. 11894-11907 - Markus Frohmann, Igor Sterner, Ivan Vulic, Benjamin Minixhofer, Markus Schedl:
Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation. 11908-11941 - Chen Ji, Su Yang, Hongyu Sun, Yuqing Zhang:
Applying Contrastive Learning to Code Vulnerability Type Classification. 11942-11952 - Ruida Wang, Jipeng Zhang, Yizhen Jia, Rui Pan, Shizhe Diao, Renjie Pi, Tong Zhang:
TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts. 11953-11974 - Liang Zhang, Zhen Yang, Biao Fu, Ziyao Lu, Liangying Shao, Shiyu Liu, Fandong Meng, Jie Zhou, Xiaoli Wang, Jinsong Su:
Multi-Level Cross-Modal Alignment for Speech Relation Extraction. 11975-11986 - Christopher Schröder, Gerhard Heyer:
Self-Training for Sample-Efficient Active Learning for Text Classification with Pre-Trained Language Models. 11987-12004 - Jinsung Kim, Seonmin Koo, Heuiseok Lim:
PANDA: Persona Attributes Navigation for Detecting and Alleviating Overuse Problem in Large Language Models. 12005-12026 - Aakanksha, Arash Ahmadian, Beyza Ermis, Seraphina Goldfarb-Tarrant, Julia Kreutzer, Marzieh Fadaee, Sara Hooker:
The Multilingual Alignment Prism: Aligning Global and Local Preferences to Reduce Harm. 12027-12049 - Marion Di Marco, Alexander Fraser:
Subword Segmentation in LLMs: Looking at Inflection and Consistency. 12050-12060 - Omar Sharif, Joseph Gatto, Madhusudan Basak, Sarah Masud Preum:
Explicit, Implicit, and Scattered: Revisiting Event Extraction to Capture Complex Arguments. 12061-12081 - Beatriz Borges, Niket Tandon, Tanja Käser, Antoine Bosselut:
Let Me Teach You: Pedagogical Foundations of Feedback for Language Models. 12082-12104 - Jean-Flavien Bussotti, Luca Ragazzi, Giacomo Frisoni, Gianluca Moro, Paolo Papotti:
Unknown Claims: Generation of Fact-Checking Training Examples from Unstructured and Structured Data. 12105-12122 - Shrey Satapara, P. K. Srijith:
TL-CL: Task And Language Incremental Continual Learning. 12123-12142 - Daniel P. Jeong, Saurabh Garg, Zachary C. Lipton, Michael Oberst:
Medical Adaptation of Large Language and Vision-Language Models: Are We Making Progress? 12143-12170 - Leonardo Ranaldi, Giulia Pucci, Barry Haddow, Alexandra Birch:
Empowering Multi-step Reasoning across Languages via Program-Aided Language Models. 12171-12187 - Yu Yuan, Lili Zhao, Kai Zhang, Guangting Zheng, Qi Liu:
Do LLMs Overcome Shortcut Learning? An Evaluation of Shortcut Challenges in Large Language Models. 12188-12200 - Nuo Chen, Ning Wu, Jianhui Chang, Linjun Shou, Jia Li:
ControlMath: Controllable Data Generation Promotes Math Generalist Models. 12201-12217 - Liying Li, Yihan Bai, Minhao Cheng:
Where Am I From? Identifying Origin of LLM-generated Content. 12218-12229 - Tarek Naous, Michael J. Ryan, Anton Lavrouk, Mohit Chandra, Wei Xu:
ReadMe++: Benchmarking Multilingual Language Models for Multi-Domain Readability Assessment. 12230-12266 - Michael Ginn, Lindia Tjuatja, Taiqi He, Enora Rice, Graham Neubig, Alexis Palmer, Lori S. Levin:
GlossLM: A Massively Multilingual Corpus and Pretrained Model for Interlinear Glossed Text. 12267-12286 - Yang Janet Liu, Tatsuya Aoyama, Wesley Scivetti, Yilun Zhu, Shabnam Behzad, Lauren Levine, Jessica Lin, Devika Tiwari, Amir Zeldes:
GDTB: Genre Diverse Data for English Shallow Discourse Parsing across Modalities, Text Types, and Domains. 12287-12303 - Zhiyuan Zhu, Yusheng Liao, Chenxin Xu, Yunfeng Guan, Yanfeng Wang, Yu Wang:
RA2FD: Distilling Faithfulness into Efficient Dialogue Systems. 12304-12317 - Fangrui Lv, Kaixiong Gong, Jian Liang, Xinyu Pang, Changshui Zhang:
Subjective Topic meets LLMs: Unleashing Comprehensive, Reflective and Creative Thinking through the Negation of Negation. 12318-12341 - Kanishka Misra, Allyson Ettinger, Kyle Mahowald:
Experimental Contexts Can Facilitate Robust Semantic Property Inference in Language Models, but Inconsistently. 12342-12355 - Jun Bai, Zhuofan Chen, Zhenzi Li, Hanhua Hong, Jianfei Zhang, Chen Li, Chenghua Lin, Wenge Rong:
Leveraging Estimated Transferability Over Human Intuition for Model Selection in Text Ranking. 12356-12374 - Anhao Zhao, Fanghua Ye, Jinlan Fu, Xiaoyu Shen:
Unveiling In-Context Learning: A Coordinate System to Understand Its Working Mechanism. 12375-12400 - Tengfei Yu, Xuebo Liu, Zhiyi Hou, Liang Ding, Dacheng Tao, Min Zhang:
Self-Powered LLM Modality Expansion for Large Speech-Text Models. 12401-12417 - Sirui Liang, Baoli Zhang, Jun Zhao, Kang Liu:
ABSEval: An Agent-based Framework for Script Evaluation. 12418-12434 - Xuemin Yu, Fahim Dalvi, Nadir Durrani, Marzia Nouri, Hassan Sajjad:
Latent Concept-based Explanation of NLP Models. 12435-12459 - Hyunjong Ok, Jegwang Ryu, Jaeho Lee:
Decoding with Limited Teacher Supervision Requires Understanding When to Trust the Teacher. 12460-12476 - Yida Mu, Mali Jin, Xingyi Song, Nikolaos Aletras:
Enhancing Data Quality through Simple De-duplication: Navigating Responsible Computational Social Science Research. 12477-12492 - Arvid Frydenlund:
The Mystery of the Pathological Path-star Task for Language Models. 12493-12516 - Nikolas Vitsakis, Amit Parekh, Ioannis Konstas:
Voices in a Crowd: Searching for clusters of unique perspectives. 12517-12539 - Xiaoyan Yu, Tongxu Luo, Yifan Wei, Fangyu Lei, Yiming Huang, Hao Peng, Liehuang Zhu:
Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent. 12540-12557 - Lingrui Mei, Shenghua Liu, Yiwei Wang, Baolong Bi, Xueqi Cheng:
SLANG: New Concept Comprehension of Large Language Models. 12558-12575 - Michael Lan, Philip Torr, Fazl Barez:
Towards Interpretable Sequence Continuation: Analyzing Shared Circuits in Large Language Models. 12576-12601 - Jiaxin Qin, Zixuan Zhang, Chi Han, Pengfei Yu, Manling Li, Heng Ji:
Why Does New Knowledge Create Messy Ripple Effects in LLMs? 12602-12609 - Viet Dao, Van-Cuong Pham, Quyen Tran, Thanh-Thien Le, Linh Ngo, Thien Nguyen:
Lifelong Event Detection via Optimal Transport. 12610-12621 - Ben Bogin, Kejuan Yang, Shashank Gupta, Kyle Richardson, Erin Bransom, Peter Clark, Ashish Sabharwal, Tushar Khot:
SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories. 12622-12645 - KaShun Shum, Minrui Xu, Jianshu Zhang, Zixin Chen, Shizhe Diao, Hanze Dong, Jipeng Zhang, Muhammad Omer Raza:
FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation. 12646-12659 - Danielle Saunders, Steve DeNeefe:
Domain adapted machine translation: What does catastrophic forgetting forget and why? 12660-12671 - Benjamin Towle, Ke Zhou:
Enhancing AI Assisted Writing with One-Shot Implicit Negative Feedback. 12672-12680 - Raghuveer Thirukovalluru, Yukun Huang, Bhuwan Dhingra:
Atomic Self-Consistency for Better Long Form Generations. 12681-12694 - Mahammed Kamruzzaman, Hieu Nguyen, Gene Louis Kim:
"Global is Good, Local is Bad?": Understanding Brand Bias in LLMs. 12695-12702 - Siqi Li, Danni Liu, Jan Niehues:
Optimizing Rare Word Accuracy in Direct Speech Translation with a Retrieval-and-Demonstration Approach. 12703-12719 - Ryan Shea, Aymen Kallala, Xin Liu, Michael W. Morris, Zhou Yu:
ACE: A LLM-based Negotiation Coaching System. 12720-12749 - Ming Zhang, Caishuang Huang, Yilong Wu, Shichun Liu, Huiyuan Zheng, Yurui Dong, Yujiong Shen, Shihan Dou, Jun Zhao, Junjie Ye, Qi Zhang, Tao Gui, Xuanjing Huang:
TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities. 12750-12771 - Ruiyi Wang, Stephanie Milani, Jamie C. Chiu, Jiayin Zhi, Shaun M. Eack, Travis Labrum, Samuel M. Murphy, Nev Jones, Kate Hardy, Hong Shen, Fei Fang, Zhiyu Chen:
PATIENT-ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals. 12772-12797 - Xueren Ge, Abhishek Satpathy, Ronald D. Williams, John A. Stankovic, Homa Alemzadeh:
DKEC: Domain Knowledge Enhanced Multi-Label Classification for Diagnosis Prediction. 12798-12813 - Yukun Jiang, Zheng Li, Xinyue Shen, Yugeng Liu, Michael Backes, Yang Zhang:
ModSCAN: Measuring Stereotypical Bias in Large Vision-Language Models from Vision and Language Modalities. 12814-12845 - Zhenyu Wu, Qingkai Zeng, Zhihan Zhang, Zhaoxuan Tan, Chao Shen, Meng Jiang:
Large Language Models Can Self-Correct with Key Condition Verification. 12846-12867 - Zixin Tang, Janet G. van Hell:
Learning to Write Rationally: How Information Is Distributed in Non-native Speakers' Essays. 12868-12879 - Lin Ai, Tharindu Kumarage, Amrita Bhattacharjee, Zizhou Liu, Zheng Hui, Michael Davinroy, James Cook, Laura Cassani, Kirill Trapeznikov, Matthias Kirchner, Arslan Basharat, Anthony Hoogs, Joshua Garland, Huan Liu, Julia Hirschberg:
Defending Against Social Engineering Attacks in the Age of LLMs. 12880-12902 - Yae Jee Cho, Luyang Liu, Zheng Xu, Aldi Fahrezi, Gauri Joshi:
Heterogeneous LoRA for Federated Fine-tuning of On-Device Foundation Models. 12903-12913 - Yixuan Wang, Xianzhen Luo, Fuxuan Wei, Yijun Liu, Qingfu Zhu, Xuanyu Zhang, Qing Yang, Dongliang Xu, Wanxiang Che:
Make Some Noise: Unlocking Language Model Parallel Inference Capability through Noisy Training. 12914-12926 - Ernie Chang, Pin-Jie Lin, Yang Li, Changsheng Zhao, Daeil Kim, Rastislav Rabatin, Zechun Liu, Yangyang Shi, Vikas Chandra:
Target-Aware Language Modeling via Granular Data Sampling. 12927-12935 - Tanmay Parekh, Jeffrey Kwan, Jiarui Yu, Sparsh Johri, Hyosang Ahn, Sreya Muppalla, Kai-Wei Chang, Wei Wang, Nanyun Peng:
SPEED++: A Multilingual Event Extraction Framework for Epidemic Prediction and Preparedness. 12936-12965 - Mustafa Omer Gul, Yoav Artzi:
CoGen: Learning from Feedback with Coupled Comprehension and Generation. 12966-12982 - Yuanhao Xiong, Yixin Nie, Haotian Liu, Boxin Wang, Jun Chen, Rong Jin, Cho-Jui Hsieh, Lorenzo Torresani, Jie Lei:
UNICORN: A Unified Causal Video-Oriented Language-Modeling Framework for Temporal Video-Language Tasks. 12983-12997 - David Hobson, Haiqi Zhou, Derek Ruths, Andrew Piper:
Story Morals: Surfacing value-driven narrative schemas using large language models. 12998-13032 - Jaspreet Ranjit, Brihi Joshi, Rebecca Dorn, Laura Petry, Olga Koumoundouros, Jayne Bottarini, Peichen Liu, Eric Rice, Swabha Swayamdipta:
OATH-Frames: Characterizing Online Attitudes Towards Homelessness with LLM Assistants. 13033-13059 - Xiao Ye, Andrew Wang, Jacob Choi, Yining Lu, Shreya Sharma, Lingfeng Shen, Vijay Murari Tiyyala, Nicholas Andrews, Daniel Khashabi:
AnaloBench: Benchmarking the Identification of Abstract and Long-context Analogies. 13060-13082 - Qi Zhang, Zhijia Chen, Huitong Pan, Cornelia Caragea, Longin Jan Latecki, Eduard Dragut:
SciER: An Entity and Relation Extraction Dataset for Datasets, Methods, and Tasks in Scientific Documents. 13083-13100 - Ameya Godbole, Nicholas Monath, Seungyeon Kim, Ankit Singh Rawat, Andrew McCallum, Manzil Zaheer:
Analysis of Plan-based Retrieval for Grounded Text Generation. 13101-13119 - Alex Chandler, Devesh Surve, Hui Su:
Detecting Errors through Ensembling Prompts (DEEP): An End-to-End LLM Framework for Detecting Factual Errors. 13120-13133 - John Dang, Arash Ahmadian, Kelly Marchisio, Julia Kreutzer, Ahmet Üstün, Sara Hooker:
RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs. 13134-13156 - Yuanyuan Lei, Ruihong Huang:
Boosting Logical Fallacy Reasoning in LLMs via Logical Structure Tree. 13157-13173 - Erwan Fagnou, Paul Caillon, Blaise Delattre, Alexandre Allauzen:
Chain and Causal Attention for Efficient Entity Tracking. 13174-13188 - Yi Zeng, Weiyu Sun, Tran Ngoc Huynh, Dawn Song, Bo Li, Ruoxi Jia:
BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction-tuned Language Models. 13189-13215 - Zhengmian Hu, Tong Zheng, Heng Huang:
A Bayesian Approach to Harnessing the Power of LLMs in Authorship Attribution. 13216-13227 - Xiaoqiang Wang, Lingfei Wu, Tengfei Ma, Bang Liu:
FAC²E: Better Understanding Large Language Model Capabilities by Dissociating Language and Cognition. 13228-13243 - Tanvir Mahmud, Diana Marculescu:
OpenSep: Leveraging Large Language Models with Textual Inversion for Open World Audio Separation. 13244-13260 - Zhiqi Huang, Puxuan Yu, Shauli Ravfogel, James Allan:
Language Concept Erasure for Language-invariant Dense Retrieval. 13261-13273 - Danqing Wang, Kevin Yang, Hanlin Zhu, Xiaomeng Yang, Andrew Cohen, Lei Li, Yuandong Tian:
Learning Personalized Alignment for Evaluating Open-ended Text Generation. 13274-13292 - Yue Zhou, Henry Peng Zou, Barbara Di Eugenio, Yang Zhang:
Large Language Models Are Involuntary Truth-Tellers: Exploiting Fallacy Failure for Jailbreak Attacks. 13293-13304 - Zhiyuan Zeng, Qipeng Guo, Zhaoye Fei, Zhangyue Yin, Yunhua Zhou, Linyang Li, Tianxiang Sun, Hang Yan, Dahua Lin, Xipeng Qiu:
Turn Waste into Worth: Rectifying Top-k Router of MoE. 13305-13320 - Pittawat Taveekitworachai, Febri Abdullah, Ruck Thawonmas:
Null-Shot Prompting: Rethinking Prompting Large Language Models With Hallucination. 13321-13361 - Nandita Naik, Christopher Potts, Elisa Kreiss:
CommVQA: Situating Visual Question Answering in Communicative Contexts. 13362-13377 - Weilin Zhao, Yuxiang Huang, Xu Han, Wang Xu, Chaojun Xiao, Xinrong Zhang, Yewei Fang, Kaihuo Zhang, Zhiyuan Liu, Maosong Sun:
Ouroboros: Generating Longer Drafts Phrase by Phrase for Faster Speculative Decoding. 13378-13393 - Yue Huang, Chenrui Fan, Yuan Li, Siyuan Wu, Tianyi Zhou, Xiangliang Zhang, Lichao Sun:
1+1\textgreater2: Can Large Language Models Serve as Cross-Lingual Knowledge Aggregators? 13394-13412 - Teng Xiao, Mingxiao Li, Yige Yuan, Huaisheng Zhu, Chao Cui, Vasant G. Honavar:
How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective. 13413-13426 - Wen Lai, Viktor Hangya, Alexander Fraser:
Style-Specific Neurons for Steering LLMs in Text Style Transfer. 13427-13443 - Tianhua Zhang, Kun Li, Hongyin Luo, Xixin Wu, James R. Glass, Helen Meng:
Adaptive Query Rewriting: Aligning Rewriters through Marginal Probability of Conversational Answers. 13444-13461 - Sizhe Zhou, Yu Meng, Bowen Jin, Jiawei Han:
Grasping the Essentials: Tailoring Large Language Models for Zero-Shot Relation Extraction. 13462-13486 - Yiming Huang, Jianwen Luo, Yan Yu, Yitong Zhang, Fangyu Lei, Yifan Wei, Shizhu He, Lifu Huang, Xiao Liu, Jun Zhao, Kang Liu:
DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models. 13487-13521 - Zhihua Jiang, Jianwei Chen, Dongning Rao, Guanghui Ye:
Leveraging Context-Aware Prompting for Commit Message Generation. 13522-13540 - Eve Fleisig, Genevieve Smith, Madeline Bossi, Ishita Rustagi, Xavier Yin, Dan Klein:
Linguistic Bias in ChatGPT: Language Models Reinforce Dialect Discrimination. 13541-13564 - Qizhou Chen, Taolin Zhang, Xiaofeng He, Dongyang Li, Chengyu Wang, Longtao Huang, Hui Xue':
Lifelong Knowledge Editing for LLMs with Retrieval-Augmented Continuous Prompt Learning. 13565-13580 - Zhihao Wang, Shiyu Liu, Jianheng Huang, Wang Zheng, Yixuan Liao, Xiaoxin Chen, Junfeng Yao, Jinsong Su:
A Learning Rate Path Switching Training Paradigm for Version Updates of Large Language Models. 13581-13594 - Jimin Sohn, Haeji Jung, Alex Cheng, Jooeon Kang, Yilin Du, David R. Mortensen:
Zero-Shot Cross-Lingual NER Using Phonemic Representations for Low-Resource Languages. 13595-13602 - Ang Lv, Kaiyi Zhang, Shufang Xie, Quan Tu, Yuhan Chen, Ji-Rong Wen, Rui Yan:
An Analysis and Mitigation of the Reversal Curse. 13603-13615 - Chaeeun Kim, Soyoung Yoon, Hyunji Lee, Joel Jang, Sohee Yang, Minjoon Seo:
Exploring the Practicality of Generative Retrieval on Dynamic Corpora. 13616-13633 - Xukai Liu, Ye Liu, Kai Zhang, Kehang Wang, Qi Liu, Enhong Chen:
OneNet: A Fine-Tuning Free Framework for Few-Shot Entity Linking via Large Language Model Prompting. 13634-13651 - Yang Deng, Yong Zhao, Moxin Li, See-Kiong Ng, Tat-Seng Chua:
Don't Just Say "I don't know"! Self-aligning Large Language Models for Responding to Unknown Questions with Explanations. 13652-13673 - Xijie Huang, Li Lyna Zhang, Kwang-Ting Cheng, Fan Yang, Mao Yang:
Fewer is More: Boosting Math Reasoning with Reinforced Context Pruning. 13674-13695 - Fenglin Liu, Zheng Li, Hongjian Zhou, Qingyu Yin, Jingfeng Yang, Xianfeng Tang, Chen Luo, Ming Zeng, Haoming Jiang, Yifan Gao, Priyanka Nigam, Sreyashi Nag, Bing Yin, Yining Hua, Xuan Zhou, Omid Rohanian, Anshul Thakur, Lei A. Clifton, David A. Clifton:
Large Language Models Are Poor Clinical Decision-Makers: A Comprehensive Benchmark. 13696-13710 - Jinchuan Zhang, Yan Zhou, Yaxin Liu, Ziming Li, Songlin Hu:
Holistic Automated Red Teaming for Large Language Models through Top-Down Test Case Generation and Multi-turn Interaction. 13711-13736 - Van-Cuong Pham, Thien Nguyen:
Householder Pseudo-Rotation: A Novel Approach to Activation Editing in LLMs with Direction-Magnitude Perspective. 13737-13751 - Jinyoung Kim, Dayoon Ko, Gunhee Kim:
DynamicER: Resolving Emerging Mentions to Dynamic Entities for RAG. 13752-13770 - Quyen Tran, Nguyen Xuan Thanh, Nguyen Hoang Anh, Nam Le Hai, Trung Le, Linh Van Ngo, Thien Huu Nguyen:
Preserving Generalization of Language models in Few-shot Continual Relation Extraction. 13771-13784 - Md. Tahmid Rahman Laskar, Sawsan Alqahtani, M. Saiful Bari, Mizanur Rahman, Mohammad Abdullah Matin Khan, Haidar Khan, Israt Jahan, Amran Bhuiyan, Chee-Wei Tan, Md. Rizwan Parvez, Enamul Hoque, Shafiq Joty, Jimmy Huang:
A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations. 13785-13816 - Shuaiyi Li, Yang Deng, Deng Cai, Hongyuan Lu, Liang Chen, Wai Lam:
Consecutive Batch Model Editing with HooK Layers. 13817-13833 - Linyi Ding, Jinfeng Xiao, Sizhe Zhou, Chaoqi Yang, Jiawei Han:
Topic-Oriented Open Relation Extraction with A Priori Seed Generation. 13834-13845 - Xiangci Li, Jessica Ouyang:
Related Work and Citation Text Generation: A Survey. 13846-13864 - Liangxin Liu, Xuebo Liu, Lian Lian, Shengjun Cheng, Jun Rao, Tengfei Yu, Hexuan Deng, Min Zhang:
Curriculum Consistency Learning for Conditional Sentence Generation. 13865-13881 - Leonardo Bertolazzi, Albert Gatt, Raffaella Bernardi:
A Systematic Analysis of Large Language Models as Soft Reasoners: The Case of Syllogistic Inferences. 13882-13905 - Fan Jiang, Tom Drummond, Trevor Cohn:
Pre-training Cross-lingual Open Domain Question Answering with Large-scale Synthetic Supervision. 13906-13933 - Marco Gaido, Sara Papi, Luisa Bentivogli, Alessio Brutti, Mauro Cettolo, Roberto Gretter, Marco Matassoni, Mohamed Nabih, Matteo Negri:
MOSEL: 950, 000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages. 13934-13947 - Jiashi Lin, Lifang Wang, Xinyu Lu, Zhongtian Hu, Wei Zhang, Wenxuan Lu:
Improving Knowledge Graph Completion with Structure-Aware Supervised Contrastive Learning. 13948-13959 - Ali Basirat, Navid Hemmati:
Contribution of Linguistic Typology to Universal Dependency Parsing: An Empirical Investigation. 13960-13971 - Francesco Periti, Pierluigi Cassotti, Stefano Montanelli, Nina Tahmasebi, Dominik Schlechtweg:
TRoTR: A Framework for Evaluating the Re-contextualization of Text Reuse. 13972-13990 - Jiateng Wei, Quan Lu, Ning Jiang, Siqi Li, Jingyang Xiang, Jun Chen, Yong Liu:
Structured Optimal Brain Pruning for Large Language Models. 13991-14007 - Francesco Periti, David Alfter, Nina Tahmasebi:
Automatically Generated Definitions and their utility for Modeling Word Meaning. 14008-14026 - Yejie Wang, Keqing He, Dayuan Fu, Zhuoma Gongque, Heyang Xu, Yanxu Chen, Zhexu Wang, Yujia Fu, Guanting Dong, Muxi Diao, Jingang Wang, Mengdi Zhang, Xunliang Cai, Weiran Xu:
How Do Your Code LLMs perform? Empowering Code Instruction Tuning with Really Good Data. 14027-14043 - Weiwei Sun, Zhengliang Shi, Wu Long, Lingyong Yan, Xinyu Ma, Yiding Liu, Min Cao, Dawei Yin, Zhaochun Ren:
MAIR: A Massive Benchmark for Evaluating Instructed Retrieval. 14044-14067 - Guoxin Yu, Lemao Liu, Mo Yu, Yue Yu, Xiang Ao:
Rethinking the Evaluation of In-Context Learning for LLMs. 14068-14082 - Walter Laurito, Sharan Maiya, Grégoire Dhimoïla, Owen Yeung, Kaarel Hänni:
Cluster-Norm for Unsupervised Probing of Knowledge. 14083-14112 - Eden Biran, Daniela Gottesman, Sohee Yang, Mor Geva, Amir Globerson:
Hopping Too Late: Exploring the Limitations of Large Language Models on Multi-Hop Queries. 14113-14130 - Kangxi Wu, Liang Pang, Huawei Shen, Xueqi Cheng:
Enhancing Training Data Attribution for Large Language Models with Fitting Error Consideration. 14131-14143 - Seonmin Koo, Jinsung Kim, Youngjoon Jang, Chanjun Park, Heuiseok Lim:
Where am I? Large Language Models Wandering between Semantics and Structures in Long Contexts. 14144-14160 - Matthew Shu, Nishant Balepur, Shi Feng, Jordan L. Boyd-Graber:
KARL: Knowledge-Aware Retrieval and Representations aid Retention and Learning in Students. 14161-14178 - Yijia Xiao, Yiqiao Jin, Yushi Bai, Yue Wu, Xianjun Yang, Xiao Luo, Wenchao Yu, Xujiang Zhao, Yanchi Liu, Quanquan Gu, Haifeng Chen, Wei Wang, Wei Cheng:
Large Language Models Can Be Contextual Privacy Protection Learners. 14179-14201 - Nishant Balepur, Matthew Shu, Alexander Miserlis Hoyle, Alison Robey, Shi Feng, Seraphina Goldfarb-Tarrant, Jordan L. Boyd-Graber:
A SMART Mnemonic Sounds like "Glue Tonic": Mixing LLMs with Student Feedback to Make Mnemonic Learning Stick. 14202-14225 - Minghao Wu, Thuy-Trang Vu, Lizhen Qu, Reza Haf:
Mixture-of-Skills: Learning to Optimize Data Usage for Fine-Tuning Large Language Models. 14226-14240 - Jun-Hyung Park, Yeachan Kim, Mingyu Lee, Hyuntae Park, SangKeun Lee:
MolTRES: Improving Chemical Language Representation Learning for Molecular Property Prediction. 14241-14254 - Yoichi Aoki, Keito Kudo, Tatsuki Kuribayashi, Shusaku Sone, Masaya Taniguchi, Keisuke Sakaguchi, Kentaro Inui:
First Heuristic Then Rational: Dynamic Use of Heuristics in Language Model Reasoning. 14255-14271 - Jimin Sun, So Yeon Min, Yingshan Chang, Yonatan Bisk:
Tools Fail: Detecting Silent Errors in Faulty Tools. 14272-14289 - Bowen Zhang, Chunping Li:
Pcc-tuning: Breaking the Contrastive Learning Ceiling in Semantic Textual Similarity. 14290-14302 - Deokhyung Kang, Seonjeong Hwang, Yunsu Kim, Gary Geunbae Lee:
Cross-lingual Back-Parsing: Utterance Synthesis from Meaning Representation for Zero-Resource Semantic Parsing. 14303-14317 - Georgios Pantazopoulos, Malvina Nikandrou, Alessandro Suglia, Oliver Lemon, Arash Eshghi:
Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling. 14318-14337 - Fengjun Pan, Xiaobao Wu, Zongrui Li, Anh Tuan Luu:
Are LLMs Good Zero-Shot Fallacy Classifiers? 14338-14364 - Yuxiang Zhou, Jiazheng Li, Yanzheng Xiang, Hanqi Yan, Lin Gui, Yulan He:
The Mystery of In-Context Learning: A Comprehensive Survey on Interpretation and Analysis. 14365-14378 - Dominik Schlechtweg, Pierluigi Cassotti, Bill Noble, David Alfter, Sabine Schulte im Walde, Nina Tahmasebi:
More DWUGs: Extending and Evaluating Word Usage Graph Datasets in Multiple Languages. 14379-14393 - Ming Li, Jike Zhong, Chenxin Li, Liuzhuozheng Li, Nie Lin, Masashi Sugiyama:
Vision-Language Model Fine-Tuning via Simple Parameter-Efficient Modification. 14394-14410 - Arpan Phukan, Manish Gupta, Asif Ekbal:
ECIS-VQG: Generation of Entity-centric Information-seeking Questions from Videos. 14411-14436 - Elaf Alhazmi, Quan Sheng, Wei Emma Zhang, Munazza Zaib, Ahoud Alhazmi:
Distractor Generation in Multiple-Choice Tasks: A Survey of Methods, Datasets, and Evaluation. 14437-14458 - William Merrill, Noah A. Smith, Yanai Elazar:
Evaluating n-Gram Novelty of Language Models Using Rusty-DAWG. 14459-14473 - Kayo Yin, Chinmay Singh, Fyodor Minakov, Vanessa Milan, Hal Daumé III, Cyril Zhang, Alex Lu, Danielle Bragg:
ASL STEM Wiki: Dataset and Benchmark for Interpreting STEM Articles. 14474-14490 - Sweta Agrawal, António Farinhas, Ricardo Rei, André F. T. Martins:
Can Automatic Metrics Assess High-Quality Translations? 14491-14502 - Sweta Agrawal, José Guilherme Camargo de Souza, Ricardo Rei, António Farinhas, Gonçalo Rui Alves Faria, Patrick Fernandes, Nuno Miguel Guerreiro, André Martins:
Modeling User Preferences with Automatic Metrics: Creating a High-Quality Preference Dataset for Machine Translation. 14503-14519 - Bowen Xing, Lizi Liao, Minlie Huang, Ivor W. Tsang:
DC-Instruct: An Effective Framework for Generative Multi-intent Spoken Language Understanding. 14520-14534 - Yougang Lyu, Lingyong Yan, Shuaiqiang Wang, Haibo Shi, Dawei Yin, Pengjie Ren, Zhumin Chen, Maarten de Rijke, Zhaochun Ren:
KnowTuning: Knowledge-aware Fine-tuning for Large Language Models. 14535-14556 - Boyu Zhang, Tianyu Du, Junkai Tong, Xuhong Zhang, Kingsum Chow, Sheng Cheng, Xun Wang, Jianwei Yin:
SecCoder: Towards Generalizable and Robust Secure Code Generation. 14557-14571 - Ziqi Zhang, Cunxiang Wang, Xiao Xiong, Yue Zhang, Donglin Wang:
Nash CoT: Multi-Path Inference with Preference Equilibrium. 14572-14587 - Xingtai Lv, Ning Ding, Kaiyan Zhang, Ermo Hua, Ganqu Cui, Bowen Zhou:
Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention. 14588-14599 - Xiaoxue Cheng, Junyi Li, Xin Zhao, Hongzhi Zhang, Fuzheng Zhang, Di Zhang, Kun Gai, Ji-Rong Wen:
Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector. 14600-14615 - Wei Li, Zhen Huang, Xinmei Tian, Le Lu, Houqiang Li, Xu Shen, Jieping Ye:
Interpretable Composition Attribution Enhancement for Visio-linguistic Compositional Understanding. 14616-14632 - Akash Gupta, Ivaxi Sheth, Vyas Raina, Mark J. F. Gales, Mario Fritz:
LLM Task Interference: An Initial Study on the Impact of Task-Switch in Conversational History. 14633-14652 - Marta Marchiori Manerba, Karolina Stanczak, Riccardo Guidotti, Isabelle Augenstein:
Social Bias Probing: Fairness Benchmarking for Language Models. 14653-14671 - Wenhao Yu, Hongming Zhang, Xiaoman Pan, Peixin Cao, Kaixin Ma, Jian Li, Hongwei Wang, Dong Yu:
Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models. 14672-14685 - Jiabao Pan, Yan Zhang, Chen Zhang, Zuozhu Liu, Hongwei Wang, Haizhou Li:
DynaThink: Fast or Slow? A Dynamic Decision-Making Framework for Large Language Models. 14686-14695 - Yuqi Wang, Lyuhao Chen, Songcheng Cai, Zhijian Xu, Yilun Zhao:
Revisiting Automated Evaluation for Long-form Table Question Answering. 14696-14706 - Italo Luis da Silva, Hanqi Yan, Lin Gui, Yulan He:
Weak Reward Model Transforms Generative Models into Robust Causal Event Extraction Systems. 14707-14719 - Zhihan Zhang, Tao Ge, Zhenwen Liang, Wenhao Yu, Dian Yu, Mengzhao Jia, Dong Yu, Meng Jiang:
Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning. 14720-14738 - Yilun Zhao, Yitao Long, Tintin Jiang, Chengye Wang, Weiyuan Chen, Hongjun Liu, Xiangru Tang, Yiming Zhang, Chen Zhao, Arman Cohan:
FinDVer: Explainable Claim Verification over Long and Hybrid-content Financial Documents. 14739-14752 - Collin Zhang, John X. Morris, Vitaly Shmatikov:
Extracting Prompts by Inverting LLM Outputs. 14753-14777 - Zhiting Fan, Ruizhe Chen, Ruiling Xu, Zuozhu Liu:
BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs. 14778-14790 - Jiliang Hu, Zuchao Li, Ping Wang, Haojun Ai, Lefei Zhang, Hai Zhao:
VHASR: A Multimodal Speech Recognition System With Vision Hotwords. 14791-14804 - Naaman Tan, Josef Valvoda, Tianyu Liu, Anej Svete, Yanxia Qin, Min-Yen Kan, Ryan Cotterell:
A Probability-Quality Trade-off in Aligned Language Models and its Relation to Sampling Adaptors. 14805-14829 - Yaoke Wang, Yun Zhu, Wenqiao Zhang, Yueting Zhuang, Liyunfei Liyunfei, Siliang Tang:
Bridging Local Details and Global Context in Text-Attributed Graphs. 14830-14841 - Felermino Dario Mario Ali, Henrique Lopes Cardoso, Rui Sousa-Silva:
Building Resources for Emakhuwa: Machine Translation and News Classification Benchmarks. 14842-14857 - Mohammad Modarres, Sina Abbasi, Mohammad Taher Pilehvar:
RepMatch: Quantifying Cross-Instance Similarities in Representation Space. 14858-14869 - Xiusheng Huang, Yequan Wang, Jun Zhao, Kang Liu:
Commonsense Knowledge Editing Based on Free-Text in LLMs. 14870-14880 - Sagi Pendzel, Nir Lotan, Alon Zoizner, Einat Minkov:
A Closer Look at Multidimensional Online Political Incivility. 14881-14896 - Zetong Li, Qinliang Su, Shijing Si, Jianxing Yu:
Leveraging BERT and TFIDF Features for Short Text Clustering via Alignment-Promoting Co-Training. 14897-14913 - Bar Iluz, Yanai Elazar, Asaf Yehudai, Gabriel Stanovsky:
Applying Intrinsic Debiasing on Downstream Tasks: Challenges and Considerations for Machine Translation. 14914-14921 - Debarghya Datta, Soumajit Pramanik:
Unsupervised Named Entity Disambiguation for Low Resource Domains. 14922-14928 - Viktoria Chekalina, Anna Rudenko, Gleb Mezentsev, Aleksandr Mikhalev, Alexander Panchenko, Ivan V. Oseledets:
SparseGrad: A Selective Method for Efficient Fine-tuning of MLP Layers. 14929-14939 - Qingyang Li, Yanru Zhong, Yuchu Qin:
MoCoKGC: Momentum Contrast Entity Encoding for Knowledge Graph Completion. 14940-14952 - Ying Su, Zhan Ling, Haochen Shi, Cheng Jiayang, Yauwai Yim, Yangqiu Song:
ActPlan-1K: Benchmarking the Procedural Planning Ability of Visual Language Models in Household Activities. 14953-14965 - Xiaopeng Xie, Ming Yan, Xiwen Zhou, Chenlong Zhao, Suli Wang, Yong Zhang, Joey Zhou:
Shortcuts Arising from Contrast: Towards Effective and Lightweight Clean-Label Attacks in Prompt-Based Learning. 14966-14977 - Aashiq Muhamed, Oscar Li, David P. Woodruff, Mona Diab, Virginia Smith:
GRASS: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients. 14978-15003 - Weike Zhao, Chaoyi Wu, Xiaoman Zhang, Ya Zhang, Yanfeng Wang, Weidi Xie:
RaTEScore: A Metric for Radiology Report Generation. 15004-15019 - Shayan Ali Akbar, Md Mosharaf Hossain, Tess Wood, Si-Chi Chin, Erica Salinas, Victor Alvarez, Erwin Cornejo:
HalluMeasure: Fine-grained Hallucination Measurement Using Chain-of-Thought Reasoning. 15020-15037 - Sajad Sotudeh, Nazli Goharian:
Learning to Rank Salient Content for Query-focused Summarization. 15038-15048 - Qian Ruan, Ilia Kuznetsov, Iryna Gurevych:
Are Large Language Models Good Classifiers? A Study on Edit Intent Classification in Scientific Document Revisions. 15049-15067 - Anirudh Ajith, Mengzhou Xia, Alexis Chevalier, Tanya Goyal, Danqi Chen, Tianyu Gao:
LitSearch: A Retrieval Benchmark for Scientific Literature Search. 15068-15083 - Xintong Li, Jinya Jiang, Ria Dharmani, Jayanth Srinivasa, Gaowen Liu, Jingbo Shang:
Open-world Multi-label Text Classification with Extremely Weak Supervision. 15084-15096 - Toni J. B. Liu, Nicolas Boullé, Raphaël Sarfati, Christopher J. Earls:
LLMs learn governing principles of dynamical systems, revealing an in-context neural scaling law. 15097-15117 - Xiaobao Wu, Liangming Pan, William Yang Wang, Anh Tuan Luu:
AKEW: Assessing Knowledge Editing in the Wild. 15118-15133 - Tong Chen, Akari Asai, Niloofar Mireshghallah, Sewon Min, James Grimmelmann, Yejin Choi, Hannaneh Hajishirzi, Luke Zettlemoyer, Pang Wei Koh:
CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation. 15134-15158 - Tong Chen, Hongwei Wang, Sihao Chen, Wenhao Yu, Kaixin Ma, Xinran Zhao, Hongming Zhang, Dong Yu:
Dense X Retrieval: What Retrieval Granularity Should We Use? 15159-15177 - Yanchen Liu, Mingyu Derek Ma, Wenna Qin, Azure Zhou, Jiaao Chen, Weiyan Shi, Wei Wang, Diyi Yang:
Decoding Susceptibility: Modeling Misbelief to Misinformation Through a Computational Approach. 15178-15194 - Zheng Zhao, Yftah Ziser, Shay B. Cohen:
Layer by Layer: Uncovering Where Multi-Task Learning Happens in Instruction-Tuned Large Language Models. 15195-15214 - Beomseok Lee, Hyunwoo Kim, Keon Kim, Yong Suk Choi:
XDetox: Text Detoxification with Token-Level Toxicity Explanations. 15215-15226 - ZiHao Xiao, Jiefu Gong, Shijin Wang, Wei Song:
Optimizing Chinese Lexical Simplification Across Word Types: A Hybrid Approach. 15227-15239 - Bingxuan Li, Yiwei Wang, Tao Meng, Kai-Wei Chang, Nanyun Peng:
Control Large Language Models via Divide and Conquer. 15240-15256 - Chenyu Qiu, Pengjiang Qian, Chuang Wang, Jian Yao, Li Liu, Wei Fang, Eddie Eddie:
Joint Pre-Encoding Representation and Structure Embedding for Efficient and Low-Resource Knowledge Graph Completion. 15257-15269 - Lu Chen, Rui Zheng, Binghai Wang, Senjie Jin, Caishuang Huang, Junjie Ye, Zhihao Zhang, Yuhao Zhou, Zhiheng Xi, Tao Gui, Qi Zhang, Xuanjing Huang:
Improving Discriminative Capability of Reward Models in RLHF Using Contrastive Learning. 15270-15283 - Yuanzheng Wang, Yixing Fan, Jiafeng Guo, Ruqing Zhang, Xueqi Cheng:
RoCEL: Advancing Table Entity Linking through Distinctive Row and Column Contexts. 15284-15298 - Zi'ou Zheng, Christopher Malon, Martin Renqiang Min, Xiaodan Zhu:
Exploring the Role of Reasoning Structures for Constructing Proofs in Multi-Step Natural Language Reasoning with Large Language Models. 15299-15312 - Panuthep Tasawong, Peerat Limkonchotiwat, Potsawee Manakul, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong:
Efficient Overshadowed Entity Disambiguation by Mitigating Shortcut Learning. 15313-15321 - Hongru Wang, Rui Wang, Boyang Xue, Heming Xia, Jingtao Cao, Zeming Liu, Jeff Z. Pan, Kam-Fai Wong:
AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction. 15322-15336 - Zhipeng Chen, Kun Zhou, Xin Zhao, Jingyuan Wang, Ji-Rong Wen:
Not Everything is All You Need: Toward Low-Redundant Optimization for Large Language Model Alignment. 15337-15351 - Xiaoda Yang, Xize Cheng, Jiaqi Duan, Hongshun Qiu, Minjie Hong, Minghui Fang, Shengpeng Ji, Jialong Zuo, Zhiqing Hong, Zhimeng Zhang, Tao Jin:
AudioVSR: Enhancing Video Speech Recognition with Audio Data. 15352-15361 - Siddhant Waghjale, Vishruth Veerendranath, Zhiruo Wang, Daniel Fried:
ECCO: Can We Improve Model-Generated Code Efficiency Without Sacrificing Functional Correctness? 15362-15376 - Zhaopeng Feng, Ruizhe Chen, Yan Zhang, Zijie Meng, Zuozhu Liu:
Ladder: A Model-Agnostic Framework Boosting LLM-based Machine Translation to the Next Level. 15377-15393 - Zi-Yi Dou, Cheng-Fu Yang, Xueqing Wu, Kai-Wei Chang, Nanyun Peng:
Re-ReST: Reflection-Reinforced Self-Training for Language Agents. 15394-15411 - Shuhao Guan, Cheng Xu, Moule Lin, Derek Greene:
Effective Synthetic Data and Test-Time Adaptation for OCR Correction. 15412-15425 - Fu Zhang, Qi Miao, Jingwei Cheng, Hongsen Yu, Yi Yan, Xin Li, Yongxue Wu:
SRF: Enhancing Document-Level Relation Extraction with a Novel Secondary Reasoning Framework. 15426-15439 - Junzhuo Liu, Xuzheng Yang, Weiwei Li, Peng Wang:
FineCops-Ref: A new Dataset and Task for Fine-Grained Compositional Referring Expression Comprehension. 15440-15457 - Eitan Wagner, Amir Feder, Omri Abend:
Exploring the Learning Capabilities of Language Models using LEVERWORLDS. 15458-15468 - Eitan Wagner, Yuli Slavutsky, Omri Abend:
CONTESTS: a Framework for Consistency Testing of Span Probabilities in Language Models. 15469-15484 - Manan Suri, Puneet Mathur, Franck Dernoncourt, Rajiv Jain, Vlad I. Morariu, Ramit Sawhney, Preslav Nakov, Dinesh Manocha:
DocEdit-v2: Document Structure Editing Via Multimodal LLM Grounding. 15485-15505 - Tzu-Han Lin, Chen-An Li, Hung-yi Lee, Yun-Nung Chen:
DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging. 15506-15524 - Ifeoluwa Wuraola, Nina Dethlefs, Daniel Marciniak:
Understanding Slang with LLMs: Modelling Cross-Cultural Nuances through Paraphrasing. 15525-15531 - Lifu Tu, Semih Yavuz, Jin Qu, Jiacheng Xu, Rui Meng, Caiming Xiong, Yingbo Zhou:
Unlocking Anticipatory Text Generation: A Constrained Approach for Large Language Models Decoding. 15532-15548 - Xiaohan Xu, Chongyang Tao, Tao Shen, Can Xu, Hongbo Xu, Guodong Long, Jian-Guang Lou, Shuai Ma:
Re-Reading Improves Reasoning in Large Language Models. 15549-15575 - Qingcheng Zeng, Mingyu Jin, Rob Voigt:
Adaptive Axes: A Pipeline for In-domain Social Stereotype Analysis. 15576-15593 - Sourjyadip Ray, Kushal Gupta, Soumi Kundu, Payal Arvind Kasat, Somak Aditya, Pawan Goyal:
ERVQA: A Dataset to Benchmark the Readiness of Large Vision Language Models in Hospital Environments. 15594-15608 - Jiyi Li:
Human-LLM Hybrid Text Answer Aggregation for Crowd Annotations. 15609-15622 - Chengwei Dai, Kun Li, Wei Zhou, Songlin Hu:
Improve Student's Reasoning Generalizability through Cascading Decomposed CoTs Distillation. 15623-15643 - Junbo Huang, Ricardo Usbeck:
Revisiting Supervised Contrastive Learning for Microblog Classification. 15644-15653 - Rui Pu, Chaozhuo Li, Rui Ha, Litian Zhang, Lirong Qiu, Xi Zhang:
BaitAttack: Alleviating Intention Shift in Jailbreak Attacks via Adaptive Bait Crafting. 15654-15668 - Zhaotian Weng, Zijun Gao, Jerone Theodore Alexander Andrews, Jieyu Zhao:
Images Speak Louder than Words: Understanding and Mitigating Bias in Vision-Language Model from a Causal Mediation Perspective. 15669-15680 - Weichuan Wang, Zhaoyi Li, Defu Lian, Chen Ma, Linqi Song, Ying Wei:
Mitigating the Language Mismatch and Repetition Issues in LLM-based Machine Translation via Model Editing. 15681-15700 - Yubo Ma, Zhibin Gou, Junheng Hao, Ruochen Xu, Shuohang Wang, Liangming Pan, Yujiu Yang, Yixin Cao, Aixin Sun:
SciAgent: Tool-augmented Language Models for Scientific Reasoning. 15701-15736 - Dong Won Lee, Hae Park, Yoon Kim, Cynthia Breazeal, Louis-Philippe Morency:
Global Reward to Local Rewards: Multimodal-Guided Decomposition for Improving Dialogue Agents. 15737-15762 - Muhammad Farid Adilazuarda, Sagnik Mukherjee, Pradhyumna Lavania, Siddhant Singh, Alham Fikri Aji, Jacki O'Neill, Ashutosh Modi, Monojit Choudhury:
Towards Measuring and Modeling "Culture" in LLMs: A Survey. 15763-15784 - Haiquan Zhao, Lingyu Li, Shisong Chen, Shuqi Kong, Jiaan Wang, Kexin Huang, Tianle Gu, Yixu Wang, Jian Wang, Dandan Liang, Zhixu Li, Yan Teng, Yanghua Xiao, Yingchun Wang:
ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models. 15785-15810 - Sagnik Mukherjee, Muhammad Farid Adilazuarda, Sunayana Sitaram, Kalika Bali, Alham Fikri Aji, Monojit Choudhury:
Cultural Conditioning or Placebo? On the Effectiveness of Socio-Demographic Prompting. 15811-15837 - Xiao Yu, Kejiang Chen, Qi Yang, Weiming Zhang, Nenghai Yu:
Text Fluoroscopy: Detecting LLM-Generated Text through Intrinsic Features. 15838-15846 - Sarah Masud, Sahajpreet Singh, Viktor Hangya, Alexander Fraser, Tanmoy Chakraborty:
Hate Personified: Investigating the role of LLMs in content moderation. 15847-15863 - Ashutosh Bajpai, Aaryan Goyal, Atif Anwer, Tanmoy Chakraborty:
Temporally Consistent Factuality Probing for Large Language Models. 15864-15881 - Zihao Li, Shaoxiong Ji, Timothee Mickus, Vincent Segonne, Jörg Tiedemann:
A Comparison of Language Modeling and Translation as Multilingual Pretraining Objectives. 15882-15894 - Prasoon Bajpai, Niladri Chatterjee, Subhabrata Dutta, Tanmoy Chakraborty:
Can LLMs replace Neil deGrasse Tyson? Evaluating the Reliability of LLMs as Science Communicators. 15895-15912 - Tong Zhu, Xiaoye Qu, Daize Dong, Jiacheng Ruan, Jingqi Tong, Conghui He, Yu Cheng:
LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-Training. 15913-15923 - Xinyu Hu, Li Lin, Mingqi Gao, Xunjian Yin, Xiaojun Wan:
Themis: A Reference-free NLG Evaluation Language Model with Flexibility and Interpretability. 15924-15951 - Yiming Ju, Ziyi Ni, Xingrun Xing, Zhixiong Zeng, Hanyu Zhao, Siqi Fan, Zheng Zhang:
Mitigating Training Imbalance in LLM Fine-Tuning via Selective Parameter Merging. 15952-15959 - Sam Spilsbury, Pekka Marttinen, Alexander Ilin:
Generating Demonstrations for In-Context Compositional Generalization in Grounded Language Learning. 15960-15991 - Li Zeng, Yingyu Shan, Zeming Liu, Jiashu Yao, Yuhang Guo:
FAME: Towards Factual Multi-Task Model Editing. 15992-16011 - Renjie Pi, Tianyang Han, Jianshu Zhang, Yueqi Xie, Rui Pan, Qing Lian, Hanze Dong, Jipeng Zhang, Tong Zhang:
MLLM-Protector: Ensuring MLLM's Safety without Hurting Performance. 16012-16027 - Zhen Li, Xiaohan Xu, Tao Shen, Can Xu, Jia-Chen Gu, Yuxuan Lai, Chongyang Tao, Shuai Ma:
Leveraging Large Language Models for NLG Evaluation: Advances and Challenges. 16028-16045 - Minsoo Kim, Kyuhong Shim, Jungwook Choi, Simyung Chang:
InfiniPot: Infinite Context Processing on Memory-Constrained LLMs. 16046-16060 - Jiapeng Wang, Chengyu Wang, Kunzhe Huang, Jun Huang, Lianwen Jin:
VideoCLIP-XL: Advancing Long Description Understanding for Video CLIP Models. 16061-16075 - Suhas S. Kowshik, Abhishek Divekar, Vijit Malik:
CorrSynth - A Correlated Sampling Method for Diverse Dataset Generation from LLMs. 16076-16095 - Constanza Fierro, Ruchira Dhar, Filippos Stamatiou, Nicolas Garneau, Anders Søgaard:
Defining Knowledge: Bridging Epistemology and Large Language Models. 16096-16111 - Peiwen Jiang, Xinbo Lin, Zibo Zhao, Ruhui Ma, Yvonne Chen, Jinhua Cheng:
TKGT: Redefinition and A New Way of Text-to-Table Tasks Based on Real World Demands and Knowledge Graphs Augmented LLMs. 16112-16126 - Shihao Rao, Liang Li, Jiapeng Liu, Guan Weixin, Xiyan Gao, Bing Lim, Can Ma:
Free your mouse! Command Large Language Models to Generate Code to Format Word Documents. 16127-16142 - Jiawei Gu, Zacc Yang, Chuanghao Ding, Rui Zhao, Fei Tan:
CMR Scaling Law: Predicting Critical Mixture Ratios for Continual Pre-training of Language Models. 16143-16162 - Tianyang Han, Qing Lian, Rui Pan, Renjie Pi, Jipeng Zhang, Shizhe Diao, Yong Lin, Tong Zhang:
The Instinctive Bias: Spurious Images lead to Illusion in MLLMs. 16163-16177 - Akira Kawabata, Saku Sugawara:
Rationale-Aware Answer Verification by Pairwise Self-Evaluation. 16178-16196 - Xinbei Ma, Tianjie Ju, Jiyang Qiu, Zhuosheng Zhang, Hai Zhao, Lifeng Liu, Yulong Wang:
On the Robustness of Editing Large Language Models. 16197-16216 - Mihyeon Kim, Juhyoung Park, YoungBin Kim:
IM-BERT: Enhancing Robustness of BERT through the Implicit Euler Method. 16217-16229 - Zeguan Xiao, Yan Yang, Guanhua Chen, Yun Chen:
Distract Large Language Models for Automatic Jailbreak Attack. 16230-16244 - He-Zhe Lin, Cheng-Hung Liu, Chih-Jen Lin:
Exploring Space Efficiency in a Tree-based Linear Model for Extreme Multi-label Classification. 16245-16260 - Saif Mohammad:
WorryWords: Norms of Anxiety Association for over 44k English Words. 16261-16278 - Sumanth Doddapaneni, Mohammed Safi Ur Rahman Khan, Sshubam Verma, Mitesh M. Khapra:
Finding Blind Spots in Evaluator LLMs with Interpretable Checklists. 16279-16309 - Jun Zhao, Can Zu, Xu Hao, Yi Lu, Wei He, Yiwen Ding, Tao Gui, Qi Zhang, Xuanjing Huang:
LONGAGENT: Achieving Question Answering for 128k-Token-Long Documents through Multi-Agent Collaboration. 16310-16324 - Till Saenger, Musashi Hinck, Justin Grimmer, Brandon M. Stewart:
AutoPersuade: A Framework for Evaluating and Explaining Persuasive Arguments. 16325-16342 - Simone Conia, Daniel Lee, Min Li, Umar Farooq Minhas, Saloni Potdar, Yunyao Li:
Towards Cross-Cultural Machine Translation with Retrieval-Augmented Generation from Multilingual Knowledge Graphs. 16343-16360 - Jun Zhao, Jingqi Tong, Yurong Mou, Ming Zhang, Qi Zhang, Xuanjing Huang:
Exploring the Compositional Deficiency of Large Language Models in Mathematical Reasoning Through Trap Problems. 16361-16376 - Xuyang Shen, Dong Li, Ruitao Leng, Zhen Qin, Weigao Sun, Yiran Zhong:
Scaling Laws for Linear Complexity Language Models. 16377-16426 - Heejin Do, Sangwon Ryu, Gary Geunbae Lee:
Autoregressive Multi-trait Essay Scoring via Reinforcement Learning with Scoring-aware Multiple Rewards. 16427-16438 - Guangliang Liu, Haitao Mao, Jiliang Tang, Kristen Marie Johnson:
Intrinsic Self-correction for Enhanced Morality: An Analysis of Internal Mechanisms and the Superficial Hypothesis. 16439-16455 - Fu Zhang, Yifan Ding, Jingwei Cheng:
ATAP: Automatic Template-Augmented Commonsense Knowledge Graph Completion via Pre-Trained Language Models. 16456-16472 - Gurusha Juneja, Subhabrata Dutta, Tanmoy Chakraborty:
LM2: A Simple Society of Language Models Solves Complex Reasoning. 16473-16484 - Clara Meister, Mario Giulianelli, Tiago Pimentel:
Towards a Similarity-adjusted Surprisal Theory. 16485-16498 - Omar Adjali, Olivier Ferret, Sahar Ghannay, Hervé Le Borgne:
Multi-Level Information Retrieval Augmented Generation for Knowledge-based Visual Question Answering. 16499-16513 - Jianfeng He, Runing Yang, Linlin Yu, Changbin Li, Ruoxi Jia, Feng Chen, Ming Jin, Chang-Tien Lu:
Can We Trust the Performance Evaluation of Uncertainty Estimation Methods in Text Summarization? 16514-16575 - Omer Goldman, Alon Jacovi, Aviv Slobodkin, Aviya Maimon, Ido Dagan, Reut Tsarfaty:
Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP. 16576-16586 - Pavel Chizhov, Catherine Arnett, Elizaveta Korotkova, Ivan P. Yamshchikov:
BPE Gets Picky: Efficient Vocabulary Refinement During Tokenizer Training. 16587-16604 - Wei Shi, Shuang Li, Kerun Yu, Jinglei Chen, Zujie Liang, Xinhui Wu, Yuxi Qian, Feng Wei, Bo Zheng, Jiaqing Liang, Jiangjie Chen, Yanghua Xiao:
SEGMENT+: Long Text Processing with Short-Context Language Models. 16605-16617 - Zhangyue Yin, Qiushi Sun, Qipeng Guo, Zhiyuan Zeng, Qinyuan Cheng, Xipeng Qiu, Xuanjing Huang:
Explicit Memory Learning with Expectation Maximization. 16618-16635 - Inderjeet Nair, Jiaye Tan, Xiaotian Su, Anne Gere, Xu Wang, Lu Wang:
Closing the Loop: Learning to Generate Writing Feedback via Language Model Simulated Student Revisions. 16636-16657 - Weizhou Shen, Chenliang Li, Hongzhan Chen, Ming Yan, Xiaojun Quan, Hehong Chen, Ji Zhang, Fei Huang:
Small LLMs Are Weak Tool Learners: A Multi-LLM Agent. 16658-16680 - Clement Neo, Shay B. Cohen, Fazl Barez:
Interpreting Context Look-ups in Transformers: Investigating Attention-MLP Interactions. 16681-16697 - Amey Hengle, Atharva Kulkarni, Shantanu Patankar, Madhumitha Chandrasekaran, Sneha D'Silva, Jemima Jacob, Rashmi Gupta:
Still Not Quite There! Evaluating Large Language Models for Comorbid Mental Health Diagnosis. 16698-16721 - Shaobo Cui, Zhijing Jin, Bernhard Schölkopf, Boi Faltings:
The Odyssey of Commonsense Causality: From Foundational Benchmarks to Cutting-Edge Reasoning. 16722-16763 - Razvan-Alexandru Smadu, David-Gabriel Ion, Dumitru-Clementin Cercel, Florin Pop, Mihaela-Claudia Cercel:
Investigating Large Language Models for Complex Word Identification in Multilingual and Multidomain Setups. 16764-16800 - Jia-Chen Gu, Hao-Xiang Xu, Jun-Yu Ma, Pan Lu, Zhen-Hua Ling, Kai-Wei Chang, Nanyun Peng:
Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue. 16801-16819 - Divya Patel, Pathik Patel, Ankush Chander, Sourish Dasgupta, Tanmoy Chakraborty:
Are Large Language Models In-Context Personalized Summarizers? Get an iCOPERNICUS Test Done! 16820-16842 - Vishal Vivek Saley, Goonjan Saha, Rocktim Jyoti Das, Dinesh Raghu, Mausam:
MediTOD: An English Dialogue Dataset for Medical History Taking with Comprehensive Annotations. 16843-16877 - Abhilash Nandy, Yash Agarwal, Ashish Patwa, Millon Madhur Das, Aman Bansal, Ankit Raj, Pawan Goyal, Niloy Ganguly:
***YesBut***: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models. 16878-16895 - Chunhui Zhang, Yiren Jian, Zhongyu Ouyang, Soroush Vosoughi:
Working Memory Identifies Reasoning Limits in Language Models. 16896-16922 - James Wang, Ran Li, Junfeng Yang, Chengzhi Mao:
RAFT: Realistic Attacks to Fool Text Detectors. 16923-16936 - Jiaxuan You, Mingjie Liu, Shrimai Prabhumoye, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro:
LLM-Evolve: Evaluation for LLM's Evolving Capability on Benchmarks. 16937-16942 - Ajay Jaiswal, Bodun Hu, Lu Yin, Yeonju Ro, Tianlong Chen, Shiwei Liu, Aditya Akella:
FFN-SkipLLM: A Hidden Gem for Autoregressive Decoding with Adaptive Feed Forward Skipping. 16943-16956 - Tom Potter, Zheng Yuan:
LLM-based Code-Switched Text Generation for Grammatical Error Correction. 16957-16965 - Mehrdad Farahani, Richard Johansson:
Deciphering the Interplay of Parametric and Non-parametric Memory in Retrieval-augmented Language Models. 16966-16977 - Geewook Kim, Minjoon Seo:
On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning. 16978-17000 - Zihao He, Minh Duc Chu, Rebecca Dorn, Siyi Guo, Kristina Lerman:
Community-Cross-Instruct: Unsupervised Instruction Generation for Aligning Large Language Models to Online Communities. 17001-17019 - Eldar Kurtic, Amir Moeini, Dan Alistarh:
Mathador-LM: A Dynamic Benchmark for Mathematical Reasoning on Large Language Models. 17020-17027 - Yuan-Hong Liao, Rafid Mahmood, Sanja Fidler, David Acuna:
Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning in Large Vision-Language Models. 17028-17047 - Marzena Karpinska, Katherine Thai, Kyle Lo, Tanya Goyal, Mohit Iyyer:
One Thousand and One Pairs: A "novel" challenge for long-context language models. 17048-17085 - Tu Vu, Kalpesh Krishna, Salaheddin Alzubi, Chris Tar, Manaal Faruqui, Yun-Hsuan Sung:
Foundational Autoraters: Taming Large Language Models for Better Automatic Evaluation. 17086-17105 - John T. Hale, Milos Stanojevic:
Do LLMs learn a true syntactic universal? 17106-17119 - Oh Joon Kwon, Daiki E. Matsunaga, Kee-Eung Kim:
GDPO: Learning to Directly Align Language Models with Diversity Using GFlowNets. 17120-17139 - Kai Chen, Zihao He, Jun Yan, Taiwei Shi, Kristina Lerman:
How Susceptible are Large Language Models to Ideological Manipulation? 17140-17161 - Fabrice Harel-Canada, Hanyu Zhou, Sreya Muppalla, Zeynep Yildiz, Miryung Kim, Amit Sahai, Nanyun Peng:
Measuring Psychological Depth in Language Models. 17162-17196 - Jin Zhao, Jingxuan Tu, Han Du, Nianwen Xue:
Media Attitude Detection via Framing Analysis with Events and their Relations. 17197-17210 - Yang Ba, Michelle Mancenido, Rong Pan:
Fill In The Gaps: Model Calibration and Generalization with Synthetic Data. 17211-17225 - Sagi Shaier, Ari Kobren, Philip V. Ogren:
Adaptive Question Answering: Enhancing Language Model Proficiency for Addressing Knowledge Conflicts with Source Citations. 17226-17239 - Ethan Mendes, Yang Chen, James Hays, Sauvik Das, Wei Xu, Alan Ritter:
Granular Privacy Control for Geolocation with Vision Language Models. 17240-17292 - Chao Jiang, Wei Xu:
MedReadMe: A Systematic Study for Fine-grained Sentence Readability in Medical Domain. 17293-17319 - Siddhant Bikram Shah, Shuvam Shiwakoti, Maheep Chaudhary, Haohan Wang:
MemeCLIP: Leveraging CLIP Representations for Multimodal Meme Classification. 17320-17332 - Mingye Zhu, Yi Liu, Quan Wang, Junbo Guo, Zhendong Mao:
FlipGuard: Defending Preference Alignment against Update Regression with Constrained Optimization. 17333-17350 - Jiaju Chen, Yuxuan Lu, Shao Zhang, Bingsheng Yao, Yuanzhe Dong, Ying Xu, Yunyao Li, Qianwen Wang, Dakuo Wang, Yuling Sun:
StorySparkQA: Expert-Annotated QA Pairs with Real-World Knowledge for Children's Story-Based Learning. 17351-17370 - Jiaxiang Liu, Yuan Wang, Jiawei Du, Joey Zhou, Zuozhu Liu:
MedCoT: Medical Chain of Thought via Hierarchical Expert. 17371-17389 - Ziyong Lin, Quansen Wang, Zixia Jia, Zilong Zheng:
Varying Sentence Representations via Condition-Specified Routers. 17390-17401 - Jiao Ou, Jiayu Wu, Che Liu, Fuzheng Zhang, Di Zhang, Kun Gai:
Inductive-Deductive Strategy Reuse for Multi-Turn Instructional Dialogues. 17402-17431 - Javier Ferrando, Elena Voita:
Information Flow Routes: Automatically Interpreting Language Models at Scale. 17432-17445 - Houquan Zhou, Zhenghua Li, Bo Zhang, Chen Li, Shaopeng Lai, Ji Zhang, Fei Huang, Min Zhang:
A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction Based on Large Language Models. 17446-17467 - Qin Dai, Benjamin Heinzerling, Kentaro Inui:
Representational Analysis of Binding in Language Models. 17468-17493 - Erxin Yu, Jing Li, Ming Liao, Siqi Wang, Zuchen Gao, Fei Mi, Lanqing Hong:
CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference. 17494-17508 - Tobias Schimanski, Jingwei Ni, Roberto Martín, Nicola Ranger, Markus Leippold:
ClimRetrieve: A Benchmarking Dataset for Information Retrieval from Corporate Climate Disclosures. 17509-17524 - Liu Ran, Zhongzhou Liu, Xiaoli Li, Yuan Fang:
Context-Aware Adapter Tuning for Few-Shot Relation Learning in Knowledge Graphs. 17525-17537 - Shixuan Ma, Quan Wang:
Zero-Shot Detection of LLM-Generated Text using Token Cohesiveness. 17538-17553 - Zhanpeng Chen, Zhihong Zhu, Xianwei Zhuang, Zhiqi Huang, Yuexian Zou:
Dual-oriented Disentangled Network with Counterfactual Intervention for Multimodal Intent Detection. 17554-17567 - Siyuan Wang, Zhuohan Long, Zhihao Fan, Zhongyu Wei:
From LLMs to MLLMs: Exploring the Landscape of Multimodal Jailbreaking. 17568-17582 - Siyuan Wang, Zhongyu Wei, Yejin Choi, Xiang Ren:
Symbolic Working Memory Enhances Language Models for Complex Rule Application. 17583-17604 - Sijun Tan, Xiuyu Li, Shishir G. Patil, Ziyang Wu, Tianjun Zhang, Kurt Keutzer, Joseph Gonzalez, Raluca A. Popa:
LLoCO: Learning Long Contexts Offline. 17605-17621 - Xin Mao, Feng-Lin Li, Huimin Xu, Wei Zhang, Wang Chen, Anh Tuan Luu:
Don't Forget Your Reward Values: Language Model Alignment via Value-based Calibration. 17622-17642 - Hojae Lee, Junho Kim, SangKeun Lee:
Mentor-KD: Making Small Language Models Better Multi-step Reasoners. 17643-17658 - Yufei Tian, Tenghao Huang, Miri Liu, Derek Jiang, Alexander Spangher, Muhao Chen, Jonathan May, Nanyun Peng:
Are Large Language Models Capable of Generating Human-Level Narratives? 17659-17681 - Yerin Hwang, Yongil Kim, Yunah Jang, Jeesoo Bang, Hyunkyung Bae, Kyomin Jung:
MP2D: An Automated Topic Shift Dialogue Generation Framework Leveraging Knowledge Graphs. 17682-17702 - Haohui Lu, Usman Naseem:
Can Large Language Models Enhance Predictions of Disease Progression? Investigating Through Disease Network Link Prediction. 17703-17715 - Xiaohua Wang, Zhenghua Wang, Xuan Gao, Feiran Zhang, Yixin Wu, Zhibo Xu, Tianyuan Shi, Zhengyuan Wang, Shizheng Li, Qi Qian, Ruicheng Yin, Changze Lv, Xiaoqing Zheng, Xuanjing Huang:
Searching for Best Practices in Retrieval-Augmented Generation. 17716-17736 - Marwa Abdulhai, Gregory Serapio-García, Clément Crepy, Daria Valter, John Canny, Natasha Jaques:
Moral Foundations of Large Language Models. 17737-17752 - Hellina Hailu Nigatu, Atnafu Lambebo Tonja, Benjamin Rosman, Thamar Solorio, Monojit Choudhury:
The Zeno's Paradox of 'Low-Resource' Languages. 17753-17774 - Aseem Srivastava, Smriti Joshi, Tanmoy Chakraborty, Md. Shad Akhtar:
Knowledge Planning in Large Language Models for Domain-Aligned Counseling Summarization. 17775-17789 - Pritika Ramu, Koustava Goswami, Apoorv Saxena, Balaji Vasan Srinivasan:
Enhancing Post-Hoc Attributions in Long Document Comprehension via Coarse Grained Answer Decomposition. 17790-17806 - Yusuke Hirota, Ryo Hachiuma, Chao-Han Huck Yang, Yuta Nakashima:
From Descriptive Richness to Bias: Unveiling the Dark Side of Generative Image Caption Enrichment. 17807-17816 - Deyuan Liu, Zhanyue Qin, Hairu Wang, Zhao Yang, Zecheng Wang, Fangying Rong, Qingbin Liu, Yanchao Hao, Bo Li, Xi Chen, Cunhang Fan, Zhao Lv, Dianhui Chu, Zhiying Tu, Dianbo Sui:
Pruning via Merging: Compressing LLMs via Manifold Alignment Based Layer Merging. 17817-17829 - Nicholas Popovic, Michael Färber:
Embedded Named Entity Recognition using Probing Classifiers. 17830-17850 - Zhou Zhang, Dongzeng Tan, Jiaan Wang, Yilong Chen, Jiarong Xu:
Unleashing the Power of Emojis in Texts via Self-supervised Graph Pre-Training. 17851-17863 - Feng Yao, Yufan Zhuang, Zihao Sun, Sunan Xu, Animesh Kumar, Jingbo Shang:
Data Contamination Can Cross Language Barriers. 17864-17875 - Shengjie Li, Vincent Ng:
Automated Essay Scoring: A Reflection on the State of the Art. 17876-17888 - Tian Liang, Zhiwei He, Wenxiang Jiao, Xing Wang, Yan Wang, Rui Wang, Yujiu Yang, Shuming Shi, Zhaopeng Tu:
Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate. 17889-17904 - Xin Zhou, Ping Nie, Yiwen Guo, Haojie Wei, Zhanqiu Zhang, Pasquale Minervini, Ruotian Ma, Tao Gui, Qi Zhang, Xuanjing Huang:
Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs. 17905-17923 - Migyeong Kang, Goun Choi, Hyolim Jeon, Ji Hyun An, Daejin Choi, Jinyoung Han:
CURE: Context- and Uncertainty-Aware Mental Disorder Detection. 17924-17940 - Yakun Yu, Shiang Qi, Baochun Li, Di Niu:
PepRec: Progressive Enhancement of Prompting for Recommendation. 17941-17953 - Chuanhao Li, Chenchen Jing, Zhen Li, Mingliang Zhai, Yuwei Wu, Yunde Jia:
In-Context Compositional Generalization for Large Vision-Language Models. 17954-17966 - Xiaowei Yuan, Zhao Yang, Yequan Wang, Jun Zhao, Kang Liu:
Improving Zero-shot LLM Re-Ranker with Risk Minimization. 17967-17983 - Xianwei Zhuang, Zhihong Zhu, Zhanpeng Chen, Yuxin Xie, Liming Liang, Yuexian Zou:
Game on Tree: Visual Hallucination Mitigation via Coarse-to-Fine View Tree and Game Theory. 17984-18003 - Xin Ying Qiu, Jingshen Zhang:
Label Confidence Weighted Learning for Target-level Sentence Simplification. 18004-18019 - Wenduan Xu, Stephen Clark, Douglas Brown, Gabriel Matos, Konstantinos Meichanetzidis:
Quantum Recurrent Architectures for Text Classification. 18020-18027 - Armel Zebaze, Benoît Sagot, Rachel Bawden:
Tree of Problems: Improving structured problem solving with compositionality. 18028-18047 - Beatrice Savoldi, Sara Papi, Matteo Negri, Ana Guerberof Arenas, Luisa Bentivogli:
What the Harm? Quantifying the Tangible Impact of Gender Bias in Machine Translation with a Human-centered Study. 18048-18076 - Zichao Li, Shaojie He, Meng Liao, Xuanang Chen, Yaojie Lu, Hongyu Lin, Yanxiong Lu, Xianpei Han, Le Sun:
Seg2Act: Global Context-aware Action Generation for Document Logical Structuring. 18077-18088 - Abhinav Bandari, Lu Yin, Cheng-Yu Hsieh, Ajay Jaiswal, Tianlong Chen, Li Shen, Ranjay Krishna, Shiwei Liu:
Is C4 Dataset Optimal for Pruning? An Investigation of Calibration Data for LLM Pruning. 18089-18099 - Saksham Rastogi, Danish Pruthi:
Revisiting the Robustness of Watermarking to Paraphrasing Attacks. 18100-18110 - Jinggui Liang, Yuxia Wu, Yuan Fang, Hao Fei, Lizi Liao:
A Survey of Ontology Expansion for Conversational Understanding. 18111-18127 - Johnathan Xie, Annie S. Chen, Yoonho Lee, Eric Mitchell, Chelsea Finn:
Calibrating Language Models with Adaptive Temperature Scaling. 18128-18138 - Fumiya Uchiyama, Takeshi Kojima, Andrew Gambardella, Qi Cao, Yusuke Iwasawa, Yutaka Matsuo:
Which Programming Language and What Features at Pre-training Stage Affect Downstream Logical Inference Performance? 18139-18149 - Eleonora Gualdoni, Gemma Boleda:
Why do objects have many names? A study on word informativeness in language use and lexical systems. 18150-18163 - Songming Zhang, Xue Zhang, Zengkui Sun, Yufeng Chen, Jinan Xu:
Dual-Space Knowledge Distillation for Large Language Models. 18164-18181 - Elena Merdjanovska, Ansar Aynetdinov, Alan Akbik:
NoiseBench: Benchmarking the Impact of Real Label Noise on Named Entity Recognition. 18182-18198 - Junteng Liu, Shiqi Chen, Yu Cheng, Junxian He:
On the Universal Truthfulness Hyperplane Inside LLMs. 18199-18224 - Chao-Wei Huang, Yun-Nung Chen:
PairDistill: Pairwise Relevance Distillation for Dense Retrieval. 18225-18237 - Nikhil Kandpal, Krishna Pillutla, Alina Oprea, Peter Kairouz, Christopher A. Choquette-Choo, Zheng Xu:
User Inference Attacks on Large Language Models. 18238-18265 - Yongkang Liu, Yiqun Zhang, Qian Li, Tong Liu, Shi Feng, Daling Wang, Yifei Zhang, Hinrich Schütze:
HiFT: A Hierarchical Full Parameter Fine-Tuning Strategy. 18266-18287 - Yufang Liu, Tao Ji, Changzhi Sun, Yuanbin Wu, Aimin Zhou:
Investigating and Mitigating Object Hallucinations in Pretrained Vision-Language (CLIP) Models. 18288-18301 - Matthew Raffel, Victor Agostinelli, Lizhong Chen:
Simultaneous Masking, Not Prompting Optimization: A Paradigm Shift in Fine-tuning LLMs for Simultaneous Translation. 18302-18314 - Qinzhuo Wu, Wei Liu, Jian Luan, Bin Wang:
ToolPlanner: A Tool Augmented LLM for Multi Granularity Instructions with Path Planning and Feedback. 18315-18339 - Esra Dönmez, Thang Vu, Agnieszka Falenska:
Please note that I'm just an AI: Analysis of Behavior Patterns of LLMs in (Non-)offensive Speech Identification. 18340-18357 - Tiago Pimentel, Clara Meister:
How to Compute the Probability of a Word. 18358-18375 - Elie Antoine, Frédéric Béchet, Géraldine Damnati, Philippe Langlais:
A linguistically-motivated evaluation methodology for unraveling model's abilities in reading comprehension tasks. 18376-18392 - Elias Bassani, Ignacio Sanchez:
GuardBench: A Large-Scale Benchmark for Guardrail Models. 18393-18409 - Yao Xu, Shizhu He, Jiabei Chen, Zihao Wang, Yangqiu Song, Hanghang Tong, Guang Liu, Jun Zhao, Kang Liu:
Generate-on-Graph: Treat LLM as both Agent and KG for Incomplete Knowledge Graph Question Answering. 18410-18430 - Gabriele Merlin, Mariya Toneva:
Language models and brains align due to more than next-word prediction and word-level information. 18431-18454 - Zijin Feng, Luyang Lin, Lingzhi Wang, Hong Cheng, Kam-Fai Wong:
LLMEdgeRefine: Enhancing Text Clustering with LLM-Based Boundary Point Refinement. 18455-18462 - Ekaterina Sviridova, Anar Yeginbergen, Ainara Estarrona, Elena Cabrio, Serena Villata, Rodrigo Agerri:
CasiMedicos-Arg: A Medical Question Answering Dataset Annotated with Explanatory Argumentative Structures. 18463-18475 - Alessio Devoto, Yu Zhao, Simone Scardapane, Pasquale Minervini:
A Simple and Effective L_2 Norm-Based Strategy for KV Cache Compression. 18476-18499 - Linhao Zhang, Jintao Liu, Li Jin, Hao Wang, Kaiwen Wei, Guangluan Xu:
GOME: Grounding-based Metaphor Binding With Conceptual Elaboration For Figurative Language Illustration. 18500-18510 - Aida Mostafazadeh Davani, Mark Diaz, Dylan K. Baker, Vinodkumar Prabhakaran:
D3CODE: Disentangling Disagreements in Data across Cultures on Offensiveness Detection and Evaluation. 18511-18526 - Asif Hanif, Maha Tufail Agro, Mohammad Areeb Qazi, Hanan Aldarmaki:
PALM: Few-Shot Prompt Learning for Audio Language Models. 18527-18536 - Michiel van der Meer, Neele Falk, Pradeep K. Murukannaiah, Enrico Liscio:
Annotator-Centric Active Learning for Subjective NLP Tasks. 18537-18555 - Mario Giulianelli, Luca Malagutti, Juan Luis Gastaldi, Brian DuSell, Tim Vieira, Ryan Cotterell:
On the Proper Treatment of Tokenization in Psycholinguistics. 18556-18572 - Anas Himmi, Guillaume Staerman, Marine Picot, Pierre Colombo, Nuno Guerreiro:
Enhanced Hallucination Detection in Neural Machine Translation through Simple Detector Aggregation. 18573-18583 - Mansour Al Ghanim, Saleh Almohaimeed, Mengxin Zheng, Yan Solihin, Qian Lou:
Jailbreaking LLMs with Arabic Transliteration and Arabizi. 18584-18600 - Zara Siddique, Liam D. Turner, Luis Espinosa Anke:
Who is better at math, Jenny or Jingzhen? Uncovering Stereotypes in Large Language Models. 18601-18619 - Changho Lee, Janghoon Han, Seonghyeon Ye, Stanley Jungkyu Choi, Honglak Lee, Kyunghoon Bae:
Instruction Matters: A Simple yet Effective Task Selection for Optimized Instruction Tuning of Specific Tasks. 18620-18642 - Chenxi Lin, Jiayu Ren, Guoxiu He, Zhuoren Jiang, Haiyan Yu, Xiaomin Zhu:
Recurrent Alignment with Hard Attention for Hierarchical Text Rating. 18643-18657 - Junhui He, Shangyu Wu, Weidong Wen, Chun Jason Xue, Qingan Li:
CHESS: Optimizing LLM Inference via Channel-Wise Thresholding and Selective Sparsification. 18658-18668 - Yongjing Yin, Junran Ding, Kai Song, Yue Zhang:
Semformer: Transformer Language Models with Semantic Planning. 18669-18680 - Sameer Pimparkhede, Mehant Kammakomati, Srikanth Tamilselvam, Prince Kumar, Ashok Pon Kumar, Pushpak Bhattacharyya:
DocCGen: Document-based Controlled Code Generation. 18681-18697 - Giulio Zhou, Sydelle De Souza, Ella Markham, Oghenetekevwe Kwakpovwe, Sumin Zhao:
Semantics and Sentiment: Cross-lingual Variations in Emoji Use. 18698-18712 - Daniel Akkerman, Phong Le, Raquel G. Alhama:
The Emergence of Compositional Languages in Multi-entity Referential Games: from Image to Graph Representations. 18713-18723 - Matanel Oren, Michael Hassid, Yarden Nir, Yossi Adi, Roy Schwartz:
Transformers are Multi-State RNNs. 18724-18741 - Niyati Bafna, Kenton Murray, David Yarowsky:
Evaluating Large Language Models along Dimensions of Language Variation: A Systematik Invesdigatiom uv Cross-lingual Generalization. 18742-18762 - Kerem Zaman, Leshem Choshen, Shashank Srivastava:
Fuse to Forget: Bias Reduction and Selective Memorization through Model Fusion. 18763-18783 - Minwook Bae, Hyounghun Kim:
Collective Critics for Creative Story Generation. 18784-18819 - Eleftheria Tsipidi, Franz Nowak, Ryan Cotterell, Ethan Wilcox, Mario Giulianelli, Alex Warstadt:
Surprise! Uniform Information Density Isn't the Whole Story: Predicting Surprisal Contours in Long-form Discourse. 18820-18836 - Jaepill Choi, Kyubyung Chae, Jiwoo Song, Yohan Jo, Taesup Kim:
Model-based Preference Optimization in Abstractive Summarization without Human Feedback. 18837-18851 - Wataru Hashimoto, Hidetaka Kamigaito, Taro Watanabe:
Are Data Augmentation Methods in Named Entity Recognition Applicable for Uncertainty Estimation? 18852-18867 - Simona Doneva, Tilia Ellendorff, Beate Sick, Jean-Philippe Goldman, Amelia Cannon, Gerold Schneider, Benjamin Ineichen:
NeuroTrialNER: An Annotated Corpus for Neurological Diseases and Therapies in Clinical Trial Registries. 18868-18890 - Maxime Kayser, Bayar Menzat, Cornelius Emde, Bogdan Bercean, Alex Novak, Abdalá Morgado, Bartlomiej W. Papiez, Susanne Gaube, Thomas Lukasiewicz, Oana-Maria Camburu:
Fool Me Once? Contrasting Textual and Visual Explanations in a Clinical Decision-Support Setting. 18891-18919 - Weihe Zhai, Arkaitz Zubiaga, Bingquan Liu, Chengjie Sun, Yalong Zhao:
Towards Faithful Knowledge Graph Explanation Through Deep Alignment in Commonsense Question Answering. 18920-18930 - Yanting Liu, Tao Ji, Changzhi Sun, Yuanbin Wu, Xiaoling Wang:
Generation with Dynamic Vocabulary. 18931-18948 - Michele Contalbo, Francesco Guerra, Matteo Paganelli:
Argument Relation Classification through Discourse Markers and Adversarial Training. 18949-18954 - Abhishek Purushothama, Adam Wiemerslage, Katharina von der Wense:
Getting The Most Out of Your Training Data: Exploring Unsupervised Tasks for Morphological Inflection. 18955-18970 - Dae Yon Hwang, Bilal Taha, Harshit Pande, Yaroslav Nechaev:
Link, Synthesize, Retrieve: Universal Document Linking for Zero-Shot Information Retrieval. 18971-18982 - Po-Heng Chen, Yun-Nung Chen:
Efficient Unseen Language Adaptation for Multilingual Pre-Trained Language Models. 18983-18994 - Ruiyu Xiao, Lei Wu, Yuhang Gou, Weinan Zhang, Ting Liu:
Prove Your Point!: Bringing Proof-Enhancement Principles to Argumentative Essay Generation. 18995-19008 - Kate Sanders, Nathaniel Weir, Benjamin Van Durme:
TV-TREES: Multimodal Entailment Trees for Neuro-Symbolic Video Reasoning. 19009-19028 - Makesh Narsimhan Sreedhar, Traian Rebedea, Christopher Parisien:
Unsupervised Extraction of Dialogue Policies from Conversations. 19029-19045 - Onkar Susladkar, Gayatri Deshmukh, Vandan Gorade, Sparsh Mittal:
GRIZAL: Generative Prior-guided Zero-Shot Temporal Action Localization. 19046-19059 - Youngtaek Oh, Jae-Won Cho, Dong-Jin Kim, In So Kweon, Junmo Kim:
Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality. 19060-19076 - Wenyan Li, Crystina Zhang, Jiaang Li, Qiwei Peng, Raphael Tang, Li Zhou, Weijia Zhang, Guimin Hu, Yifei Yuan, Anders Søgaard, Daniel Hershcovich, Desmond Elliott:
FoodieQA: A Multimodal Dataset for Fine-Grained Understanding of Chinese Food Culture. 19077-19095 - Hoyeon Lee, Hyeeun Jang, Jong-Hwan Kim, Jae-Min Kim:
A Two-Step Approach for Data-Efficient French Pronunciation Learning. 19096-19103 - Rongzhi Li, Takeru Matsuda, Hitomi Yanaka:
Exploring Intra and Inter-language Consistency in Embeddings with ICA. 19104-19111 - Md Tawkat Islam Khondaker, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan:
DetoxLLM: A Framework for Detoxification with Explanations. 19112-19139 - Josephine Lukito, Bin Chen, Gina M. Masullo, Natalie Jomini Stroud:
Comparing a BERT Classifier and a GPT classifier for Detecting Connective Language Across Multiple Social Media. 19140-19153 - Yash Akhauri, Ahmed F. AbouElhamayed, Jordan Dotzel, Zhiru Zhang, Alexander M. Rush, Safeen Huda, Mohamed S. Abdelfattah:
ShadowLLM: Predictor-based Contextual Sparsity for Large Language Models. 19154-19167 - Krishnapriya Vishnubhotla, Daniela Teodorescu, Mallory J. Feldman, Kristen A. Lindquist, Saif M. Mohammad:
Emotion Granularity from Text: An Aggregate-Level Indicator of Mental Health. 19168-19185 - Chen Wang, Minpeng Liao, Zhongqiang Huang, Junhong Wu, Chengqing Zong, Jiajun Zhang:
BLSP-Emo: Towards Empathetic Large Speech-Language Models. 19186-19199 - Abhishek Divekar, Greg Durrett:
SynthesizRR: Generating Diverse Datasets with Retrieval Augmentation. 19200-19227 - Wenqi Zhang, Zhenglin Cheng, Yuanyu He, Mengna Wang, Yongliang Shen, Zeqi Tan, Guiyang Hou, Mingqian He, Yanna Ma, Weiming Lu, Yueting Zhuang:
Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model. 19228-19252 - Mohammed Saidul Islam, Md. Tahmid Rahman Laskar, Md Rizwan Parvez, Enamul Hoque, Shafiq Joty:
DataNarrative: Automated Data-Driven Storytelling with Visualizations and Texts. 19253-19286 - Dhananjay Ram, Aditya Rawal, Momchil Hardalov, Nikolaos Pappas, Sheng Zha:
DEM: Distribution Edited Model for Training with Mixed Data Distributions. 19287-19301 - Hu Xu, Po-Yao Huang, Xiaoqing Ellen Tan, Ching-Feng Yeh, Jacob Kahn, Christine Jou, Gargi Ghosh, Omer Levy, Luke Zettlemoyer, Wen-tau Yih, Shang-Wen Li, Saining Xie, Christoph Feichtenhofer:
Altogether: Image Captioning via Re-aligning Alt-text. 19302-19318 - Seoyeon Park, Cornelia Caragea:
VerifyMatch: A Semi-Supervised Learning Paradigm for Natural Language Inference with Confidence-Aware MixUp. 19319-19335 - Yash Kumar Lal, Vanya Cohen, Nathanael Chambers, Niranjan Balasubramanian, Raymond J. Mooney:
CaT-Bench: Benchmarking Language Model Understanding of Causal and Temporal Dependencies in Plans. 19336-19354 - Théo Gigant, Camille Guinaudeau, Marc Decombas, Frédéric Dufaux:
Mitigating the Impact of Reference Quality on Evaluation of Summarization Systems with Reference-Free Metrics. 19355-19368 - Manuj Malik, Jing Jiang, Kian Ming Chai:
An Empirical Analysis of the Writing Styles of Persona-Assigned LLMs. 19369-19388 - Amit Parekh, Nikolas Vitsakis, Alessandro Suglia, Ioannis Konstas:
Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks. 19389-19424 - Aleksander Ficek, Jiaqi Zeng, Oleksii Kuchaiev:
GPT vs RETRO: Exploring the Intersection of Retrieval and Parameter-Efficient Fine-Tuning. 19425-19432 - Xinyi He, Jiaru Zou, Yun Lin, Mengyu Zhou, Shi Han, Zejian Yuan, Dongmei Zhang:
CoCoST: Automatic Complex Code Generation with Online Searching and Correctness Testing. 19433-19451 - Avirup Saha, Lakshmi Mandal, Balaji Ganesan, Sambit Ghosh, Renuka Sindhgatta, Carlos Eberhardt, Dan Debrunner, Sameep Mehta:
Sequential API Function Calling Using GraphQL Schema. 19452-19458 - Judith Sieker, Simeon Junker, Ronja Utescher, Nazia Attari, Heiko Wersing, Hendrik Buschmeier, Sina Zarrieß:
The Illusion of Competence: Evaluating the Effect of Explanations on Users' Mental Models of Visual Question Answering Systems. 19459-19475 - Jessica Forde, Ruochen Zhang, Lintang Sutawika, Alham Fikri Aji, Samuel Cahyawijaya, Genta Indra Winata, Minghao Wu, Carsten Eickhoff, Stella Biderman, Ellie Pavlick:
Re-Evaluating Evaluation for Multilingual Summarization. 19476-19493 - Heng Zhao, Yinjie Zhao, Bihan Wen, Yew-Soon Ong, Joey Zhou:
Video-Text Prompting for Weakly Supervised Spatio-Temporal Video Grounding. 19494-19505 - Caio Corro:
A Fast and Sound Tagging Method for Discontinuous Named-Entity Recognition. 19506-19518 - Yuxia Wang, Minghan Wang, Muhammad Arslan Manzoor, Fei Liu, Georgi Georgiev, Rocktim Jyoti Das, Preslav Nakov:
Factuality of Large Language Models: A Survey. 19519-19529 - Youngwoo Kim, Razieh Rahimi, James Allan:
Discovering Biases in Information Retrieval Models Using Relevance Thesaurus as Global Explanation. 19530-19547 - Rongchen Guo, Isar Nejadgholi, Hillary Dawkins, Kathleen C. Fraser, Svetlana Kiritchenko:
Adaptable Moral Stances of Large Language Models on Sexist Content: Implications for Society and Gender Discourse. 19548-19564 - Rakesh R. Menon, Shashank Srivastava:
DISCERN: Decoding Systematic Errors in Natural Language for Text Classifiers. 19565-19583 - Soumya Suvra Ghosal, Samyadeep Basu, Soheil Feizi, Dinesh Manocha:
IntCoOp: Interpretability-Aware Vision-Language Prompt Tuning. 19584-19601 - Xiulin Yang, Jonas Groschwitz, Alexander Koller, Johan Bos:
Scope-enhanced Compositional Semantic Parsing for DRT. 19602-19616 - Siyang Liu, Trisha Maturi, Bowen Yi, Siqi Shen, Rada Mihalcea:
The Generation Gap: Exploring Age Bias in the Value Systems of Large Language Models. 19617-19634 - Talia Tseriotou, Adam Tsakalidis, Maria Liakata:
TempoFormer: A Transformer for Temporally-aware Representations in Change Detection. 19635-19653 - Guillermo Marco, Julio Gonzalo, María Teresa Mateo Girona, Ramón Santos:
Pron vs Prompt: Can Large Language Models already Challenge a World-Class Fiction Author at Creative Text Writing? 19654-19670 - Yanran Chen, Hannes Gröner, Sina Zarrieß, Steffen Eger:
Evaluating Diversity in Automatic Poetry Generation. 19671-19692 - Yi Zhou, Danushka Bollegala, José Camacho-Collados:
Evaluating Short-Term Temporal Fluctuations of Social Biases in Social Media Data and Masked Language Models. 19693-19708 - Camilla Casula, Sebastiano Salto, Alan Ramponi, Sara Tonelli:
Delving into Qualitative Implications of Synthetic Data for Hate Speech Detection. 19709-19726 - Zineng Tang, Lingjun Mao, Alane Suhr:
Grounding Language in Multi-Perspective Referential Communication. 19727-19741 - Yifan Qiao, Parker Carlson, Shanxiu He, Yingrui Yang, Tao Yang:
Threshold-driven Pruning with Segmented Maximum Term Weights for Approximate Cluster-based Sparse Retrieval. 19742-19757 - Hizkiel Alemayehu, Hamada M. Zahera, Axel-Cyrille Ngonga Ngomo:
Error Analysis of Multilingual Language Models in Machine Translation: A Case Study of English-Amharic Translation. 19758-19768 - Arkadiusz Modzelewski, Giovanni Da San Martino, Pavel Savov, Magdalena Wilczynska, Adam Wierzbicki:
MIPD: Exploring Manipulation and Intention In a Novel Corpus of Polish Disinformation. 19769-19785 - Artem Abzaliev, Rada Mihalcea:
Unsupervised Discrete Representations of American Sign Language. 19786-19793 - Chani Jung, Dongkwan Kim, Jiho Jin, Jiseon Kim, Yeon Seonwoo, Yejin Choi, Alice Oh, Hyunwoo Kim:
Perceptions to Beliefs: Exploring Precursory Inferences for Theory of Mind in Large Language Models. 19794-19809 - Mihir Parmar, Hanieh Deilamsalehy, Franck Dernoncourt, Seunghyun Yoon, Ryan A. Rossi, Trung Bui:
Towards Enhancing Coherence in Extractive Summarization: Dataset and Experiments with LLMs. 19810-19820 - Parand A. Alamdari, Yanshuai Cao, Kevin H. Wilson:
Jump Starting Bandits with LLM-Generated Prior Knowledge. 19821-19833 - Firat Öncel, Matthias Bethge, Beyza Ermis, Mirco Ravanelli, Cem Subakan, Çagatay Yildiz:
Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve? 19834-19843 - Ruotong Pan, Boxi Cao, Hongyu Lin, Xianpei Han, Jia Zheng, Sirui Wang, Xunliang Cai, Le Sun:
Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation. 19844-19863 - Suhong Moon, Marwa Abdulhai, Minwoo Kang, Joseph Suh, Widyadewi Soedarmadji, Eran Kohen Behar, David M. Chan:
Virtual Personas for Language Models via an Anthology of Backstories. 19864-19897 - Nemika Tyagi, Mihir Parmar, Mohith Kulkarni, Aswin RRV, Nisarg Patel, Mutsumi Nakamura, Arindam Mitra, Chitta Baral:
Step-by-Step Reasoning to Solve Grid Puzzles: Where do LLMs Falter? 19898-19915 - Junlin Wang, Siddhartha Jain, Dejiao Zhang, Baishakhi Ray, Varun Kumar, Ben Athiwaratkun:
Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies. 19916-19939 - Joel Mire, Maria Antoniak, Elliott Ash, Andrew Piper, Maarten Sap:
The Empirical Variability of Narrative Perceptions of Social Media Texts. 19940-19968 - Yating Wu, Ritika Mangla, Alex Dimakis, Greg Durrett, Junyi Jessy Li:
Which questions should I answer? Salience Prediction of Inquisitive Questions. 19969-19987 - Lei Sun, Jinming Zhao, Qin Jin:
Revealing Personality Traits: A New Benchmark Dataset for Explainable Personality Recognition on Dialogues. 19988-20002 - Guan-Ting Lin, Wei Huang, Hung-yi Lee:
Continual Test-time Adaptation for End-to-end Speech Recognition on Noisy Speech. 20003-20015 - Sachit Menon, Richard S. Zemel, Carl Vondrick:
Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities. 20016-20031 - Weixi Tong, Tianyi Zhang:
CodeJudge: Evaluating Code Generation with Large Language Models. 20032-20051 - Guohao Sun, Can Qin, Huazhu Fu, Linwei Wang, Zhiqiang Tao:
Self-Training Large Language and Vision Assistant for Medical Question Answering. 20052-20060 - Prakamya Mishra, Zonghai Yao, Parth Vashisht, Feiyun Ouyang, Beining Wang, Vidhi Dhaval Mody, Hong Yu:
SYNFAC-EDIT: Synthetic Imitation Edit Feedback for Factual Alignment in Clinical Summarization. 20061-20083 - Yujun Zhou, Yufei Han, Haomin Zhuang, Kehan Guo, Zhenwen Liang, Hongyan Bao, Xiangliang Zhang:
Defending Jailbreak Prompts via In-Context Adversarial Game. 20084-20105 - Kateryna Kasianenko, Shima Khanehzar, Stephen Wan, Ehsan Dehghan, Axel Bruns:
Detecting Online Community Practices with Large Language Models: A Case Study of Pro-Ukrainian Publics on Twitter. 20106-20135 - Dimosthenis Antypas, Asahi Ushio, Francesco Barbieri, José Camacho-Collados:
Multilingual Topic Classification in X: Dataset and Analysis. 20136-20152 - Wai-Chung Kwan, Xingshan Zeng, Yuxin Jiang, Yufei Wang, Liangyou Li, Lifeng Shang, Xin Jiang, Qun Liu, Kam-Fai Wong:
MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models. 20153-20177 - Amir Zur, Elisa Kreiss, Karel D'Oosterlinck, Christopher Potts, Atticus Geiger:
Updating CLIP to Prefer Descriptions Over Captions. 20178-20187 - Sian-Yao Huang, Cheng-Lin Yang, Che-Yu Lin, Chun-Ying Huang:
CmdCaliper: A Semantic-Aware Command-Line Embedding Model and Dataset for Security Research. 20188-20206 - Jonathan Hus, Antonios Anastasopoulos:
Back to School: Translation Using Grammar Books. 20207-20219 - Hammad A. Ayyubi, Tianqi Liu, Arsha Nagrani, Xudong Lin, Mingda Zhang, Anurag Arnab, Feng Han, Yukun Zhu, Xuande Feng, Kevin Zhang, Jialu Liu, Shih-Fu Chang:
VIEWS: Entity-Aware News Video Captioning. 20220-20239 - Saüc Abadal Lloret, Shehzaad Dhuliawala, Keerthiram Murugesan, Mrinmaya Sachan:
Towards Aligning Language Models with Textual Feedback. 20240-20266 - Sheng Yang, Yurong Wu, Yan Gao, Zineng Zhou, Bin Zhu, Xiaodi Sun, Jian-Guang Lou, Zhiming Ding, Anbang Hu, Yuan Fang, Yunsong Li, Junyan Chen, Linjun Yang:
AMPO: Automatic Multi-Branched Prompt Optimization. 20267-20279 - Xinglin Lyu, Junhui Li, Yanqing Zhao, Min Zhang, Daimeng Wei, Shimin Tao, Hao Yang:
DeMPT: Decoding-enhanced Multi-phase Prompt Tuning for Making LLMs Be Better Context-aware Translators. 20280-20295 - Devleena Das, Vivek Khetan:
DEFT-UCS: Data Efficient Fine-Tuning for Pre-Trained Language Models via Unsupervised Core-Set Selection for Text-Editing. 20296-20312 - Yuko Nakagi, Takuya Matsuyama, Naoko Koide-Majima, Hiroto Yamaguchi, Rieko Kubo, Shinji Nishimoto, Yu Takagi:
Unveiling Multi-level and Multi-modal Semantic Representations in the Human Brain using Large Language Models. 20313-20338 - Preetam Prabhu Srikar Dammu, Hayoung Jung, Anjali Singh, Monojit Choudhury, Tanushree Mitra:
"They are uncultured": Unveiling Covert Harms and Social Threats in LLM Generated Conversations. 20339-20369 - Do Xuan Long, Duong Ngoc Yen, Anh Tuan Luu, Kenji Kawaguchi, Min-Yen Kan, Nancy F. Chen:
Multi-expert Prompting Improves Reliability, Safety and Usefulness of Large Language Models. 20370-20401 - Gabriel Roccabruna, Massimo Rizzoli, Giuseppe Riccardi:
Will LLMs Replace the Encoder-Only Models in Temporal Relation Classification? 20402-20415 - Keunwoo Peter Yu, Zheyuan Zhang, Fengyuan Hu, Shane Storks, Joyce Chai:
Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties. 20416-20431 - Gregory Kang Ruey Lau, Xinyuan Niu, Hieu Dao, Jiangwei Chen, Chuan-Sheng Foo, Bryan Kian Hsiang Low:
Waterfall: Scalable Framework for Robust Text Watermarking and Provenance for LLMs. 20432-20466 - Nicholas Deas, Elsbeth Turcan, Iván Pérez Mejía, Kathleen R. McKeown:
MASIVE: Open-Ended Affective State Identification in English and Spanish. 20467-20485 - Tasnim Kabir, Yoo Yeon Sung, Saptarashmi Bandyopadhyay, Hao Zou, Abhranil Chandra, Jordan L. Boyd-Graber:
You Make me Feel like a Natural Question: Training QA Systems on Transformed Trivia Questions. 20486-20510 - Peijun Qing, Chongyang Gao, Yefan Zhou, Xingjian Diao, Yaoqing Yang, Soroush Vosoughi:
AlphaLoRA: Assigning LoRA Experts Based on Layer Training Quality. 20511-20523 - Irfan Robbani, Paul Reisert, Surawat Pothong, Naoya Inoue, Camélia Guerraoui, Wenzhi Wang, Shoichi Naito, Jungmin Choi, Kentaro Inui:
Flee the Flaw: Annotating the Underlying Logic of Fallacious Arguments Through Templates and Slot-filling. 20524-20540 - Leena Mathur, Paul Pu Liang, Louis-Philippe Morency:
Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions. 20541-20560 - Ziyi Kou, Shichao Pei, Meng Jiang, Xiangliang Zhang:
RAt: Injecting Implicit Bias for Text-To-Image Prompt Refinement Models. 20561-20570 - Rifki Afina Putri, Faiz Ghifari Haznitrama, Dea Adhista, Alice Oh:
Can LLM Generate Culturally Relevant Commonsense QA Data? Case Study in Indonesian and Sundanese. 20571-20590 - Miyu Oba, Yohei Oseki, Akiyo Fukatsu, Akari Haga, Hiroki Ouchi, Taro Watanabe, Saku Sugawara:
Can Language Models Induce Grammatical Knowledge from Indirect Evidence? 20591-20603 - Jialiang Xu, Shenglan Li, Zhaozhuo Xu, Denghui Zhang:
Do LLMs Know to Respect Copyright Notice? 20604-20619 - Ryan Sun, Tianyi Zhou, Xun Chen, Lichao Sun:
SpecHub: Provable Acceleration to Multi-Draft Speculative Decoding. 20620-20641 - YeonJoon Jung, Jaeseong Lee, Seungtaek Choi, Dohyeon Lee, Minsoo Kim, Seung-won Hwang:
Interventional Speech Noise Injection for ASR Generalizable Spoken Language Understanding. 20642-20655 - Sungdong Kim, Minjoon Seo:
Rethinking the Role of Proxy Rewards in Language Model Alignment. 20656-20674 - Abhirama Subramanyam Penamakuri, Anand Mishra:
Visual Text Matters: Improving Text-KVQA with Visual Text Entity Knowledge-aware Large Multimodal Assistant. 20675-20688 - Stefano Perrella, Lorenzo Proietti, Pere-Lluís Huguet Cabot, Edoardo Barba, Roberto Navigli:
Beyond Correlation: Interpretable Evaluation of Machine Translation Metrics. 20689-20714 - Soeun Lee, Si-Woo Kim, Taewhan Kim, Dong-Jin Kim:
IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning. 20715-20727 - Haoyu Dong, Jianbo Zhao, Yuzhang Tian, Junyu Xiong, Mengyu Zhou, Yun Lin, José Cambronero, Yeye He, Shi Han, Dongmei Zhang:
Encoding Spreadsheets for Large Language Models. 20728-20748 - Rositsa V. Ivanova, Thomas Huber, Christina Niklaus:
Let's discuss! Quality Dimensions and Annotated Datasets for Computational Argument Quality Assessment. 20749-20779 - Dongfang Xu, Davy Weissenbacher, Karen O'Connor, Siddharth Rawal, Graciela Gonzalez-Hernandez:
Automatic sentence segmentation of clinical record narratives in real-world data. 20780-20793 - Heeyoung Lee:
One-to-Many Communication and Compositionality in Emergent Communication. 20794-20811 - Siyin Wang, Chao-Han Huck Yang, Ji Wu, Chao Zhang:
Bayesian Example Selection Improves In-Context Learning for Speech, Text and Visual Modalities. 20812-20828 - Alexander Arno Weber, Klaudia Thellmann, Jan Ebert, Nicolas Flores-Herr, Jens Lehmann, Michael Fromm, Mehdi Ali:
Investigating Multilingual Instruction-Tuning: Do Polyglot Models Demand for Multilingual Instructions? 20829-20855 - Nisarg Patel, Mohith Kulkarni, Mihir Parmar, Aashna Budhiraja, Mutsumi Nakamura, Neeraj Varshney, Chitta Baral:
Multi-LogiEval: Towards Evaluating Multi-Step Logical Reasoning Ability of Large Language Models. 20856-20879 - Mayukh Sharma, Sean O'Brien, Julian J. McAuley:
Linear Layer Extrapolation for Fine-Grained Emotion Classification. 20880-20888 - Xiao Liang, Xinyu Hu, Simiao Zuo, Yeyun Gong, Qiang Lou, Yi Liu, Shao-Lun Huang, Jian Jiao:
Task Oriented In-Domain Data Augmentation. 20889-20907 - Shruti Singh, Nandan Sarkar, Arman Cohan:
SciDQA: A Deep Reading Comprehension Dataset over Scientific Papers. 20908-20923 - Zhuocheng Gong, Ang Lv, Jian Guan, Wei Wu, Huishuai Zhang, Minlie Huang, Dongyan Zhao, Rui Yan:
Mixture-of-Modules: Reinventing Transformers as Dynamic Assemblies of Modules. 20924-20938 - Youssef Mohamed, Runjia Li, Ibrahim Said Ahmad, Kilichbek Haydarov, Philip Torr, Kenneth Church, Mohamed Elhoseiny:
No Culture Left Behind: ArtELingo-28, a Benchmark of WikiArt with Captions in 28 Languages. 20939-20962 - Someen Park, Jaehoon Kim, Seungwan Jin, Sohyun Park, Kyungsik Han:
PREDICT: Multi-Agent-based Debate Simulation for Generalized Hate Speech Detection. 20963-20987 - Shashi Kumar, Srikanth R. Madikeri, Juan Pablo Zuluaga-Gomez, Iuliia Thorbecke, Esaú Villatoro-Tello, Sergio Burdisso, Petr Motlícek, Karthik S, Aravind Ganapathiraju:
TokenVerse: Towards Unifying Speech and NLP Tasks via Transducer-based ASR. 20988-20995 - Baohao Liao, Christian Herold, Shahram Khadivi, Christof Monz:
ApiQ: Finetuning of 2-Bit Quantized Large Language Model. 20996-21020 - Zhiyuan Zeng, Qipeng Guo, Xiaoran Liu, Zhangyue Yin, Wentao Shu, Mianqiu Huang, Bo Wang, Yunhua Zhou, Linlin Li, Qun Liu, Xipeng Qiu:
Memorize Step by Step: Efficient Long-Context Prefilling with Incremental Memory and Decremental Chunk. 21021-21034 - Poulami Ghosh, Shikhar Vashishth, Raj Dabre, Pushpak Bhattacharyya:
A Morphology-Based Investigation of Positional Encodings. 21035-21045 - Vahid Ghafouri, Jose Such, Guillermo Suarez-Tangil:
I love pineapple on pizza != I hate pineapple on pizza: Stance-Aware Sentence Transformers for Opinion Mining. 21046-21058 - Mamta Mamta, Rishikant Chigrupaatii, Asif Ekbal:
BiasWipe: Mitigating Unintended Bias in Text Classifiers through Model Interpretability. 21059-21070 - Firoj Alam, Abul Hasnat, Fatema Ahmad, Md. Arid Hasan, Maram Hasanain:
ArMeme: Propagandistic Content in Arabic Memes. 21071-21090 - Arianna Muti, Federico Ruggeri, Khalid Al-Khatib, Alberto Barrón-Cedeño, Tommaso Caselli:
Language is Scary when Over-Analyzed: Unpacking Implied Misogynistic Reasoning with Argumentation Theory-Driven Prompts. 21091-21107 - Zhonghua Zheng, Lizi Liao, Yang Deng, Ee-Peng Lim, Minlie Huang, Liqiang Nie:
Thoughts to Target: Enhance Planning for Target-driven Conversation. 21108-21124 - Clara Na, Ian Magnusson, Ananya Harsh Jha, Tom Sherborne, Emma Strubell, Jesse Dodge, Pradeep Dasigi:
Scalable Data Ablation Approximations for Language Models through Modular Training and Merging. 21125-21141 - Zhe Cao, Zhi Qu, Hidetaka Kamigaito, Taro Watanabe:
Exploring Intrinsic Language-specific Subspaces in Fine-tuning Multilingual Neural Machine Translation. 21142-21157 - Zhiyu Guo, Hidetaka Kamigaito, Taro Watanabe:
Attention Score is not All You Need for Token Importance Indicator in KV Cache Reduction: Value Also Matters. 21158-21166 - Jinyoung Park, Minseok Joo, Joo-Kyung Kim, Hyunwoo J. Kim:
Generative Subgraph Retrieval for Knowledge Graph-Grounded Dialog Generation. 21167-21182 - Tuc Nguyen, Thai Le:
Adapters Mixup: Mixing Parameter-Efficient Adapters to Enhance the Adversarial Robustness of Fine-tuned Pre-trained Text Classifiers. 21183-21203 - Woojin Kim, Sungeun Hahm, Jaejin Lee:
Generalizing Clinical De-identification Models by Privacy-safe Data Augmentation using GPT-4. 21204-21218 - Prisha Samadarshi, Mariam Mustafa, Anushka Kulkarni, Raven Rothkopf, Tuhin Chakrabarty, Smaranda Muresan:
Connecting the Dots: Evaluating Abstract Reasoning Capabilities of LLMs Using the New York Times Connections Word Game. 21219-21236 - Raphael Scheible, Johann Frei, Fabian Thomczyk, Henry He, Patric Tippmann, Jochen Knaus, Victor Jaravine, Frank Kramer, Martin Boeker:
GottBERT: a pure German Language Model. 21237-21250 - Khoi P. N. Nguyen, Vincent Ng:
Computational Meme Understanding: A Survey. 21251-21267 - Costas Mavromatis, Balasubramaniam Srinivasan, Zhengyuan Shen, Jiani Zhang, Huzefa Rangwala, Christos Faloutsos, George Karypis:
CoverICL: Selective Annotation for In-Context Learning via Active Graph Coverage. 21268-21286 - Nicola Dall'Asen, Yiming Wang, Enrico Fini, Elisa Ricci:
Retrieval-enriched zero-shot image classification in low-resource domains. 21287-21302 - Xianquan Wang, Likang Wu, Shukang Yin, Zhi Li, Yanjiang Chen, Hufeng Hufeng, Yu Su, Qi Liu:
I-AM-G: Interest Augmented Multimodal Generator for Item Personalization. 21303-21317 - Giuseppe Attanasio, Beatrice Savoldi, Dennis Fucci, Dirk Hovy:
Twists, Humps, and Pebbles: Multilingual Speech Recognition Models Exhibit Gender Performance Gaps. 21318-21340 - Baihe Huang, Hiteshi Sharma, Yi Mao:
Enhancing Language Model Alignment: A Confidence-Based Approach to Label Smoothing. 21341-21352 - Yannis Flet-Berliac, Nathan Grinsztajn, Florian Strub, Eugene Choi, Bill Wu, Chris Cremer, Arash Ahmadian, Yash Chandak, Mohammad Gheshlaghi Azar, Olivier Pietquin, Matthieu Geist:
Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion. 21353-21370 - Diogo Glória-Silva, David Semedo, João Magalhães:
Show and Guide: Instructional-Plan Grounded Vision and Language Model. 21371-21389 - Bandhav Veluri, Benjamin N. Peloquin, Bokai Yu, Hongyu Gong, Shyamnath Gollakota:
Beyond Turn-Based Interfaces: Synchronous LLMs as Full-Duplex Dialogue Agents. 21390-21402 - Minsoo Kim, Jongyoon Kim, Jihyuk Kim, Seung-won Hwang:
QuBE: Question-based Belief Enhancement for Agentic LLM Reasoning. 21403-21423 - Chanwoong Yoon, Taewhoo Lee, Hyeon Hwang, Minbyul Jeong, Jaewoo Kang:
CompAct: Compressing Retrieved Documents Actively for Question Answering. 21424-21439 - Fatemeh Shiri, Xiao-Yu Guo, Mona Far, Xin Yu, Reza Haf, Yuan-Fang Li:
An Empirical Analysis on Spatial Reasoning Capabilities of Large Multimodal Models. 21440-21455 - Jiaxin Zhang, Wendi Cui, Yiran Huang, Kamalika Das, Kumar Sricharan:
Synthetic Knowledge Ingestion: Towards Knowledge Refinement and Injection for Enhancing Large Language Models. 21456-21473 - Marlene Lutz, Rochelle Choenni, Markus Strohmaier, Anne Lauscher:
Local Contrastive Editing of Gender Stereotypes. 21474-21493 - Stefan Larson, Nicole Lima, Santiago Diaz, Amogh Manoj Joshi, Siddharth Betala, Jamiu Suleiman, Yash Mathur, Kaushal Prajapati, Ramla Alakraa, Junjie Shen, Temi Okotore, Kevin Leach:
De-Identification of Sensitive Personal Data in Datasets Derived from IIT-CDIP. 21494-21505 - Avik Dutta, Mukul Singh, Gust Verbruggen, Sumit Gulwani, Vu Le:
RAR: Retrieval-augmented retrieval for code generation in low resource languages. 21506-21515 - Laura Weidinger, John Mellor, Bernat Guillen Pegueroles, Nahema Marchal, Ravin Kumar, Kristian Lum, Canfer Akbulut, Mark Diaz, A. Stevie Bergman, Mikel Rodriguez, Verena Rieser, William Isaac:
STAR: SocioTechnical Approach to Red Teaming Language Models. 21516-21532 - Maharshi Gor, Hal Daumé III, Tianyi Zhou, Jordan L. Boyd-Graber:
Do great minds think alike? Investigating Human-AI Complementarity in Question Answering with CAIMIRA. 21533-21564 - Antoine Simoulin, Namyong Park, Xiaoyi Liu, Grey Yang:
Memory-Efficient Fine-Tuning of Transformers via Token Selection. 21565-21580 - Tarun Tater, Sabine Schulte im Walde, Diego Frassinelli:
Unveiling the mystery of visual attributes of concrete and abstract concepts: Variability, nearest neighbors, and challenging categories. 21581-21597 - Elizabeth Fons, Rachneet Kaur, Soham Palande, Zhen Zeng, Tucker Balch, Manuela Veloso, Svitlana Vyetrenko:
Evaluating Large Language Models on Time Series Feature Understanding: A Comprehensive Taxonomy and Benchmark. 21598-21634 - Shudong Liu, Zhaocong Li, Xuebo Liu, Runzhe Zhan, Derek F. Wong, Lidia S. Chao, Min Zhang:
Can LLMs Learn Uncertainty on Their Own? Expressing Uncertainty Effectively in A Self-Training Manner. 21635-21645 - Hai Ye, Hwee Tou Ng:
Preference-Guided Reflective Sampling for Aligning Language Models. 21646-21668 - Pieter Delobelle, Giuseppe Attanasio, Debora Nozza, Su Lin Blodgett, Zeerak Talat:
Metrics for What, Metrics for Whom: Assessing Actionability of Bias Evaluation Metrics in NLP. 21669-21691 - Xuhui Zhou, Zhe Su, Tiwalayo Eisape, Hyunwoo Kim, Maarten Sap:
Is this the real life? Is this just fantasy? The Misleading Success of Simulating Social Interactions With LLMs. 21692-21714 - Ce Zhang, Taixi Lu, Md Mohaiminul Islam, Ziyang Wang, Shoubin Yu, Mohit Bansal, Gedas Bertasius:
A Simple LLM Framework for Long-Range Video Question-Answering. 21715-21737 - Akshat Gupta, Sidharth Baskaran, Gopala Anumanchipalli:
Rebuilding ROME : Resolving Model Collapse during Sequential Model Editing. 21738-21744 - Bashar Talafha, Karima Kadaoui, Samar Mohamed Magdy, Mariem Habiboullah, Chafei Mohamed Chafei, Ahmed Oumar El-Shangiti, Hiba Zayed, Mohamedou Cheikh Tourad, Rahaf Alhamouri, Rwaa Assi, Aisha Alraeesi, Hour Mohamed, Fakhraddin Alwajih, Abdelrahman Mohamed, Abdellah El Mekki, El Moatez Billah Nagoudi, Benelhadj Saadia, Hamzah A. Alsayadi, Walid Al-Dhabyani, Sara Shatnawi, Yasir Ech-Chammakhy, Amal Makouar, Yousra Berrachedi, Mustafa Jarrar, Shady Shehata, Ismail Berrada, Muhammad Abdul-Mageed:
Casablanca: Data and Models for Multidialectal Arabic Speech Recognition. 21745-21758 - Rima Hazra, Sayan Layek, Somnath Banerjee, Soujanya Poria:
Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations. 21759-21776 - Kata Naszádi, Frans A. Oliehoek, Christof Monz:
Communicating with Speakers and Listeners of Different Pragmatic Levels. 21777-21783 - Bhathiya Hemanthage, Hakan Bilen, Phil Bartie, Christian Dondrup, Oliver Lemon:
RECANTFormer: Referring Expression Comprehension with Varying Numbers of Targets. 21784-21798 - Baolin Li, Yankai Jiang, Vijay Gadepally, Devesh Tiwari:
Sprout: Green Generative AI with Carbon-Efficient LLM Inference. 21799-21813 - Alexander Spangher, Nanyun Peng, Sebastian Gehrmann, Mark Dredze:
Do LLMs Plan Like Human Writers? Comparing Journalist Coverage of Press Releases with LLMs. 21814-21828 - Björn Deiseroth, Manuel Brack, Patrick Schramowski, Kristian Kersting, Samuel Weinbach:
T-FREE: Subword Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings. 21829-21851 - HyoJung Han, Kevin Duh, Marine Carpuat:
SpeechQE: Estimating the Quality of Direct Speech Translation. 21852-21867 - Negar Arabzadeh, Siqing Huo, Nikhil Mehta, Qingyun Wu, Chi Wang, Ahmed Awadallah, Charles L. A. Clarke, Julia Kiseleva:
Assessing and Verifying Task Utility in LLM-Powered Applications. 21868-21888 - Somanshu Singla, Zhen Wang, Tianyang Liu, Abdullah Ashfaq, Zhiting Hu, Eric P. Xing:
Dynamic Rewarding with Prompt Optimization Enables Tuning-free Self-Alignment of Language Models. 21889-21909 - Harbani Jaggi, Kashyap Coimbatore Murali, Eve Fleisig, Erdem Biyik:
Accurate and Data-Efficient Toxicity Prediction when Annotators Disagree. 21910-21917 - Youxiang Zhu, Nana Lin, Kiran Balivada, Daniel Haehn, Xiaohui Liang:
Adversarial Text Generation using Large Language Models for Dementia Detection. 21918-21933 - Daniil Larionov, Mikhail Seleznyov, Vasiliy Viskov, Alexander Panchenko, Steffen Eger:
xCOMET-lite: Bridging the Gap Between Efficiency and Quality in Learned MT Evaluation Metrics. 21934-21949 - Giovanni Marraffini, Andrés Cotton, Noe Hsueh, Axel Fridman, Juan Wisznia, Luciano Corro:
The Greatest Good Benchmark: Measuring LLMs' Alignment with Utilitarian Moral Dilemmas. 21950-21959 - Jiali Cheng, Hadi Amiri:
FairFlow: Mitigating Dataset Biases through Undecided Learning for Natural Language Understanding. 21960-21975 - Jai Aggarwal, Suzanne Stevenson:
Style-Shifting Behaviour of the Manosphere on Reddit. 21976-21989 - Yihan Ma, Xinyue Shen, Yixin Wu, Boyang Zhang, Michael Backes, Yang Zhang:
The Death and Life of Great Prompts: Analyzing the Evolution of LLM Prompts from the Structural Perspective. 21990-22001 - Minqian Liu, Zhiyang Xu, Zihao Lin, Trevor Ashby, Joy Rimchala, Jiaxin Zhang, Lifu Huang:
Holistic Evaluation for Interleaved Text-and-Image Generation. 22002-22016 - Simeng Han, Hailey Schoelkopf, Yilun Zhao, Zhenting Qi, Martin Riddell, Wenfei Zhou, James Coady, David Peng, Yujie Qiao, Luke Benson, Lucy Sun, Alexander Wardle-Solano, Hannah Szabó, Ekaterina Zubova, Matthew Burtell, Jonathan Fan, Yixin Liu, Brian Wong, Malcolm Sailor, Ansong Ni, Linyong Nan, Jungo Kasai, Tao Yu, Rui Zhang, Alexander R. Fabbri, Wojciech Kryscinski, Semih Yavuz, Ye Liu, Xi Victoria Lin, Shafiq Joty, Yingbo Zhou, Caiming Xiong, Rex Ying, Arman Cohan, Dragomir Radev:
FOLIO: Natural Language Reasoning with First-Order Logic. 22017-22031 - Alexander S. Choi, Syeda Sabrina Akter, J. p. Singh, Antonios Anastasopoulos:
The LLM Effect: Are Humans Truly Using LLMs, or Are They Being Influenced By Them Instead? 22032-22054 - Steven Y. Feng, Noah D. Goodman, Michael Frank:
Is Child-Directed Speech Effective Training Data for Language Models? 22055-22071 - Yige Xu, Xu Guo, Zhiwei Zeng, Chunyan Miao:
RevMUX: Data Multiplexing with Reversible Adapters for Efficient LLM Batch Inference. 22072-22087 - Juncai Li, Ru Li, Xiaoli Li, Qinghua Chai, Jeff Z. Pan:
Inference Helps PLMs' Conceptual Understanding: Improving the Abstract Inference Ability with Hierarchical Conceptual Entailment Graphs. 22088-22104 - Gitanjali Kumari, Kirtan Jain, Asif Ekbal:
M3Hop-CoT: Misogynous Meme Identification with Multimodal Multi-hop Chain-of-Thought. 22105-22138 - Govind Ramesh, Yao Dou, Wei Xu:
GPT-4 Jailbreaks Itself with Near-Perfect Success Using Self-Explanation. 22139-22148 - Kiseung Kim, Jay-Yoon Lee:
RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented Generation. 22149-22161 - Vatsal Gupta, Pranshu Pandya, Tushar Kataria, Vivek Gupta, Dan Roth:
Evaluating Concurrent Robustness of Language Models Across Diverse Challenge Sets. 22162-22184 - Mana Makinae, Yusuke Sakai, Hidetaka Kamigaito, Taro Watanabe:
Simul-MuST-C: Simultaneous Multilingual Speech Translation Corpus Using Large Language Model. 22185-22205 - Pritika Ramu, Aparna Garimella, Sambaran Bandyopadhyay:
Is This a Bad Table? A Closer Look at the Evaluation of Table Generation from Text. 22206-22216 - Abhishek Ghose, Emma Nguyen:
On the Fragility of Active Learners for Text Classification. 22217-22233 - Ran Xu, Wenqi Shi, Yue Yu, Yuchen Zhuang, Yanqiao Zhu, May Dongmei Wang, Joyce C. Ho, Chao Zhang, Carl Yang:
BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers. 22234-22254 - Jonghyun Song, Cheyon Jin, Wenlong Zhao, Andrew McCallum, Jay-Yoon Lee:
Comparing Neighbors Together Makes it Easy: Jointly Comparing Multiple Candidates for Efficient and Effective Retrieval. 22255-22269 - Chia-Wei Tang, Ting-Chih Chen, Kiet Nguyen, Kazi Sajeed Mehrab, Alvi Md. Ishmam, Chris Thomas:
M3D: MultiModal MultiDocument Fine-Grained Inconsistency Detection. 22270-22293 - Wenqi Shi, Ran Xu, Yuchen Zhuang, Yue Yu, Haotian Sun, Hang Wu, Carl Yang, May Dongmei Wang:
MedAdapter: Efficient Test-Time Adaptation of Large Language Models Towards Medical Reasoning. 22294-22314 - Wenqi Shi, Ran Xu, Yuchen Zhuang, Yue Yu, Jieyu Zhang, Hang Wu, Yuanda Zhu, Joyce C. Ho, Carl Yang, May Dongmei Wang:
EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Records. 22315-22339 - Hoang-Quoc Nguyen-Son, Minh-Son Dao, Koji Zettsu:
SimLLM: Detecting Sentences Generated by Large Language Models Using Similarity between the Generation and its Re-generation. 22340-22352 - Meiqi Chen, Bo Peng, Yan Zhang, Chaochao Lu:
CELLO: Causal Evaluation of Large Vision-Language Models. 22353-22374 - Yusuke Sakai, Mana Makinae, Hidetaka Kamigaito, Taro Watanabe:
Simultaneous Interpretation Corpus Construction by Large Language Models in Distant Language Pair. 22375-22398 - Xudong Lin, Manling Li, Richard S. Zemel, Heng Ji, Shih-Fu Chang:
Training-free Deep Concept Injection Enables Language Models for Video Question Answering. 22399-22416 - Haowei Liu, Xi Zhang, Haiyang Xu, Yaya Shi, Chaoya Jiang, Ming Yan, Ji Zhang, Fei Huang, Chunfeng Yuan, Bing Li, Weiming Hu:
MIBench: Evaluating Multimodal Large Language Models over Multiple Images. 22417-22428 - Francesco Molfese, Simone Conia, Riccardo Orlando, Roberto Navigli:
ZEBRA: Zero-Shot Example-Based Retrieval Augmentation for Commonsense Question Answering. 22429-22444 - Kshitij Mishra, Manisha Burja, Asif Ekbal:
ABLE: Personalized Disability Support with Politeness and Empathy Integration. 22445-22470 - Hyungjoo Chae, Yeonghyeon Kim, Seungone Kim, Kai Tzu-iunn Ong, Beong-woo Kwak, Moohyeon Kim, Sunghwan Kim, Taeyoon Kwon, Jiwan Chung, Youngjae Yu, Jinyoung Yeo:
Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models. 22471-22502 - Hyungjoo Chae, Taeyoon Kwon, Seungjun Moon, Yongho Song, Dongjin Kang, Kai Tzu-iunn Ong, Beong-woo Kwak, Seonghyeon Bae, Seung-won Hwang, Jinyoung Yeo:
Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code. 22503-22524 - David Heineman, Yao Dou, Wei Xu:
Improving Minimum Bayes Risk Decoding with Multi-Prompt. 22525-22545 - Gopendra Vikram Singh, Sai Vemulapalli, Mauajama Firdaus, Asif Ekbal:
Deciphering Cognitive Distortions in Patient-Doctor Mental Health Conversations: A Multimodal LLM-Based Detection and Reasoning Framework. 22546-22570 - Neil Chowdhury, Franklin Wang, Sumedh Shenoy, Douwe Kiela, Sarah Schwettmann, Tristan Thrush:
Nearest Neighbor Normalization Improves Multimodal Retrieval. 22571-22582 - Shengguang Wu, Shusheng Yang, Zhenglun Chen, Qi Su:
Rethinking Pragmatics in Large Language Models: Towards Open-Ended Evaluation and Preference Tuning. 22583-22599 - Qingfei Zhao, Ruobing Wang, Yukuo Cen, Daren Zha, Shicheng Tan, Yuxiao Dong, Jie Tang:
LongRAG: A Dual-Perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering. 22600-22632 - Yuxuan Guo, Zhiliang Tian, Yiping Song, Tianlun Liu, Liang Ding, Dongsheng Li:
Context-aware Watermark with Semantic Balanced Green-red Lists for Large Language Models. 22633-22646 - Mengqi Zhang, Xiaotian Ye, Qiang Liu, Pengjie Ren, Shu Wu, Zhumin Chen:
Knowledge Graph Enhanced Large Language Model Editing. 22647-22662 - Sandeep Kumar, Mohit Sahu, Vardhan Gacche, Tirthankar Ghosal, Asif Ekbal:
'Quis custodiet ipsos custodes?' Who will watch the watchmen? On Detecting AI-generated peer-reviews. 22663-22679 - Assaf Ben-Kish, Moran Yanuka, Morris Alper, Raja Giryes, Hadar Averbuch-Elor:
Mitigating Open-Vocabulary Caption Hallucinations. 22680-22698 - Kosuke Nishida, Kyosuke Nishida, Kuniko Saito:
Initialization of Large Language Models via Reparameterization to Mitigate Loss Spikes. 22699-22714 - Michalis Korakakis, Andreas Vlachos, Adrian Weller:
ALVIN: Active Learning Via INterpolation. 22715-22728 - Tetsuro Morimura, Mitsuki Sakamoto, Yuu Jinnai, Kenshi Abe, Kaito Ariu:
Filtered Direct Preference Optimization. 22729-22770 - Mathew Huerta-Enochian, Seung Ko:
Instruction Fine-Tuning: Does Prompt Loss Matter? 22771-22795 - Tomás Feith, Akhil Arora, Martin Gerlach, Debjit Paul, Robert West:
Entity Insertion in Multilingual Linked Corpora: The Case of Wikipedia. 22796-22819
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.