default search action
ACL 2024: Bangkok, Thailand
- Lun-Wei Ku, Andre Martins, Vivek Srikumar:
Findings of the Association for Computational Linguistics, ACL 2024, Bangkok, Thailand and virtual meeting, August 11-16, 2024. Association for Computational Linguistics 2014, ISBN 979-8-89176-099-8 - Frontmatter.
- Letian Peng, Yuwei Zhang, Jingbo Shang:
Controllable Data Augmentation for Few-Shot Text Mining with Chain-of-Thought Attribute Manipulation. 1-16 - Mingyang Song, Liping Jing, Yi Feng:
Match More, Extract Better! Hybrid Matching Model for Open Domain Web Keyphrase Extraction. 17-27 - Yijia Zhang, Sicheng Zhang, Shijie Cao, Dayou Du, Jianyu Wei, Ting Cao, Ningyi Xu:
AFPQ: Asymmetric Floating Point Quantization for LLMs. 28-36 - Xiaotong Jiang, Zhongqing Wang, Guodong Zhou:
End-to-End Emotion Semantic Parsing. 37-47 - Chen Chen, Ruizhe Li, Yuchen Hu, Yuanyuan Chen, Chengwei Qin, Qiang Zhang:
Overcoming Catastrophic Forgetting by Exemplar Selection in Task-oriented Dialogue System. 48-61 - Hyunsoo Cho:
Unveiling Imitation Learning: Exploring the impact of Data Falsity to Large Language Model. 62-73 - Alex Gu, Wen-Ding Li, Naman Jain, Theo Olausson, Celine Lee, Koushik Sen, Armando Solar-Lezama:
The Counterfeit Conundrum: Can Code Language Models Grasp the Nuances of Their Incorrect Generations? 74-117 - Chao-Chun Hsu, Erin Bransom, Jenna Sparks, Bailey Kuehl, Chenhao Tan, David Wadden, Lucy Lu Wang, Aakanksha Naik:
CHIME: LLM-Assisted Hierarchical Organization of Scientific Studies for Literature Review Support. 118-132 - Hao Li, Yuping Wu, Viktor Schlegel, Riza Batista-Navarro, Tharindu Madusanka, Iqra Zahid, Jiayan Zeng, Xiaochi Wang, Xinran He, Yizhi Li, Goran Nenadic:
Which Side Are You On? A Multi-task Dataset for End-to-End Argument Summarisation and Evaluation. 133-150 - Tahira Naseem, Guangxuan Xu, Sarathkrishna Swaminathan, Asaf Yehudai, Subhajit Chaudhury, Radu Florian, Ramón Fernandez Astudillo, Asim Munawar:
A Grounded Preference Model for LLM Alignment. 151-162 - Bowen Jin, Chulin Xie, Jiawei Zhang, Kashob Kumar Roy, Yu Zhang, Zheng Li, Ruirui Li, Xianfeng Tang, Suhang Wang, Yu Meng, Jiawei Han:
Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs. 163-184 - Yizhu Jiao, Sha Li, Sizhe Zhou, Heng Ji, Jiawei Han:
Text2DB: Integration-Aware Information Extraction with Large Language Model Agents. 185-205 - Zoey Liu, Nitin Venkateswaran, Éric Le Ferrand, Emily Prud'hommeaux:
How Important is a Language Model for Low-resource ASR? 206-213 - Vithursan Thangarasa, Mahmoud Salem, Shreyas Saxena, Chen-Yu Leong, Joel Hestness, Sean Lie:
MediSwift: Efficient Sparse Pre-trained Biomedical Language Models. 214-230 - Chengxu Zhuang, Evelina Fedorenko, Jacob Andreas:
Lexicon-Level Contrastive Visual-Grounding Improves Language Modeling. 231-247 - Shuo Yang, Chenchen Yuan, Yao Rong, Felix Steinbauer, Gjergji Kasneci:
P-TA: Using Proximal Policy Optimization to Enhance Tabular Data Augmentation via Large Language Models. 248-264 - Yuhang Zhou, Wei Ai:
Teaching-Assistant-in-the-Loop: Improving Knowledge Distillation from Imperfect Teacher Models in Low-Budget Scenarios. 265-282 - Canwen Xu, Yichong Xu, Shuohang Wang, Yang Liu, Chenguang Zhu, Julian J. McAuley:
Small Models are Valuable Plug-ins for Large Language Models. 283-294 - Andreas Madsen, Sarath Chandar, Siva Reddy:
Are self-explanations from Large Language Models faithful? 295-337 - Henry Peng Zou, Vinay Samuel, Yue Zhou, Weizhi Zhang, Liancheng Fang, Zihe Song, Philip S. Yu, Cornelia Caragea:
ImplicitAVE: An Open-Source Dataset and Multimodal LLMs Benchmark for Implicit Attribute Value Extraction. 338-354 - Qinyuan Ye, Mohamed Ahmed, Reid Pryzant, Fereshte Khani:
Prompt Engineering a Prompt Engineer. 355-385 - Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Utkarsh Tyagi, S. Sakshi, Sanjoy Chowdhury, Dinesh Manocha:
ASPIRE: Language-Guided Data Augmentation for Improving Robustness Against Spurious Correlations. 386-406 - Naihao Deng, Zhenjie Sun, Ruiqi He, Aman Sikka, Yulong Chen, Lin Ma, Yue Zhang, Rada Mihalcea:
Tables as Texts or Images: Evaluating the Table Reasoning Ability of LLMs and MLLMs. 407-426 - Brooklyn Sheppard, Anna Richter, Allison Cohen, Elizabeth Allyn Smith, Tamara Kneese, Carolyne Pelletier, Ioana Baldini, Yue Dong:
Biasly: An Expert-Annotated Dataset for Subtle Misogyny Detection and Mitigation. 427-452 - Parker Glenn, Parag Dakle, Liang Wang, Preethi Raghavan:
BlendSQL: A Scalable Dialect for Unifying Hybrid Question Answering in Relational Algebra. 453-466 - Zechun Liu, Barlas Oguz, Changsheng Zhao, Ernie Chang, Pierre Stock, Yashar Mehdad, Yangyang Shi, Raghuraman Krishnamoorthi, Vikas Chandra:
LLM-QAT: Data-Free Quantization Aware Training for Large Language Models. 467-484 - Haogeng Liu, Quanzeng You, Yiqi Wang, Xiaotian Han, Bohan Zhai, Yongfei Liu, Wentao Chen, Yiren Jian, Yunzhe Tao, Jianbo Yuan, Ran He, Hongxia Yang:
InfiMM: Advancing Multimodal Understanding with an Open-Sourced Visual Language Model. 485-492 - Xinze Li, Yixin Cao, Liangming Pan, Yubo Ma, Aixin Sun:
Towards Verifiable Generation: A Benchmark for Knowledge-aware Language Model Attribution. 493-516 - Ryan Koo, Minhwa Lee, Vipul Raheja, Jong Inn Park, Zae Myung Kim, Dongyeop Kang:
Benchmarking Cognitive Biases in Large Language Models as Evaluators. 517-545 - Chong Li, Wen Yang, Jiajun Zhang, Jinliang Lu, Shaonan Wang, Chengqing Zong:
X-Instruction: Aligning Language Model in Low-resource Languages with Self-curated Cross-lingual Instructions. 546-566 - Jiashuo Wang, Chunpu Xu, Chak Tou Leong, Wenjie Li, Jing Li:
Muffin: Mitigating Unhelpfulness in Emotional Support Conversations with Multifaceted AI Feedback. 567-585 - Suyuchen Wang, Ivan Kobyzev, Peng Lu, Mehdi Rezagholizadeh, Bang Liu:
Resonance RoPE: Improving Context Length Generalization of Large Language Models. 586-598 - Xiangru Tang, Anni Zou, Zhuosheng Zhang, Ziming Li, Yilun Zhao, Xingyao Zhang, Arman Cohan, Mark Gerstein:
MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning. 599-621 - Yiming Wang, Zhuosheng Zhang, Pei Zhang, Baosong Yang, Rui Wang:
Meta-Reasoning: Semantics-Symbol Deconstruction for Large Language Models. 622-643 - Baohang Zhou, Zezhong Wang, Lingzhi Wang, Hongru Wang, Ying Zhang, Kehui Song, Xuhui Sui, Kam-Fai Wong:
DPDLLM: A Black-box Framework for Detecting Pre-training Data from Large Language Models. 644-653 - Tianci Xue, Ziqi Wang, Yixia Li, Yun Chen, Guanhua Chen:
PACIT: Unlocking the Power of Examples for Better In-Context Instruction Tuning. 654-665 - Yuchen Hu, Chen Chen, Chengwei Qin, Qiushi Zhu, EngSiong Chng, Ruizhe Li:
Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models. 666-679 - Hao Yue, Shaopeng Lai, Chengyi Yang, Liang Zhang, Junfeng Yao, Jinsong Su:
Towards Better Graph-based Cross-document Relation Extraction via Non-bridge Entity Enhancement and Prediction Debiasing. 680-691 - Young-Jun Lee, Dokyong Lee, Joo-Won Sung, Jonghwan Hyeon, Ho-Jin Choi:
Large Language Models can Share Images, Too! 692-713 - Daoguang Zan, Ailun Yu, Wei Liu, Bo Shen, Shaoxin Lin, Yongshun Gong, Yafen Yao, Yan Liu, Bei Guan, Weihua Luo, Yongji Wang, Qianxiang Wang, Lizhen Cui:
CodeM: Less Data Yields More Versatility via Ability Matrix. 714-729 - Kung-Hsiang Huang, Mingyang Zhou, Hou Pong Chan, Yi Fung, Zhenhailong Wang, Lingyu Zhang, Shih-Fu Chang, Heng Ji:
Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning. 730-749 - Jiajie Jin, Yutao Zhu, Yujia Zhou, Zhicheng Dou:
BIDER: Bridging Knowledge Inconsistency for Efficient Retrieval-Augmented LLMs via Key Supporting Evidence. 750-761 - Wenxuan Wang, Yisi Zhang, Xingjian He, Yichen Yan, Zijia Zhao, Xinlong Wang, Jing Liu:
Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with Human Intentions. 762-776 - Shengjie Qiu, Junhao Zheng, Zhen Liu, Yicheng Luo, Qianli Ma:
Incremental Sequence Labeling: A Tale of Two Shifts. 777-791 - Jinxin Liu, Shulin Cao, Jiaxin Shi, Tingjian Zhang, Lunyiu Nie, Linmei Hu, Lei Hou, Juanzi Li:
How Proficient Are Large Language Models in Formal Languages? An In-Depth Insight for Knowledge Base Question Answering. 792-815 - Xuhui Sui, Ying Zhang, Yu Zhao, Kehui Song, Baohang Zhou, Xiaojie Yuan:
MELOV: Multimodal Entity Linking with Optimized Visual Features in Latent Space. 816-826 - Fanyi Qu, Hao Sun, Yunfang Wu:
Unsupervised Distractor Generation via Large Language Model Distilling and Counterfactual Contrastive Decoding. 827-838 - Lihui Liu, Blaine Hill, Boxin Du, Fei Wang, Hanghang Tong:
Conversational Question Answering with Language Models Generated Reformulations over Knowledge Graph. 839-850 - Li Zhong, Zilong Wang, Jingbo Shang:
Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step by Step. 851-870 - Zhongxiang Sun, Kepu Zhang, Haoyu Wang, Xiao Zhang, Jun Xu:
Effective In-Context Example Selection through Data Compression. 871-877 - Yang Chen, Chong Yang, Tu Hu, Xinhao Chen, Man Lan, Li Cai, Xinlin Zhuang, Xuan Lin, Xin Lu, Aimin Zhou:
Are U a Joke Master? Pun Generation via Multi-Stage Curriculum Learning towards a Humor LLM. 878-890 - Yichi Zhang, Zhuo Chen, Yin Fang, Yanxi Lu, Fangming Li, Wen Zhang, Huajun Chen:
Knowledgeable Preference Alignment for LLMs in Domain-specific Question Answering. 891-904 - Minpeng Liao, Chengxi Li, Wei Luo, Jing Wu, Kai Fan:
MARIO: MAth Reasoning with code Interpreter Output - A Reproducible Pipeline. 905-924 - Le Cheng, Shuangyin Li:
DiffusPoll: Conditional Text Diffusion Model for Poll Generation. 925-935 - Haolong Li, Yu Ma, Yinqi Zhang, Chen Ye, Jie Chen:
Exploring Mathematical Extrapolation of Large Language Models with Synthetic Data. 936-946 - Hankun Kang, Tieyun Qian:
Implanting LLM's Knowledge via Reading Comprehension Tree for Toxicity Detection. 947-962 - Zhuoshi Pan, Qianhui Wu, Huiqiang Jiang, Menglin Xia, Xufang Luo, Jue Zhang, Qingwei Lin, Victor Rühle, Yuqing Yang, Chin-Yew Lin, H. Vicky Zhao, Lili Qiu, Dongmei Zhang:
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression. 963-981 - Yue Guo, Yi Yang:
EconNLI: Evaluating Large Language Models on Economics Reasoning. 982-994 - Songda Li, Yunqi Zhang, Chunyuan Deng, Yake Niu, Hui Zhao:
Better Late Than Never: Model-Agnostic Hallucination Post-Processing Framework Towards Clinical Text Summarization. 995-1011 - Haowen Pan, Yixin Cao, Xiaozhi Wang, Xun Yang, Meng Wang:
Finding and Editing Multi-Modal Neurons in Pre-Trained Transformers. 1012-1037 - Tinh Luong, Thanh-Thien Le, Linh Ngo, Thien Nguyen:
Realistic Evaluation of Toxicity in Large Language Models. 1038-1047 - Hanqing Zhang, Si Sun, Haiming Wu, Dawei Song:
Controllable Text Generation with Residual Memory Transformer. 1048-1066 - Renlong Jie, Xiaojun Meng, Lifeng Shang, Xin Jiang, Qun Liu:
Prompt-Based Length Controlled Generation with Multiple Control Types. 1067-1085 - Liang Chen, Yichi Zhang, Shuhuai Ren, Haozhe Zhao, Zefan Cai, Yuchi Wang, Peiyi Wang, Xiangdi Meng, Tianyu Liu, Baobao Chang:
PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain. 1086-1104 - Minjin Kim, Minju Kim, Hana Kim, Beong-woo Kwak, SeongKu Kang, Youngjae Yu, Jinyoung Yeo, Dongha Lee:
Pearl: A Review-driven Persona-Knowledge Grounded Conversational Recommendation Dataset. 1105-1120 - Byung-Kwan Lee, Beomchan Park, Chae Won Kim, Yong Man Ro:
CoLLaVO: Crayon Large Language and Vision mOdel. 1121-1138 - Wen Wu, Wenlin Chen, Chao Zhang, Philip C. Woodland:
Modelling Variability in Human Annotator Simulation. 1139-1157 - Sheikh Shafayat, H. M. Quamran Hasan, Minhajur Rahman Chowdhury Mahim, Rifki Afina Putri, James Thorne, Alice Oh:
BEnQA: A Question Answering Benchmark for Bengali and English. 1158-1177 - Wanqing Cui, Keping Bi, Jiafeng Guo, Xueqi Cheng:
MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning. 1178-1192 - Zhuoran Jin, Pengfei Cao, Hongbang Yuan, Yubo Chen, Jiexin Xu, Huaijun Li, Xiaojian Jiang, Kang Liu, Jun Zhao:
Cutting Off the Head Ends the Conflict: A Mechanism for Interpreting and Mitigating Knowledge Conflicts in Language Models. 1193-1215 - Qizhi Pei, Lijun Wu, Kaiyuan Gao, Xiaozhuan Liang, Yin Fang, Jinhua Zhu, Shufang Xie, Tao Qin, Rui Yan:
BioT5+: Towards Generalized Biological Understanding with IUPAC Integration and Multi-task Tuning. 1216-1240 - Zhihao Wen, Jie Zhang, Yuan Fang:
SIBO: A Simple Booster for Parameter-Efficient Fine-Tuning. 1241-1257 - Jiaxin Zhang, Zhongzhi Li, Ming-Liang Zhang, Fei Yin, Cheng-Lin Liu, Yashar Moshfeghi:
GeoEval: Benchmark for Evaluating LLMs and Multi-Modal Models on Geometry Problem-Solving. 1258-1276 - Jiahao Wang, Wenjun Ke, Peng Wang, Hang Zhang, Dong Nie, Jiajun Liu, Guozheng Li, Ziyu Shang:
Boosting Textural NER with Synthetic Image and Instructive Alignment. 1277-1287 - Elena Voita, Javier Ferrando, Christoforos Nalmpantis:
Neurons in Large Language Models: Dead, N-gram, Positional. 1288-1301 - Jinyuan Li, Han Li, Di Sun, Jiahao Wang, Wenkun Zhang, Zan Wang, Gang Pan:
LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition. 1302-1318 - Napat Laosaengpha, Thanit Tativannarat, Chawan Piansaddhayanon, Attapol Rutherford, Ekapol Chuangsuwanich:
Learning Job Title Representation from Job Description Aggregation Network. 1319-1329 - Shubhankar Singh, Purvi Chaurasia, Yerram Varun, Pranshu Pandya, Vatsal Gupta, Vivek Gupta, Dan Roth:
FlowVQA: Mapping Multimodal Logic in Visual Question Answering with Flowcharts. 1330-1350 - Yahan Yu, Duzhen Zhang, Xiuyi Chen, Chenhui Chu:
Flexible Weight Tuning and Weight Fusion Strategies for Continual Named Entity Recognition. 1351-1358 - Yiming Chen, Chen Zhang, Danqing Luo, Luis Fernando D'Haro, Robby T. Tan, Haizhou Li:
Unveiling the Achilles' Heel of NLG Evaluators: A Unified Adversarial Framework Driven by Large Language Models. 1359-1375 - Adian Liusie, Yassir Fathullah, Mark J. F. Gales:
Teacher-Student Training for Debiasing: General Permutation Debiasing for Large Language Models. 1376-1387 - Chaoxu Pang, Yixuan Cao, Chunhao Yang, Ping Luo:
Uncovering Limitations of Large Language Models in Information Seeking from Tables. 1388-1409 - Shen Zhou, Yongqi Li, Xin Miao, Tieyun Qian:
An Ensemble-of-Experts Framework for Rehearsal-free Continual Relation Extraction. 1410-1423 - Georg Wenzel, Adam Jatowt:
Temporal Validity Change Prediction. 1424-1446 - Saeed Najafi, Alona Fyshe:
RIFF: Learning to Rephrase Inputs for Few-shot Fine-tuning of Language Models. 1447-1466 - Hanane Kteich, Na Li, Usashi Chatterjee, Zied Bouraoui, Steven Schockaert:
Modelling Commonsense Commonalities with Multi-Facet Concept Embeddings. 1467-1480 - Thomas Bonnier:
Revisiting Multimodal Transformers for Tabular Data with Text Fields. 1481-1500 - Jayanta Sadhu, Ayan Antik Khan, Abhik Bhattacharjee, Rifat Shahriyar:
An Empirical Study on the Characteristics of Bias upon Context Length Variation for Bangla. 1501-1520 - Jingcheng Niu, Saifei Liao, Victoria Ng, Simon de Montigny, Gerald Penn:
ConTempo: A Unified Temporally Contrastive Framework for Temporal Relation Extraction. 1521-1533 - Abbas Ghaddar, David Alfonso-Hermelo, Philippe Langlais, Mehdi Rezagholizadeh, Boxing Chen, Prasanna Parthasarathi:
CHARP: Conversation History AwaReness Probing for Knowledge-grounded Dialogue Systems. 1534-1551 - Zicheng Lin, Zhibin Gou, Tian Liang, Ruilin Luo, Haowei Liu, Yujiu Yang:
CriticBench: Benchmarking LLMs for Critique-Correct Reasoning. 1552-1587 - Taolin Zhang, Qizhou Chen, Dongyang Li, Chengyu Wang, Xiaofeng He, Longtao Huang, Hui Xue', Jun Huang:
DAFNet: Dynamic Auxiliary Fusion for Sequential Model Editing in Large Language Models. 1588-1602 - Ashok Urlana, Pruthwik Mishra, Tathagato Roy, Rahul Mishra:
Controllable Text Summarization: Unraveling Challenges, Approaches, and Prospects - A Survey. 1603-1623 - Hengguan Huang, Songtao Wang, Hongfu Liu, Hao Wang, Ye Wang:
Benchmarking Large Language Models on Communicative Medical Coaching: A Dataset and a Novel System. 1624-1637 - Ruomeng Ding, Chaoyun Zhang, Lu Wang, Yong Xu, Minghua Ma, Wei Zhang, Si Qin, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang:
Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation. 1638-1662 - Heidi C. Zhang, Sina J. Semnani, Farhad Ghassemi, Jialiang Xu, Shicheng Liu, Monica S. Lam:
SPAGHETTI: Open-Domain Question Answering from Heterogeneous Data Sources with Retrieval and Semantic Parsing. 1663-1678 - Bosheng Ding, Chengwei Qin, Ruochen Zhao, Tianze Luo, Xinze Li, Guizhen Chen, Wenhan Xia, Junjie Hu, Anh Tuan Luu, Shafiq Joty:
Data Augmentation using LLMs: Data Perspectives, Learning Paradigms and Challenges. 1679-1705 - Abe Bohan Hou, Jingyu Zhang, Yichen Wang, Daniel Khashabi, Tianxing He:
k-SemStamp: A Clustering-Based Semantic Watermark for Detection of Machine-Generated Text. 1706-1715 - Jirayu Burapacheep, Ishan Gaur, Agam Bhatia, Tristan Thrush:
ColorSwap: A Color and Word Order Dataset for Multimodal Evaluation. 1716-1726 - Tuo Zhang, Jinyue Yuan, Salman Avestimehr:
Revisiting OPRO: The Limitations of Small-Scale LLMs as Optimizers. 1727-1735 - Divya Jyoti Bajpai, Manjesh K. Hanawal:
CeeBERT: Cross-Domain Inference in Early Exit BERT. 1736-1748 - Souvik Das, Rohini K. Srihari:
UNIWIZ: A Unified Large Language Model Orchestrated Wizard for Safe Knowledge Grounded Conversations. 1749-1762 - Brian Thompson, Mehak Preet Dhaliwal, Peter Frisch, Tobias Domhan, Marcello Federico:
A Shocking Amount of the Web is Machine Translated: Insights from Multi-Way Parallelism. 1763-1775 - Gabriel Perin, Xuxi Chen, Shusen Liu, Bhavya Kailkhura, Zhangyang Wang, Brian Gallagher:
RankMean: Module-Level Importance Score for Merging Fine-tuned LLM Models. 1776-1782 - Haoyi Qiu, Wenbo Hu, Zi-Yi Dou, Nanyun Peng:
VALOR-EVAL: Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models. 1783-1805 - Xuxin Cheng, Zhihong Zhu, Bang Yang, Xianwei Zhuang, Hongxiang Li, Yuexian Zou:
Cyclical Contrastive Learning Based on Geodesic for Zero-shot Cross-lingual Spoken Language Understanding. 1806-1816 - Zheyuan Liu, Guangyao Dou, Zhaoxuan Tan, Yijun Tian, Meng Jiang:
Towards Safer Large Language Models through Machine Unlearning. 1817-1829 - Mingyu Jin, Qinkai Yu, Dong Shu, Haiyan Zhao, Wenyue Hua, Yanda Meng, Yongfeng Zhang, Mengnan Du:
The Impact of Reasoning Step Length on Large Language Models. 1830-1842 - Guangliang Liu, Milad Afshari, Xitong Zhang, Zhiyu Xue, Avrajit Ghosh, Bidhan Bashyal, Rongrong Wang, Kristen Johnson:
Towards Understanding Task-agnostic Debiasing Through the Lenses of Intrinsic Bias and Forgetfulness. 1843-1856 - Qiqi Wang, Ruofan Wang, Kaiqi Zhao, Robert Amor, Benjamin Liu, Jiamou Liu, Xianda Zheng, Zijian Huang:
SKGSum: Structured Knowledge-Guided Document Summarization. 1857-1871 - Shilin Zhou, Zhenghua Li, Chen Gong, Lei Zhang, Yu Hong, Min Zhang:
Chinese Spoken Named Entity Recognition in Real-world Scenarios: Dataset and Approaches. 1872-1884 - Alex Kim, Keonwoo Kim, Sangwon Yoon:
DEBATE: Devil's Advocate-Based Assessment and Text Evaluation. 1885-1897 - Yixin Yang, Zheng Li, Qingxiu Dong, Heming Xia, Zhifang Sui:
Can Large Multimodal Models Uncover Deep Semantics Behind Images? 1898-1912 - Qiang Gao, Zixiang Meng, Bobo Li, Jun Zhou, Fei Li, Chong Teng, Donghong Ji:
Harvesting Events from Multiple Sources: Towards a Cross-Document Event Extraction Paradigm. 1913-1927 - EunJeong Hwang, Vered Shwartz, Dan Gutfreund, Veronika Thost:
A Graph per Persona: Reasoning about Subjective Natural Language Descriptions. 1928-1942 - Junfeng Fang, Shuai Zhang, Chang Wu, Zhengyi Yang, Zhiyuan Liu, Sihang Li, Kun Wang, Wenjie Du, Xiang Wang:
MolTC: Towards Molecular Relational Modeling In Language Models. 1943-1958 - Di Wu, Da Yin, Kai-Wei Chang:
KPEval: Towards Fine-Grained Semantic-Based Keyphrase Evaluation. 1959-1981 - Jiang Li, Xiangdong Su, Fujun Zhang, Guanglai Gao:
Learning Low-dimensional Multi-domain Knowledge Graph Embedding via Dual Archimedean Spirals. 1982-1994 - Sheng Wang, Liheng Chen, Jiyue Jiang, Boyang Xue, Lingpeng Kong, Chuan Wu:
LoRA Meets Dropout under a Unified Framework. 1995-2008 - Wenxin Mao, Ruiqi Wang, Jiyu Guo, Jichuan Zeng, Cuiyun Gao, Peiyi Han, Chuanyi Liu:
Enhancing Text-to-SQL Parsing through Question Rewriting and Execution-Guided Refinement. 2009-2024 - Shuo Zhang, Liangming Pan, Junzhou Zhao, William Yang Wang:
The Knowledge Alignment Problem: Bridging Human and External Knowledge for Large Language Models. 2025-2038 - Haoran Luo, Haihong E, Zichen Tang, Shiyao Peng, Yikai Guo, Wentai Zhang, Chenghao Ma, Guanting Dong, Meina Song, Wei Lin, Yifan Zhu, Anh Tuan Luu:
ChatKBQA: A Generate-then-Retrieve Framework for Knowledge Base Question Answering with Fine-tuned Large Language Models. 2039-2056 - Yudong Wang, Chang Ma, Qingxiu Dong, Zhifang Sui, Lingpeng Kong, Jingjing Xu:
Achilles-Bench: A Challenging Benchmark for Low-Resource Evaluation. 2057-2080 - Hanbin Wang, Zhenghao Liu, Shuo Wang, Ganqu Cui, Ning Ding, Zhiyuan Liu, Ge Yu:
INTERVENOR: Prompting the Coding Ability of Large Language Models with the Interactive Chain of Repair. 2081-2107 - Hongzhan Chen, Hehong Chen, Ming Yan, Wenshen Xu, Gao Xing, Weizhou Shen, Xiaojun Quan, Chenliang Li, Ji Zhang, Fei Huang:
SocialBench: Sociality Evaluation of Role-Playing Conversational Agents. 2108-2126 - Yongqiang Ma, Lizhi Qing, Jiawei Liu, Yangyang Kang, Yue Zhang, Wei Lu, Xiaozhong Liu, Qikai Cheng:
From Model-centered to Human-Centered: Revision Distance as a Metric for Text Evaluation in LLMs-based Applications. 2127-2137 - Xinnan Guo, Qian Zhu, Qiuhui Shi, Xuan Lin, Liubin Wang, DaqianLi DaqianLi, Yongrui Chen:
Context-Aware Tracking and Dynamic Introduction for Incomplete Utterance Rewriting in Extended Multi-Turn Dialogues. 2138-2148 - Yuyan Chen, Songzhou Yan, Sijia Liu, Yueze Li, Yanghua Xiao:
EmotionQueen: A Benchmark for Evaluating Empathy of Large Language Models. 2149-2176 - Rui Pan, Shuo Xing, Shizhe Diao, Wenhe Sun, Xiang Liu, Kashun Shum, Jipeng Zhang, Renjie Pi, Tong Zhang:
Plum: Prompt Learning using Metaheuristics. 2177-2197 - Yuyan Chen, Songzhou Yan, Qingpei Guo, Jiyuan Jia, Zhixu Li, Yanghua Xiao:
HOTVCOM: Generating Buzzworthy Comments for Videos. 2198-2224 - Yuyan Chen, Yueze Li, Songzhou Yan, Sijia Liu, Jiaqing Liang, Yanghua Xiao:
Do Large Language Models have Problem-Solving Capability under Incomplete Information Scenarios? 2225-2238 - Joe Stacey, Marek Rei:
Distilling Robustness into Natural Language Inference Models with Domain-Targeted Augmentation. 2239-2258 - Tzuf Paz-Argaman, John Palowitch, Sayali Kulkarni, Reut Tsarfaty, Jason Baldridge:
Into the Unknown: Generating Geospatial Descriptions for New Environments. 2259-2273 - Omer Goldman, Avi Caciularu, Matan Eyal, Kris Cao, Idan Szpektor, Reut Tsarfaty:
Unpacking Tokenization: Evaluating Text Compression and its Correlation with Model Performance. 2274-2286 - Jungseob Lee, Hyeonseok Moon, Seungjun Lee, Chanjun Park, Sugyeong Eo, Hyunwoong Ko, Jaehyung Seo, Seungyoon Lee, Heuiseok Lim:
Length-aware Byte Pair Encoding for Mitigating Over-segmentation in Korean Machine Translation. 2287-2303 - Uri Shaham, Jonathan Herzig, Roee Aharoni, Idan Szpektor, Reut Tsarfaty, Matan Eyal:
Multilingual Instruction Tuning With Just a Pinch of Multilinguality. 2304-2317 - Jianlyu Chen, Shitao Xiao, Peitian Zhang, Kun Luo, Defu Lian, Zheng Liu:
M3-Embedding: Multi-Linguality, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation. 2318-2335 - Zhangqian Bi, Yao Wan, Zheng Wang, Hongyu Zhang, Batu Guan, Fangxin Lu, Zili Zhang, Yulei Sui, Hai Jin, Xuanhua Shi:
Iterative Refinement of Project-Level Code Context for Precise Code Generation with Compiler Feedback. 2336-2353 - Chenlong Deng, Zhicheng Dou, Yujia Zhou, Peitian Zhang, Kelong Mao:
An Element is Worth a Thousand Words: Enhancing Legal Case Retrieval by Incorporating Legal Elements. 2354-2365 - Xinnong Zhang, Haoyu Kuang, Xinyi Mou, Hanjia Lyu, Kun Wu, Siming Chen, Jiebo Luo, Xuanjing Huang, Zhongyu Wei:
SoMeLVLM: A Large Vision Language Model for Social Media Processing. 2366-2389 - Jaehyung Seo, Jaewook Lee, Chanjun Park, Seongtae Hong, Seungjun Lee, Heuiseok Lim:
KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models. 2390-2415 - Amit Dhurandhar, Tejaswini Pedapati, Ronny Luss, Soham Dan, Aurélie C. Lozano, Payel Das, Georgios Kollias:
NeuroPrune: A Neuro-inspired Topological Sparse Training Algorithm for Large Language Models. 2416-2430 - Amit Dhurandhar, Rahul Nair, Moninder Singh, Elizabeth Daly, Karthikeyan Natesan Ramamurthy:
Ranking Large Language Models without Ground Truth. 2431-2452 - Chengfeng Dou, Ying Zhang, Zhi Jin, Wenpin Jiao, Haiyan Zhao, Yongqiang Zhao, Zhengwei Tao:
Integrating Physician Diagnostic Logic into Large Language Models: Preference Learning from Process Feedback. 2453-2473 - Shitao Xiao, Zheng Liu, Peitian Zhang, Xingrun Xing:
LM-Cocktail: Resilient Tuning of Language Models via Model Merging. 2474-2488 - Xin Miao, Yongqi Li, Shen Zhou, Tieyun Qian:
Episodic Memory Retrieval from LLMs: A Neuromorphic Mechanism to Generate Commonsense Counterfactuals for Relation Extraction. 2489-2511 - Nedjma Ousidhoum, Shamsuddeen Hassan Muhammad, Mohamed Abdalla, Idris Abdulmumin, Ibrahim Said Ahmad, Sanchit Ahuja, Alham Fikri Aji, Vladimir Araujo, Abinew Ali Ayele, Pavan Baswani, Meriem Beloucif, Chris Biemann, Sofia Bourhim, Christine de Kock, Genet Shanko Dekebo, Oumaima Hourrane, Gopichand Kanumolu, Lokesh Madasu, Samuel Rutunda, Manish Shrivastava, Thamar Solorio, Nirmal Surange, Hailegnaw Getaneh Tilaye, Krishnapriya Vishnubhotla, Genta Indra Winata, Seid Muhie Yimam, Saif M. Mohammad:
SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 13 Languages. 2512-2530 - Haihui Yang, Xiaojun Quan:
Alirector: Alignment-Enhanced Chinese Grammatical Error Corrector. 2531-2546 - Tuna Alikasifoglu, Arda C. Aras, Aykut Koç:
VISPool: Enhancing Transformer Encoders with Vector Visibility Graph Neural Networks. 2547-2556 - Krishnapriya Vishnubhotla, Adam Hammond, Graeme Hirst, Saif M. Mohammad:
The Emotion Dynamics of Literary Novels. 2557-2574 - Peiran Yao, Denilson Barbosa:
Accurate and Nuanced Open-QA Evaluation Through Textual Entailment. 2575-2587 - Antonios Dimakis, Stella Markantonatou, Antonios Anastasopoulos:
Dictionary-Aided Translation for Handling Multi-Word Expressions in Low-Resource Languages. 2588-2595 - Zhongzhi Li, Ming-Liang Zhang, Fei Yin, Cheng-Lin Liu:
LANS: A Layout-Aware Neural Solver for Plane Geometry Problem. 2596-2608 - Wenxuan Ding, Shangbin Feng, Yuhan Liu, Zhaoxuan Tan, Vidhisha Balachandran, Tianxing He, Yulia Tsvetkov:
Knowledge Crosswords: Geometric Knowledge Reasoning with Large Language Models. 2609-2636 - Herun Wan, Shangbin Feng, Zhaoxuan Tan, Heng Wang, Yulia Tsvetkov, Minnan Luo:
DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection. 2637-2667 - Lingfeng Shen, Weiting Tan, Sihao Chen, Yunmo Chen, Jingyu Zhang, Haoran Xu, Boyuan Zheng, Philipp Koehn, Daniel Khashabi:
The Language Barrier: Dissecting Safety Challenges of LLMs in Multilingual Contexts. 2668-2680 - Junmo Kang, Hongyin Luo, Yada Zhu, Jacob A. Hansen, James R. Glass, David D. Cox, Alan Ritter, Rogério Feris, Leonid Karlinsky:
Self-Specialization: Uncovering Latent Expertise within Large Language Models. 2681-2706 - Fred Xu, Song Jiang, Zijie Huang, Xiao Luo, Shichang Zhang, Yuanzhou Chen, Yizhou Sun:
FUSE: Measure-Theoretic Compact Fuzzy Set Representation for Taxonomy Expansion. 2707-2720 - Sergio Servantez, Joe Barrow, Kristian J. Hammond, Rajiv Jain:
Chain of Logic: Rule-Based Reasoning with Large Language Models. 2721-2733 - Cheng-Han Chiang, Hung-yi Lee:
Merging Facts, Crafting Fallacies: Evaluating the Contradictory Nature of Aggregated Factual Claims in Long-Form Generations. 2734-2751 - William Merrill, Zhaofeng Wu, Norihito Naka, Yoon Kim, Tal Linzen:
Can You Learn Semantics Through Next-Word Prediction? The Case of Entailment. 2752-2773 - Weicheng Ma, Chunyuan Deng, Aram Moossavi, Lili Wang, Soroush Vosoughi, Diyi Yang:
Simulated Misinformation Susceptibility (SMISTS): Enhancing Misinformation Research with Large Language Model Simulations. 2774-2788 - Minzhi Li, Weiyan Shi, Caleb Ziems, Diyi Yang:
Social Intelligence Data Infrastructure: Structuring the Present and Navigating the Future. 2789-2805 - Hongyi Zhang, Zuchao Li, Ping Wang, Hai Zhao:
Selective Prefix Tuning for Pre-trained Language Models. 2806-2813 - Xiaobo Guo, Soroush Vosoughi:
MODABS: Multi-Objective Learning for Dynamic Aspect-Based Summarization. 2814-2827 - Jianing Zhou, Suma Bhat:
Non-compositional Expression Generation and its Continual Learning. 2828-2839 - Xiaoming Shi, Zeming Liu, Li Du, Yuxuan Wang, Hongru Wang, Yuhang Guo, Tong Ruan, Jie Xu, Xiaofan Zhang, Shaoting Zhang:
Medical Dialogue System: A Survey of Categories, Methods, Evaluation and Challenges. 2840-2861 - Thi Nguyen, Linhao Luo, Fatemeh Shiri, Dinh Phung, Yuan-Fang Li, Thuy-Trang Vu, Gholamreza Haffari:
Direct Evaluation of Chain-of-Thought in Multi-hop Reasoning with Knowledge Graphs. 2862-2883 - Longyin Zhang, Bowei Zou, Jacintha Yi, AiTi Aw:
Comprehensive Abstractive Comment Summarization with Dynamic Clustering and Chain of Thought. 2884-2896 - Zhongkun Liu, Zheng Chen, Mengqi Zhang, Zhaochun Ren, Pengjie Ren, Zhumin Chen:
Self-Supervised Position Debiasing for Large Language Models. 2897-2917 - Yuhuan Lu, Weijian Yu, Xin Jing, Dingqi Yang:
HyperCL: A Contrastive Learning Framework for Hyper-Relational Knowledge Graph Embedding with Hierarchical Ontology. 2918-2929 - Songtao Liu, Bang Wang, Wei Xiang, Han Xu, Minghua Xu:
Encoding Hierarchical Schema via Concept Flow for Multifaceted Ideology Detection. 2930-2942 - Yang Hou, Zhenghua Li:
Character-Level Chinese Dependency Parsing via Modeling Latent Intra-Word Structure. 2943-2956 - Zehan Li, Fu Zhang, Jingwei Cheng:
AlignRE: An Encoding and Semantic Alignment Approach for Zero-Shot Relation Extraction. 2957-2966 - Tingchen Fu, Deng Cai, Lemao Liu, Shuming Shi, Rui Yan:
Disperse-Then-Merge: Pushing the Limits of Instruction Tuning via Alignment Tax Reduction. 2967-2985 - Zhouyu Jiang, Ling Zhong, Mengshu Sun, Jun Xu, Rui Sun, Hui Cai, Shuhan Luo, Zhiqiang Zhang:
Efficient Knowledge Infusion via KG-LLM Alignment. 2986-2999 - Dahyun Jung, Sugyeong Eo, Heuiseok Lim:
Towards Precise Localization of Critical Errors in Machine Translation. 3000-3012 - Mingyang Zhang, Hao Chen, Chunhua Shen, Zhen Yang, Linlin Ou, Xinyi Yu, Bohan Zhuang:
LoRAPrune: Structured Pruning Meets Low-Rank Parameter-Efficient Fine-Tuning. 3013-3026 - Jiahao Liu, Qifan Wang, Jingang Wang, Xunliang Cai:
Speculative Decoding via Early-exiting for Faster LLM Inference with Thompson Sampling Control Mechanism. 3027-3043 - Yumeng Liu, Zhenghua Li, Haochen Jiang, Bo Zhang, Chen Li, Ji Zhang:
Towards Better Utilization of Multi-Reference Training Data for Chinese Grammatical Error Correction. 3044-3052 - Aohan Zeng, Mingdao Liu, Rui Lu, Bowen Wang, Xiao Liu, Yuxiao Dong, Jie Tang:
AgentTuning: Enabling Generalized Agent Abilities for LLMs. 3053-3077 - Tianlai Ma, Zhongqing Wang, Guodong Zhou:
Transition-based Opinion Generation for Aspect-based Sentiment Analysis. 3078-3087 - Xiaobao Wu, Xinshuai Dong, Liangming Pan, Thong Nguyen, Anh Tuan Luu:
Modeling Dynamic Topics in Chain-Free Fashion by Evolution-Tracking Contrastive Learning and Unassociated Word Exclusion. 3088-3105 - Yuxia Wang, Zenan Zhai, Haonan Li, Xudong Han, Shom Lin, Zhenxuan Zhang, Angela Zhao, Preslav Nakov, Timothy Baldwin:
A Chinese Dataset for Evaluating the Safeguards in Large Language Models. 3106-3119 - Meiyun Wang, Kiyoshi Izumi, Hiroki Sakaji:
LLMFactor: Extracting Profitable Factors through Prompts for Explainable Stock Movement Prediction. 3120-3131 - Zhuosheng Zhang, Aston Zhang:
You Only Look at Screens: Multimodal Chain-of-Action Agents. 3132-3149 - Yuxuan Hu, Jing Zhang, Zhe Zhao, Chen Zhao, Xiaodong Chen, Cuiping Li, Hong Chen:
SP³: Enhancing Structured Pruning via PCA Projection. 3150-3170 - Sangwon Park, Hongseok Choi, Dongha Choi, Hyunju Lee:
GENDEX: Generative Data Augmentation Strategy Leveraging External Data for Abstractive Dialogue Summarization. 3171-3185 - Boaz Carmeli, Yonatan Belinkov, Ron Meir:
Concept-Best-Matching: Evaluating Compositionality In Emergent Communication. 3186-3194 - T. Y. S. S. Santosh, Natwar Modani, Apoorv Saxena:
A Tale of Two Revisions: Summarizing Changes Across Document Versions. 3195-3211 - Guixin Su, Mingmin Wu, Zhongqiang Huang, Yongcheng Zhang, Tongguan Wang, Yuxue Hu, Ying Sha:
Refine, Align, and Aggregate: Multi-view Linguistic Features Enhancement for Aspect Sentiment Triplet Extraction. 3212-3228 - Yingjie Li, Yue Zhang:
Pro-Woman, Anti-Man? Identifying Gender Bias in Stance Detection. 3229-3236 - Masanari Ohi, Masahiro Kaneko, Ryuto Koike, Mengsay Loem, Naoaki Okazaki:
Likelihood-based Mitigation of Evaluation Bias in Large Language Models. 3237-3245 - Jiajia Li, Lu Yang, Mingni Tang, Chenchong Chenchong, Zuchao Li, Ping Wang, Hai Zhao:
The Music Maestro or The Musically Challenged, A Massive Music Evaluation Benchmark for Large Language Models. 3246-3257 - Dongjie Yang, Xiaodong Han, Yan Gao, Yao Hu, Shilin Zhang, Hai Zhao:
PyramidInfer: Pyramid KV Cache Compression for High-throughput LLM Inference. 3258-3270 - Weiqi Wu, Hongqiu Wu, Lai Jiang, Xingyuan Liu, Hai Zhao, Min Zhang:
From Role-Play to Drama-Interaction: An LLM Solution. 3271-3290 - Jaewoo Ahn, Taehyun Lee, Junyoung Lim, Jin-Hwa Kim, Sangdoo Yun, Hwaran Lee, Gunhee Kim:
TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models. 3291-3325 - Mukai Li, Lei Li, Yuwei Yin, Masood Ahmed, Zhenguang Liu, Qi Liu:
Red Teaming Visual Language Models. 3326-3342 - Jingyuan Yang, Dapeng Chen, Yajing Sun, Rongjun Li, Zhiyong Feng, Wei Peng:
Enhancing Semantic Consistency of Large Language Models through Model Editing: An Interpretability-Oriented Approach. 3343-3353 - Sangwoo Shin, Seunghyun Kim, Youngsoo Jang, Moontae Lee, Honguk Woo:
Semantic Skill Grounding for Embodied Instruction-Following in Cross-Domain Environments. 3354-3376 - Mingye Zhu, Yi Liu, Lei Zhang, Junbo Guo, Zhendong Mao:
LIRE: listwise reward enhancement for preference alignment. 3377-3394 - Minjung Kim, Hyung Lim, Seung Hwan Kim, Soonyoung Lee, Bumsoo Kim, Gunhee Kim:
See It All: Contextualized Late Aggregation for 3D Dense Captioning. 3395-3405 - Haishuo Fang, Xiaodan Zhu, Iryna Gurevych:
DARA: Decomposition-Alignment-Reasoning Autonomous Language Agent for Question Answering over Knowledge Graphs. 3406-3432 - Yao Yao, Zuchao Li, Hai Zhao:
GKT: A Novel Guidance-Based Knowledge Transfer Framework For Efficient Cloud-edge Collaboration LLM Deployment. 3433-3446 - Sondre Wold, Étienne Simon, Lucas Georges Gabriel Charpentier, Egor V. Kostylev, Erik Velldal, Lilja Øvrelid:
Compositional Generalization with Grounded Language Models. 3447-3460 - Yuyang Ding, Juntao Li, Pinzheng Wang, Zecheng Tang, Yan Bowen, Min Zhang:
Rethinking Negative Instances for Generative Named Entity Recognition. 3461-3475 - Chenhui Hu, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao:
WilKE: Wise-Layer Knowledge Editor for Lifelong Knowledge Editing. 3476-3503 - Jialong Wu, Linhai Zhang, Deyu Zhou, Guoqiang Xu:
DINER: Debiasing Aspect-based Sentiment Analysis with Multi-variable Causal Inference. 3504-3518 - Linhai Zhang, Jialong Wu, Deyu Zhou, Guoqiang Xu:
STAR: Constraint LoRA with Dynamic Active Learning for Data-Efficient Fine-Tuning of Large Language Models. 3519-3532 - Yu Wang, Yang Xu, Gabriel Skantze, Hendrik Buschmeier:
How Much Does Nonverbal Communication Conform to Entropy Rate Constancy?: A Case Study on Listener Gaze in Interaction. 3533-3545 - Xu Huang, Zhirui Zhang, Xiang Geng, Yichao Du, Jiajun Chen, Shujian Huang:
Lost in the Source Language: How Large Language Models Evaluate the Quality of Machine Translation. 3546-3562 - Shehzaad Dhuliawala, Mojtaba Komeili, Jing Xu, Roberta Raileanu, Xian Li, Asli Celikyilmaz, Jason Weston:
Chain-of-Verification Reduces Hallucination in Large Language Models. 3563-3578 - Tian Xia, Zhiwei He, Tong Ren, Yibo Miao, Zhuosheng Zhang, Yang Yang, Rui Wang:
Measuring Bargaining Abilities of LLMs: A Benchmark and A Buyer-Enhancement Method. 3579-3602 - Jia Li, Ge Li, Yunfei Zhao, Yongmin Li, Huanyu Liu, Hao Zhu, Lecheng Wang, Kaibo Liu, Zheng Fang, Lanshen Wang, Jiazheng Ding, Xuanming Zhang, Yuqi Zhu, Yihong Dong, Zhi Jin, Binhua Li, Fei Huang, Yongbin Li, Bin Gu, Mengfei Yang:
DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories. 3603-3614 - Baolong Bi, Shenghua Liu, Yiwei Wang, Lingrui Mei, Xueqi Cheng:
LPNL: Scalable Link Prediction with Large Language Models. 3615-3625 - Kevin Heffernan, Artyom Kozhevnikov, Loïc Barrault, Alexandre Mourachko, Holger Schwenk:
Aligning Speech Segments Beyond Pure Semantics. 3626-3635 - Thong Nguyen, Yi Bin, Junbin Xiao, Leigang Qu, Yicong Li, Jay Zhangjie Wu, Cong-Duy Nguyen, See-Kiong Ng, Anh Tuan Luu:
Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives. 3636-3657 - Keyu Ding, Yongcan Wang, Zihang Xu, Zhenzhen Jia, Enhong Chen:
Generative Input: Towards Next-Generation Input Methods Paradigm. 3658-3669 - Wei Tang, Yixin Cao, Jiahao Ying, Bo Wang, Yuyue Zhao, Yong Liao, Peng Zhou:
A + B: A General Generator-Reader Framework for Optimizing LLMs to Unleash Synergy Potential. 3670-3685 - Hung To, Minh Nguyen, Nghi Bui:
Functional Overlap Reranking for Neural Code Generation. 3686-3704 - Pengyu Cheng, Yifan Yang, Jian Li, Yong Dai, Tianhao Hu, Peixin Cao, Nan Du, Xiaolong Li:
Adversarial Preference Optimization: Enhancing Your Alignment via RM-LLM Game. 3705-3716 - Linan Zhu, Xiangfan Chen, Xiaolei Guo, Chenwei Zhang, Zhechao Zhu, Zehai Zhou, Xiangjie Kong:
Pinpointing Diffusion Grid Noise to Enhance Aspect Sentiment Quad Prediction. 3717-3726 - Umberto Cappellazzo, Enrico Fini, Muqiao Yang, Daniele Falavigna, Alessio Brutti, Bhiksha Raj:
Continual Contrastive Spoken Language Understanding. 3727-3741 - Kai Wang, Yuwei Xu, Zhiyong Wu, Siqiang Luo:
LLM as Prompter: Low-resource Inductive Reasoning on Arbitrary Knowledge Graphs. 3742-3759 - Junjie Chen, Xiangheng He, Danushka Bollegala, Yusuke Miyao:
Unsupervised Parsing by Searching for Frequent Word Sequences among Sentences with Equivalent Predicate-Argument Structures. 3760-3772 - Yingji Li, Mengnan Du, Rui Song, Xin Wang, Ying Wang:
Data-Centric Explainable Debiasing for Improving Fairness in Pre-trained Language Models. 3773-3786 - Monika Jain, Raghava Mutharaju, Kuldeep Singh, Ramakanth Kavuluru:
Knowledge-Driven Cross-Document Relation Extraction. 3787-3797 - Wen Chang, Yun-Nung Chen:
Injecting Salesperson's Dialogue Strategies in Large Language Models with Chain-of-Thought Reasoning. 3798-3812 - Shiyu Tian, Yangyang Luo, Tianze Xu, Caixia Yuan, Huixing Jiang, Chen Wei, Xiaojie Wang:
KG-Adapter: Enabling Knowledge Graph Integration in Large Language Models through Parameter-Efficient Fine-Tuning. 3813-3828 - Lei Lin, Jia-Yi Fu, Pengli Liu, Qingyang Li, Yan Gong, Junchen Wan, Fuzheng Zhang, Zhongyuan Wang, Di Zhang, Kun Gai:
Just Ask One More Time! Self-Agreement Improves Reasoning of Language Models in (Almost) All Scenarios. 3829-3852 - Pragya Srivastava, Manuj Malik, Vivek Gupta, Tanuja Ganu, Dan Roth:
Evaluating LLMs' Mathematical Reasoning in Financial Document Question Answering. 3853-3878 - Hongling Xu, Qianlong Wang, Yice Zhang, Min Yang, Xi Zeng, Bing Qin, Ruifeng Xu:
Improving In-Context Learning with Prediction Feedback for Sentiment Analysis. 3879-3890 - Zhiwei Li, Ran Song, Caihong Sun, Wei Xu, Zhengtao Yu, Ji-Rong Wen:
Can Large Language Models Mine Interpretable Financial Factors More Effectively? A Neural-Symbolic Factor Mining Agent Model. 3891-3902 - Xiaowei Yuan, Zhao Yang, Yequan Wang, Shengping Liu, Jun Zhao, Kang Liu:
Discerning and Resolving Knowledge Conflicts through Adaptive Decoding with Contextual Information-Entropy Constraint. 3903-3922 - Lijun Li, Bowen Dong, Ruohui Wang, Xuhao Hu, Wangmeng Zuo, Dahua Lin, Yu Qiao, Jing Shao:
SALAD-Bench: A Hierarchical and Comprehensive Safety Benchmark for Large Language Models. 3923-3954 - Pablo Messina, René Vidal, Denis Parra, Alvaro Soto, Vladimir Araujo:
Extracting and Encoding: Leveraging Large Language Models and Medical Knowledge to Enhance Radiological Text Representation. 3955-3986 - Shuzhou Yuan, Ercong Nie, Michael Färber, Helmut Schmid, Hinrich Schütze:
GNNavi: Navigating the Information Flow in Large Language Models by Graph Neural Network. 3987-4001 - Anand Subramanian, Viktor Schlegel, Abhinav Ramesh Kashyap, Thanh-Tung Nguyen, Vijay Prakash Dwivedi, Stefan Winkler:
M-QALM: A Benchmark to Assess Clinical Reading Comprehension and Knowledge Recall in Large Language Models via Question Answering. 4002-4042 - Rohit Saxena, Frank Keller:
MovieSum: An Abstractive Summarization Dataset for Movie Screenplays. 4043-4050 - Jiahuan Pei, Irene Viola, Haochen Huang, Junxiao Wang, Moonisa Ahsan, Fanghua Ye, Jiang Yiming, Yao Sai, Di Wang, Zhumin Chen, Pengjie Ren, Pablo César:
Autonomous Workflow for Multimodal Fine-Grained Training Assistants Towards Mixed Reality. 4051-4066 - Faye Holt, William Held, Diyi Yang:
Perceptions of Language Technology Failures from South Asian English Speakers. 4067-4081 - Jannik Brinkmann, Abhay Sheshadri, Victor Levoso, Paul Swoboda, Christian Bartelt:
A Mechanistic Analysis of a Transformer Trained on a Symbolic Multi-Step Reasoning Task. 4082-4102 - Zefeng Zhang, Jiawei Sheng, Chuang Zhang, Liangyunzhi Liangyunzhi, Wenyuan Zhang, Siqi Wang, Tingwen Liu:
Optimal Transport Guided Correlation Assignment for Multimodal Entity Linking. 4103-4117 - Anej Svete, Robin Chan, Ryan Cotterell:
On Efficiently Representing Regular Languages as RNNs. 4118-4135 - Ines Reinig, Maria Becker, Ines Rehbein, Simone Paolo Ponzetto:
A Survey on Modelling Morality for Text Analysis. 4136-4155 - Ruibo Chen, Yihan Wu, Lichang Chen, Guodong Liu, Qi He, Tianyi Xiong, Chenxi Liu, Junfeng Guo, Heng Huang:
Your Vision-Language Model Itself Is a Strong Filter: Towards High-Quality Instruction Tuning with Data Selection. 4156-4172 - Runchu Tian, Yining Ye, Yujia Qin, Xin Cong, Yankai Lin, Yinxu Pan, Yesai Wu, Haotian Hui, Weichuan Liu, Zhiyuan Liu, Maosong Sun:
DebugBench: Evaluating Debugging Capability of Large Language Models. 4173-4198 - Zhihan Zhou, Xue Gu, Yujie Zhao, Hao Xu:
POP-CEE: Position-oriented Prompt-tuning Model for Causal Emotion Entailment. 4199-4210 - Linhan Li, Huaping Zhang:
Context Length Extension via Generalized Extrapolation Scale. 4211-4218 - Julian Eisenschlos, Hernán Maina, Guido Ivetta, Luciana Benotti:
Selectively Answering Visual Questions. 4219-4229 - Huadai Liu, Rongjie Huang, Jinzheng He, Gang Sun, Ran Shen, Xize Cheng, Zhou Zhao:
Wav2SQL: Direct Generalizable Speech-To-SQL Parsing. 4230-4242 - Jiaheng Liu, ZhiqiBai ZhiqiBai, Yuanxing Zhang, Chenchen Zhang, YuangZh YuangZh, Ge Zhang, JiakaiWang JiakaiWang, Haoran Que, Yukang Chen, Wenbo Su, Tiezheng Ge, Jie Fu, Wenhu Chen, Bo Zheng:
E2-LLM: Efficient and Extreme Length Extension of Large Language Models. 4243-4253 - Da Ju, Karen Ullrich, Adina Williams:
Are Female Carpenters like Blue Bananas? A Corpus Investigation of Occupation Gender Typicality. 4254-4274 - Sitao Cheng, Ziyuan Zhuang, Yong Xu, Fangkai Yang, Chaoyun Zhang, Xiaoting Qin, Xiang Huang, Ling Chen, Qingwei Lin, Dongmei Zhang, Saravan Rajmohan, Qi Zhang:
Call Me When Necessary: LLMs can Efficiently and Faithfully Reason over Structured Environments. 4275-4295 - Shubham Kumar Nigam, Anurag Sharma, Danush Khanna, Noel Shallum, Kripabandhu Ghosh, Arnab Bhattacharya:
Legal Judgment Reimagined: PredEx and the Rise of Intelligent AI Interpretation in Indian Courts. 4296-4315 - Xiaojuan Tang, Song-Chun Zhu, Yitao Liang, Muhan Zhang:
RulE: Knowledge Graph Reasoning with Rule Embedding. 4316-4335 - Dang Nguyen, Jiuhai Chen, Tianyi Zhou:
Multi-Objective Linguistic Control of Large Language Models. 4336-4347 - Shang Zhou, Feng Yao, Chengyu Dong, Zihan Wang, Jingbo Shang:
Evaluating the Smooth Control of Attribute Intensity in Text Generation with LLMs. 4348-4362 - Shijue Huang, Wanjun Zhong, Jianqiao Lu, Qi Zhu, Jiahui Gao, Weiwen Liu, Yutai Hou, Xingshan Zeng, Yasheng Wang, Lifeng Shang, Xin Jiang, Ruifeng Xu, Qun Liu:
Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios. 4363-4400 - Sky CH-Wang, Benjamin Van Durme, Jason Eisner, Chris Kedzie:
Do Androids Know They're Only Dreaming of Electric Sheep? 4401-4420 - Bo Lv, Chen Tang, Yanan Zhang, Xin Liu, Ping Luo, Yue Yu:
URG: A Unified Ranking and Generation Method for Ensembling Language Models. 4421-4434 - Aditya Gourav, Jari Kolehmainen, Prashanth Gurunath Shivakumar, Yile Gu, Grant P. Strimel, Ankur Gandhe, Ariya Rastrow, Ivan Bulyko:
Multi-Modal Retrieval For Large Language Model Based Speech Recognition. 4435-4446 - Ziyu Zhao, Leilei Gan, Guoyin Wang, Wangchunshu Zhou, Hongxia Yang, Kun Kuang, Fei Wu:
LoraRetriever: Input-Aware LoRA Retrieval and Composition for Mixed Tasks in the Wild. 4447-4462 - Yifei Zhang, Bo Pan, Chen Ling, Yuntong Hu, Liang Zhao:
ELAD: Explanation-Guided Large Language Models Active Distillation. 4463-4475 - Carolin Holtermann, Paul Röttger, Timm Dill, Anne Lauscher:
Evaluating the Elementary Multilingual Capabilities of Large Language Models with MultiQ. 4476-4494 - Jacob Matthews, John Starr, Marten van Schijndel:
Semantics or spelling? Probing contextual word embeddings with orthographic noise. 4495-4504 - Shenglai Zeng, Jiankun Zhang, Pengfei He, Yiding Liu, Yue Xing, Han Xu, Jie Ren, Yi Chang, Shuaiqiang Wang, Dawei Yin, Jiliang Tang:
The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG). 4505-4524 - Jocelyn Shen, Yubin Kim, Mohit Hulse, Wazeer Zulfikar, Sharifa Alghowinem, Cynthia Breazeal, Hae Park:
EmpathicStories++: A Multimodal Dataset for Empathy Towards Personal Experiences. 4525-4536 - Shaltiel Shmidman, Avi Shmidman, Moshe Koppel, Reut Tsarfaty:
MRL Parsing Without Tears: The Case of Hebrew. 4537-4550 - Kenza Amara, Rita Sevastjanova, Mennatallah El-Assady:
SyntaxShap: Syntax-aware Explainability Method for Text Generation. 4551-4566 - Mukund Srinath, Pranav Narayanan Venkit, Maria Badillo, Florian Schaub, C. Lee Giles, Shomir Wilson:
Automated Detection and Analysis of Data Practices Using A Real-World Corpus. 4567-4574 - Xiran Fan, Minghua Xu, Huiyuan Chen, Yuzhong Chen, Mahashweta Das, Hao Yang:
Enhancing Hyperbolic Knowledge Graph Embeddings via Lorentz Transformations. 4575-4589 - Andrea Burns, Kate Saenko, Bryan A. Plummer:
Tell Me What's Next: Textual Foresight for Generic UI Representations. 4590-4611 - Iqra Zahid, Tharindu Madusanka, Riza Batista-Navarro, Youcheng Sun:
Probing the Uniquely Identifiable Linguistic Patterns of Conversational AI Agents. 4612-4628 - Abel Salinas, Fred Morstatter:
The Butterfly Effect of Altering Prompts: How Small Changes and Jailbreaks Affect Large Language Model Performance. 4629-4651 - Hanzi Xu, Muhao Chen, Lifu Huang, Slobodan Vucetic, Wenpeng Yin:
X-Shot: A Unified System to Handle Frequent, Few-shot and Zero-shot Learning Simultaneously in Classification. 4652-4665 - Difan Jiao, Yilun Liu, Zhenwei Tang, Daniel Matter, Jürgen Pfeffer, Ashton Anderson:
SPIN: Sparsifying and Integrating Internal Neurons in Large Language Models for Text Classification. 4666-4682 - Akihiro Maeda, Takuma Torii, Shohei Hidaka:
Decomposing Co-occurrence Matrices into Interpretable Components as Formal Concepts. 4683-4700 - Ziyu Yang, Santhosh Cherian, Slobodan Vucetic:
Two-Pronged Human Evaluation of ChatGPT Self-Correction in Radiology Report Simplification. 4701-4714 - Kunze Li, Yu Zhang:
Planning First, Question Second: An LLM-Guided Method for Controllable Question Generation. 4715-4729 - Yanming Liu, Xinyue Peng, Xuhong Zhang, Weihao Liu, Jianwei Yin, Jiannan Cao, Tianyu Du:
RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback. 4730-4749 - Danupat Khamnuansin, Tawunrat Chalothorn, Ekapol Chuangsuwanich:
MrRank: Improving Question Answering Retrieval System through Multi-Result Ranking Model. 4750-4762 - Yixing Peng, Quan Wang, Licheng Zhang, Yi Liu, Zhendong Mao:
Chain-of-Question: A Progressive Question Decomposition Approach for Complex Knowledge Base Question Answering. 4763-4776 - Guangmin Zheng, Jin Wang, Liang-Chih Yu, Xuejie Zhang:
Instruction Tuning with Retrieval-based Examples Ranking for Aspect-based Sentiment Analysis. 4777-4788 - Xinyi Mou, Zhongyu Wei, Xuanjing Huang:
Unveiling the Truth and Facilitating Change: Towards Agent-based Large-scale Social Movement Simulation. 4789-4809 - Hiroshi Kanayama, Yang Zhao, Ran Iwamoto, Takuya Ohko:
Incorporating Syntax and Lexical Knowledge to Multilingual Sentiment Classification on Large Language Models. 4810-4817 - Zijian Wang, Britney White, Chang Xu:
Locating and Extracting Relational Concepts in Large Language Models. 4818-4832 - Mingda Li, Xinyu Li, Yifan Chen, Wenfeng Xuan, Weinan Zhang:
Unraveling and Mitigating Retriever Inconsistencies in Retrieval-Augmented Large Language Models. 4833-4850 - Xulang Zhang, Rui Mao, Erik Cambria:
SenticVec: Toward Robust and Human-Centric Neurosymbolic Sentiment Analysis. 4851-4863 - Chen Qian, Jie Zhang, Wei Yao, Dongrui Liu, Zhenfei Yin, Yu Qiao, Yong Liu, Jing Shao:
Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period of Large Language Models. 4864-4888 - Tingyu Xia, Bowen Yu, Yuan Wu, Yi Chang, Chang Zhou:
Language Models can Evaluate Themselves via Probability Discrepancy. 4889-4901 - Huichi Zhou, Zhaoyang Wang, Hongtao Wang, Dongping Chen, Wenhan Mu, Fangyuan Zhang:
Evaluating the Validity of Word-level Adversarial Attacks with Large Language Models. 4902-4922 - Mengjie Zhao, Junya Ono, Zhi Zhong, Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Wei-Hsiang Liao, Takashi Shibuya, Hiromi Wakaki, Yuki Mitsufuji:
On the Language Encoder of Contrastive Cross-modal Models. 4923-4940 - Guande Wu, Chen Zhao, Cláudio T. Silva, He He:
Your Co-Workers Matter: Evaluating Collaborative Capabilities of Language Models in Blocks World. 4941-4957 - Jianhui Pang, Fanghua Ye, Derek F. Wong, Xin He, Wanshun Chen, Longyue Wang:
Anchor-based Large Language Models. 4958-4976 - Dexuan Xu, Yanyuan Chen, Jieyi Wang, Yue Huang, Hanpin Wang, Zhi Jin, Hongxing Wang, Weihua Yue, Jing He, Hang Li, Yu Huang:
MLeVLM: Improve Multi-level Progressive Capabilities based on Multimodal Large Language Model for Medical Visual Question Answering. 4977-4997 - Ryan Park, Rafael Rafailov, Stefano Ermon, Chelsea Finn:
Disentangling Length from Quality in Direct Preference Optimization. 4998-5017 - Jiaqi Li, Miaozeng Du, Chuanyi Zhang, Yongrui Chen, Nan Hu, Guilin Qi, Haiyun Jiang, Siyuan Cheng, Bozhong Tian:
MIKE: A New Benchmark for Fine-grained Multimodal Entity Knowledge Editing. 5018-5029 - Zhen Wan, Yating Zhang, Yexiang Wang, Fei Cheng, Sadao Kurohashi:
Reformulating Domain Adaptation of Large Language Models as Adapt-Retrieve-Revise: A Case Study on Chinese Legal Domain. 5030-5041 - Siddhant Agarwal, Shivam Sharma, Preslav Nakov, Tanmoy Chakraborty:
MemeMQA: Multimodal Question Answering for Memes via Rationale-Based Inferencing. 5042-5078 - Dongfang Li, Zetian Sun, Baotian Hu, Zhenyu Liu, Xinshuo Hu, Xuebo Liu, Min Zhang:
Improving Attributed Text Generation of Large Language Models via Preference Learning. 5079-5101 - SungHo Kim, Juhyeong Park, Yeachan Kim, SangKeun Lee:
KOMBO: Korean Character Representations Based on the Combination Rules of Subcharacters. 5102-5119 - Ryo Yoshida, Taiga Someya, Yohei Oseki:
Tree-Planted Transformers: Unidirectional Transformer Language Models with Implicit Syntactic Supervision. 5120-5134 - Zhiyuan Chang, Mingyang Li, Yi Liu, Junjie Wang, Qing Wang, Yang Liu:
Play Guessing Game with LLM: Indirect Jailbreak Attack with Implicit Clues. 5135-5147 - Sunjun Kweon, Junu Kim, Jiyoun Kim, Sujeong Im, Eunbyeol Cho, Seongsu Bae, Jungwoo Oh, Gyubok Lee, Jong Hak Moon, Seng Chan You, Seungjin Baek, Chang Hoon Han, Yoon Bin Jung, Yohan Jo, Edward Choi:
Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes. 5148-5168 - Weizhi Fei, Xueyan Niu, Pingyi Zhou, Lu Hou, Bo Bai, Lei Deng, Wei Han:
Extending Context Window of Large Language Models via Semantic Compression. 5169-5181 - Wei Jie Yeo, Ranjan Satapathy, Erik Cambria:
Plausible Extractive Rationalization through Semi-Supervised Entailment Signal. 5182-5192 - ChaeHun Park, Koanho Lee, Hyesu Lim, Jaeseok Kim, Junmo Park, Yu-Jung Heo, Du-Seong Chang, Jaegul Choo:
Translation Deserves Better: Analyzing Translation Artifacts in Cross-lingual Visual Question Answering. 5193-5221 - Yu Yang, Jinyu Guo, Kai Shuang, Chenrui Mao:
Scented-EAE: Stage-Customized Entity Type Embedding for Event Argument Extraction. 5222-5235 - Zijian Lei, Dong Qian, William Cheung:
Fast Randomized Low-Rank Adaptation of Pre-trained Language Models with PAC Regularization. 5236-5249 - Yuchen Yang, Yu Wang, Yanfeng Wang:
SDA: Semantic Discrepancy Alignment for Text-conditioned Image Retrieval. 5250-5261 - Haoyu Liu, Jianfeng Liu, Shaohan Huang, Yuefeng Zhan, Hao Sun, Weiwei Deng, Furu Wei, Qi Zhang:
Se²: Sequential Example Selection for In-Context Learning. 5262-5284 - Hanling Yi, Feng Lin, Hongbin Li, Peiyang Ning, Xiaotian Yu, Rong Xiao:
Generation Meets Verification: Accelerating Large Language Model Inference with Smart Parallel Auto-Correct Decoding. 5285-5299 - Boxi Cao, Mengjie Ren, Hongyu Lin, Xianpei Han, Feng Zhang, Junfeng Zhan, Le Sun:
StructEval: Deepen and Broaden Large Language Model Assessment via Structured Evaluation. 5300-5318 - Xinwei Wu, Weilong Dong, Shaoyang Xu, Deyi Xiong:
Mitigating Privacy Seesaw in Large Language Models: Augmented Privacy Neuron Editing via Activation Patching. 5319-5332 - Laura Mascarell, Yan L'Homme, Majed El Helou:
Which Information Matters? Dissecting Human-written Multi-document Summaries with Partial Information Decomposition. 5333-5338 - Biao Yi, Sishuo Chen, Yiming Li, Tong Li, Baolei Zhang, Zheli Liu:
BadActs: A Universal Backdoor Defense in the Activation Space. 5339-5352 - Zhiyuan Liu, Yaorui Shi, An Zhang, Sihang Li, Enzhi Zhang, Xiang Wang, Kenji Kawaguchi, Tat-Seng Chua:
ReactXT: Understanding Molecular "Reaction-ship" via Reaction-Contextualized Molecule-Text Pretraining. 5353-5377 - Quan Yan, Junwen Duan, Jianxin Wang:
Multi-modal Concept Alignment Pre-training for Generative Medical Visual Question Answering. 5378-5389 - Siva Rajesh Kasa, Aniket Goel, Karan Gupta, Sumegh Roychowdhury, Pattisapu Priyatam, Anish Bhanushali, Prasanna Srinivasa Murthy:
Exploring Ordinality in Text Classification: A Comparative Study of Explicit and Implicit Techniques. 5390-5404 - Fan Gao, Hang Jiang, Rui Yang, Qingcheng Zeng, Jinghui Lu, Moritz Blum, Tianwei She, Yuang Jiang, Irene Li:
Evaluating Large Language Models on Wikipedia-Style Survey Generation. 5405-5418 - Wanli Yang, Fei Sun, Xinyu Ma, Xun Liu, Dawei Yin, Xueqi Cheng:
The Butterfly Effect of Model Editing: Few Edits Can Trigger Large Language Models Collapse. 5419-5437 - Qi Li, Xiaowen Chu:
Can We Continually Edit Language Models? On the Knowledge Attenuation in Sequential Model Editing. 5438-5455 - Ge Qu, Jinyang Li, Bowen Li, Bowen Qin, Nan Huo, Chenhao Ma, Reynold Cheng:
Before Generation, Align it! A Novel and Effective Strategy for Mitigating Hallucinations in Text-to-SQL Generation. 5456-5471 - Zhibin Lan, Liqiang Niu, Fandong Meng, Jie Zhou, Min Zhang, Jinsong Su:
Translatotron-V(ison): An End-to-End Model for In-Image Machine Translation. 5472-5485 - Farhad Nooralahzadeh, Yi Zhang, Ellery Smith, Sabine Maennel, Cyril Matthey-Doret, Raphaël de Fondeville, Kurt Stockinger:
StatBot.Swiss: Bilingual Open Data Exploration in Natural Language. 5486-5507 - Yubing Ren, Ping Guo, Yanan Cao, Wei Ma:
Subtle Signatures, Strong Shields: Advancing Robust and Imperceptible Watermarking in Large Language Models. 5508-5519 - Kai Shuang, Zhouji Zhouji, Qiwei Wang, Jinyu Guo:
Thinking about how to extract: Energizing LLMs' emergence capabilities for document-level event argument extraction. 5520-5532 - Shuzheng Si, Helan Hu, Haozhe Zhao, Shuang Zeng, Kaikai An, Zefan Cai, Baobao Chang:
Improving the Robustness of Distantly-Supervised Named Entity Recognition via Uncertainty-Aware Teacher Learning and Student-Student Collaborative Learning. 5533-5546 - Harri Rowlands, Gaku Morio, Dylan Tanner, Christopher D. Manning:
Predicting Narratives of Climate Obstruction in Social Media Advertising. 5547-5558 - Huazheng Wang, Haifeng Sun, Jingyu Wang, Qi Qi, Zixuan Xia, Menghao Zhang, Jianxin Liao:
SSS: Editing Factual Knowledge in Language Models towards Semantic Sparse Space. 5559-5570 - Fengyu Cai, Xinran Zhao, Hongming Zhang, Iryna Gurevych, Heinz Koeppl:
GeoHard: Towards Measuring Class-wise Hardness through Modelling Class Semantics. 5571-5597 - Sheng-Lun Wei, Cheng-Kuang Wu, Hen-Hsen Huang, Hsin-Hsi Chen:
Unveiling Selection Biases: Exploring Order and Token Sensitivity in Large Language Models. 5598-5621 - Fajri Koto, Haonan Li, Sara Shatnawi, Jad Doughman, Abdelrahman Boda Sadallah, Aisha Alraeesi, Khalid Almubarak, Zaid Alyafeai, Neha Sengupta, Shady Shehata, Nizar Habash, Preslav Nakov, Timothy Baldwin:
ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic. 5622-5640 - Edi Muskardin, Martin Tappler, Ingo Pill, Bernhard K. Aichernig, Thomas Pock:
On the Relationship Between RNN Hidden-State Vectors and Semantic Structures. 5641-5658 - Yanjiang Liu, Tianyun Zhong, Yaojie Lu, Hongyu Lin, Ben He, Shuheng Zhou, Huijia Zhu, Weiqiang Wang, Zhongyi Liu, Xianpei Han, Le Sun:
XMC-Agent : Dynamic Navigation over Scalable Hierarchical Index for Incremental Extreme Multi-label Classification. 5659-5672 - Jie Zhu, Junhui Li, Yalong Wen, Lifan Guo:
Benchmarking Large Language Models on CFLUE - A Chinese Financial Language Understanding Evaluation Dataset. 5673-5693 - Zhipeng Chen, Kun Zhou, Xin Zhao, Junchen Wan, Fuzheng Zhang, Di Zhang, Ji-Rong Wen:
Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint. 5694-5711 - Mariia Fedorova, Andrey Kutuzov, Yves Scherrer:
Definition generation for lexical semantic change detection. 5712-5724 - Marta R. Costa-jussà, Mariano Coria Meglioli, Pierre Andrews, David Dale, Prangthip Hansanti, Elahe Kalbassi, Alexandre Mourachko, Christophe Ropers, Carleigh Wood:
MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot Detector. 5725-5734 - Wei Pang, Chuan Zhou, Xiao-Hua Zhou, Xiaojie Wang:
Phased Instruction Fine-Tuning for Large Language Models. 5735-5748 - Xinlin Zhuang, Hongyi Wu, Xinshu Shen, Peimin Yu, Gaowei Yi, Xinhao Chen, Tu Hu, Yang Chen, Yupei Ren, Yadong Zhang, Youqi Song, Binxuan Liu, Man Lan:
TOREE: Evaluating Topic Relevance of Student Essays for Chinese Primary and Middle School Education. 5749-5765 - Yuxiang Cai, Qiao Liu, Yanglei Gan, Changlin Li, Xueyi Liu, Run Lin, Da Luo, JiayeYang JiayeYang:
Predicting the Unpredictable: Uncertainty-Aware Reasoning over Temporal Knowledge Graphs via Diffusion Process. 5766-5778 - Haz Sameen Shahgir, Xianghao Kong, Greg Ver Steeg, Yue Dong:
Asymmetric Bias in Text-to-Image Generation with Adversarial Attacks. 5779-5796 - Xun Liang, Hanyu Wang, Shichao Song, Mengting Hu, Xunzhi Wang, Zhiyu Li, Feiyu Xiong, Bo Tang:
Controlled Text Generation for Large Language Model with Dynamic Attribute Graphs. 5797-5814 - Jun-Hyung Park, Mingyu Lee, Junho Kim, SangKeun Lee:
Coconut: Contextualized Commonsense Unified Transformers for Graph-Based Commonsense Augmentation of Language Models. 5815-5830 - Daniel Mela, Aitor Gonzalez-Agirre, Javier Hernando, Marta Villegas:
Mass-Editing Memory with Attention in Transformers: A cross-lingual exploration of knowledge. 5831-5847 - Yanis Labrak, Adrien Bazoge, Emmanuel Morin, Pierre-Antoine Gourraud, Mickael Rouvier, Richard Dufour:
BioMistral: A Collection of Open-Source Pretrained Large Language Models for Medical Domains. 5848-5864 - Wenxuan Wang, Zhaopeng Tu, Chang Chen, Youliang Yuan, Jen-tse Huang, Wenxiang Jiao, Michael R. Lyu:
All Languages Matter: On the Multilingual Safety of LLMs. 5865-5877 - Yuan Zhang, Wanhong Huang, Yi Feng, Chuanyi Li, Zhiwei Fei, Jidong Ge, Bin Luo, Vincent Ng:
LJPCheck: Functional Tests for Legal Judgment Prediction. 5878-5894 - Wanhong Huang, Yi Feng, Chuanyi Li, Honghan Wu, Jidong Ge, Vincent Ng:
CMDL: A Large-Scale Chinese Multi-Defendant Legal Judgment Prediction Dataset. 5895-5906 - Govind Krishnan Gangadhar, Karl Stratos:
Model Editing by Standard Fine-Tuning. 5907-5913 - Qiming Bao, Alex Yuxuan Peng, Zhenyun Deng, Wanjun Zhong, Gaël Gendron, Timothy Pistotti, Neset Tan, Nathan Young, Yang Chen, Yonghua Zhu, Paul Denny, Michael Witbrock, Jiamou Liu:
Abstract Meaning Representation-Based Logic-Driven Data Augmentation for Logical Reasoning. 5914-5934 - Nathanaël Beau, Benoît Crabbé:
CodeInsight: A Curated Dataset of Practical Coding Solutions from Stack Overflow. 5935-5947 - Luan Thanh Nguyen:
ViHateT5: Enhancing Hate Speech Detection in Vietnamese With a Unified Text-to-Text Transformer Model. 5948-5961 - Julius Steen, Katja Markert:
Bias in News Summarization: Measures, Pitfalls and Corpora. 5962-5983 - Shuchang Tao, Liuyi Yao, Hanxing Ding, Yuexiang Xie, Qi Cao, Fei Sun, Jinyang Gao, Huawei Shen, Bolin Ding:
When to Trust LLMs: Aligning Confidence with Response Quality. 5984-5996 - Xi Ai, Zhiyong Huang:
Zero-shot Cross-lingual Alignment for Embedding Initialization. 5997-6007 - Avshalom Manevich, Reut Tsarfaty:
Mitigating Hallucinations in Large Vision-Language Models (LVLMs) via Language-Contrastive Decoding (LCD). 6008-6022 - Liviu P. Dinu, Ana Sabina Uban, Anca Dinu, Ioan-Bogdan Iordache, Simona Georgescu, Laurentiu Zoicas:
It takes two to borrow: a donor and a recipient. Who's who? 6023-6035 - Shuhao Guan, Derek Greene:
Advancing Post-OCR Correction: A Comparative Study of Synthetic Data. 6036-6047 - Chenghua Huang, Shisong Chen, Zhixu Li, Jianfeng Qu, Yanghua Xiao, Jiaxin Liu, Zhigang Chen:
GeoAgent: To Empower LLMs using Geospatial Tools for Address Standardization. 6048-6063 - Abdurahman Maarouf, Dominik Bär, Dominique Geissler, Stefan Feuerriegel:
HQP: A Human-Annotated Dataset for Detecting Online Propaganda. 6064-6089 - Chi Hu, Yimin Hu, Hang Cao, Tong Xiao, JingBo Zhu:
Teaching Language Models to Self-Improve by Learning from Language Feedback. 6090-6101 - Philipp Wicke, Lennart Wachowiak:
Exploring Spatial Schema Intuitions in Large Language and Vision Models. 6102-6117 - Yibo Miao, Hongcheng Gao, Hao Zhang, Zhijie Deng:
Efficient Detection of LLM-generated Texts with a Bayesian Surrogate Model. 6118-6130 - Layla Bouzoubaa, Elham Aghakhani, Max Song, Quang Trinh, Rezvaneh (Shadi) Rezapour:
Decoding the Narratives: Analyzing Personal Drug Experiences Shared on Reddit. 6131-6148 - Shaobo Cui, Yiyang Feng, Yisong Mao, Yifan Hou, Boi Faltings:
Unveiling the Art of Heading Design: A Harmonious Blend of Summarization, Neology, and Algorithm. 6149-6174 - Amelie Wührl, Dustin Wright, Roman Klinger, Isabelle Augenstein:
Understanding Fine-grained Distortions in Reports of Scientific Findings. 6175-6191 - Yiqiao Jin, Minje Choi, Gaurav Verma, Jindong Wang, Srijan Kumar:
MM-SOC: Benchmarking Multimodal Large Language Models in Social Media Platforms. 6192-6210 - Saurabh Srivastava, Chengyue Huang, Weiguo Fan, Ziyu Yao:
Instances Need More Care: Rewriting Prompts for Instances with LLMs in the Loop Yields Better Zero-Shot Performance. 6211-6232 - Guangzhi Xiong, Qiao Jin, Zhiyong Lu, Aidong Zhang:
Benchmarking Retrieval-Augmented Generation for Medicine. 6233-6251 - Ruibin Yuan, Hanfeng Lin, Yi Wang, Zeyue Tian, Shangda Wu, Tianhao Shen, Ge Zhang, Yuhang Wu, Cong Liu, Ziya Zhou, Liumeng Xue, Ziyang Ma, Qin Liu, Tianyu Zheng, Yizhi Li, Yinghao Ma, Yiming Liang, Xiaowei Chi, Ruibo Liu, Zili Wang, Chenghua Lin, Qifeng Liu, Tao Jiang, Wenhao Huang, Wenhu Chen, Jie Fu, Emmanouil Benetos, Gus Xia, Roger B. Dannenberg, Wei Xue, Shiyin Kang, Yike Guo:
ChatMusician: Understanding and Generating Music Intrinsically with LLM. 6252-6271 - Qingyu Tan, Hwee Tou Ng, Lidong Bing:
Towards Robust Temporal Reasoning of Large Language Models via a Multi-Hop QA Dataset and Pseudo-Instruction Tuning. 6272-6286 - Anton Voronov, Lena Wolf, Max Ryabinin:
Mind Your Format: Towards Consistent Evaluation of In-Context Learning Improvements. 6287-6310 - Haochen Liu, Song Wang, Yaochen Zhu, Yushun Dong, Jundong Li:
Knowledge Graph-Enhanced Large Language Models via Path Selection. 6311-6321 - Chenyang Huang, Abbas Ghaddar, Ivan Kobyzev, Mehdi Rezagholizadeh, Osmar Zaïane, Boxing Chen:
OTTAWA: Optimal TransporT Adaptive Word Aligner for Hallucination and Omission Translation Errors Detection. 6322-6334 - Xuanqing Yu, Wangtao Sun, Jingwei Li, Kang Liu, Chengbao Liu, Jie Tan:
ONSEP: A Novel Online Neural-Symbolic Framework for Event Prediction Based on Large Language Model. 6335-6350 - Guangzhi Sun, Shutong Feng, Dongcheng Jiang, Chao Zhang, Milica Gasic, Philip C. Woodland:
Speech-based Slot Filling using Large Language Models. 6351-6362 - Changye Li, Zhecheng Sheng, Trevor Cohen, Serguei Pakhomov:
Too Big to Fail: Larger Language Models are Disproportionately Resilient to Induction of Dementia-Related Linguistic Anomalies. 6363-6377 - Tzuf Paz-Argaman, Itai Mondshine, Asaf Achi Mordechai, Reut Tsarfaty:
HeSum: a Novel Dataset for Abstractive Text Summarization in Hebrew. 6378-6388 - Yuqing Wang, Yun Zhao:
TRAM: Benchmarking Temporal Reasoning for Large Language Models. 6389-6415 - Alfonso Amayuelas, Kyle Wong, Liangming Pan, Wenhu Chen, William Yang Wang:
Knowledge of Knowledge: Exploring Known-Unknowns Uncertainty with Large Language Models. 6416-6432 - Shaobo Cui, Lazar Milikic, Yiyang Feng, Mete Ismayilzada, Debjit Paul, Antoine Bosselut, Boi Faltings:
Exploring Defeasibility in Causal Reasoning. 6433-6452 - Saumya Gandhi, Ritu Gala, Vijay Viswanathan, Tongshuang Wu, Graham Neubig:
Better Synthetic Data by Retrieving and Transforming Existing Datasets. 6453-6466 - Yanzheng Xiang, Hanqi Yan, Lin Gui, Yulan He:
Addressing Order Sensitivity of In-Context Demonstration Examples in Causal Language Models. 6467-6481 - Joan Plepi, Charles Welch, Lucie Flek:
Perspective Taking through Generating Responses to Conflict Situations. 6482-6497 - Nicholas Lee, Thanakul Wattanawong, Sehoon Kim, Karttikeya Mangalam, Sheng Shen, Gopala Anumanchipalli, Michael W. Mahoney, Kurt Keutzer, Amir Gholami:
LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement. 6498-6526 - Ori Ernst, Ori Shapira, Aviv Slobodkin, Sharon Adar, Mohit Bansal, Jacob Goldberger, Ran Levy, Ido Dagan:
The Power of Summary-Source Alignments. 6527-6548 - Gantavya Bhatt, Yifang Chen, Arnav Mohanty Das, Jifan Zhang, Sang T. Truong, Stephen Mussmann, Yinglun Zhu, Jeff A. Bilmes, Simon S. Du, Kevin G. Jamieson, Jordan T. Ash, Robert D. Nowak:
An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models. 6549-6560 - Yuanhe Tian, Fei Xia, Yan Song:
Learning Multimodal Contrast with Cross-modal Memory and Reinforced Contrast Recognition. 6561-6573 - Seyed Ali Bahrainian, Jonathan Dou, Carsten Eickhoff:
Text Simplification via Adaptive Teaching. 6574-6584 - Gokcen Gokceoglu, Devrim Cavusoglu, Emre Akbas, Özen Nergis Dolcerocca:
A multi-level multi-label text classification dataset of 19th century Ottoman and Russian literary and critical texts. 6585-6596 - Laura Cabello, Uchenna Akujuobi:
It is Simple Sometimes: A Study On Improving Aspect-Based Sentiment Analysis Performance. 6597-6610 - Zihao He, Siyi Guo, Ashwin Rao, Kristina Lerman:
Whose Emotions and Moral Sentiments do Language Models Reflect? 6611-6631 - Siyin Wang, Shimin Li, Tianxiang Sun, Jinlan Fu, Qinyuan Cheng, Jiasheng Ye, Junjie Ye, Xipeng Qiu, Xuanjing Huang:
LLM can Achieve Self-Regulation via Hyperparameter Aware Generation. 6632-6646 - Weisen Jiang, Han Shi, Longhui Yu, Zhengying Liu, Yu Zhang, Zhenguo Li, James T. Kwok:
Forward-Backward Reasoning in Large Language Models for Mathematical Verification. 6647-6661 - Jiuzhou Han, Wray L. Buntine, Ehsan Shareghi:
Towards Uncertainty-Aware Language Agent. 6662-6685 - Shuya Lin, Yuxiong Wang, Jonathan Dong, Shiguang Ni:
Detection and Positive Reconstruction of Cognitive Distortion Sentences: Mandarin Dataset and Evaluation. 6686-6701 - Jiuzhou Han, Nigel Collier, Wray L. Buntine, Ehsan Shareghi:
PiVe: Prompting with Iterative Verification Improving Graph-based Generative Capability of LLMs. 6702-6718 - Yifu Gao, Linbo Qiao, Zhigang Kan, Zhihua Wen, Yongquan He, Dongsheng Li:
Two-stage Generative Question Answering on Temporal Knowledge Graph Using Large Language Models. 6719-6734 - Syeda Nahida Akter, Sangwu Lee, Yingshan Chang, Yonatan Bisk, Eric Nyberg:
VISREAS: Complex Visual Reasoning with Unanswerable Questions. 6735-6752 - Yuxue Hu, Junsong Li, Tongguan Wang, Dongyu Su, Guixin Su, Ying Sha:
A Unified Generative Framework for Bilingual Euphemism Detection and Identification. 6753-6766 - Gaoxiang Cong, Yuankai Qi, Liang Li, Amin Beheshti, Zhedong Zhang, Anton van den Hengel, Ming-Hsuan Yang, Chenggang Yan, Qingming Huang:
StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing. 6767-6779 - Jiechao Yang, Yong Liu:
ETAS: Zero-Shot Transformer Architecture Search via Network Trainability and Expressivity. 6780-6795 - Kaishuai Xu, Yi Cheng, Wenjun Hou, Qiaoyu Tan, Wenjie Li:
Reasoning Like a Doctor: Improving Medical Dialogue Systems via Diagnostic Reasoning Process Alignment. 6796-6814 - Yanan Wu, Jie Liu, Xingyuan Bu, Jiaheng Liu, Zhanhui Zhou, Yuanxing Zhang, Chenchen Zhang, ZhiqiBai ZhiqiBai, Haibin Chen, Tiezheng Ge, Wanli Ouyang, Wenbo Su, Bo Zheng:
ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models. 6815-6839 - Shu Chen, Xinyan Guan, Yaojie Lu, Hongyu Lin, Xianpei Han, Le Sun:
REInstruct: Building Instruction Data from Unlabeled Corpus. 6840-6856 - Xin Chen, Hanxian Huang, Yanjun Gao, Yi Wang, Jishen Zhao, Ke Ding:
Learning to Maximize Mutual Information for Chain-of-Thought Distillation. 6857-6868 - Zhisheng Lin, Han Fu, Chenghao Liu, Zhuo Li, Jianling Sun:
PEMT: Multi-Task Correlation Guided Mixture-of-Experts Enables Parameter-Efficient Transfer Learning. 6869-6883 - Hongwei Liu, Zilong Zheng, Yuxuan Qiao, Haodong Duan, Zhiwei Fei, Fengzhe Zhou, Wenwei Zhang, Songyang Zhang, Dahua Lin, Kai Chen:
MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark. 6884-6915 - Jie Ren, Qipeng Guo, Hang Yan, Dongrui Liu, Quanshi Zhang, Xipeng Qiu, Dahua Lin:
Identifying Semantic Induction Heads to Understand In-Context Learning. 6916-6932 - Lai Jiang, Hongqiu Wu, Hai Zhao, Min Zhang:
Chinese Spelling Corrector Is Just a Language Learner. 6933-6943 - Junfei Wu, Qiang Liu, Ding Wang, Jinghao Zhang, Shu Wu, Liang Wang, Tieniu Tan:
Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models. 6944-6962 - Zihan Zhang, Meng Fang, Ling Chen:
RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering. 6963-6975 - Xi Chen, Songyang Zhang, Qibing Bai, Kai Chen, Satoshi Nakamura:
LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models. 6976-6987 - Ming Gu, Yan Yang:
Plan, Generate and Complicate: Improving Low-resource Dialogue State Tracking via Easy-to-Difficult Zero-shot Data Augmentation. 6988-7005 - Shanghaoran Quan:
DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling. 7006-7028 - Ikuya Yamada, Ryokan Ri:
LEIA: Facilitating Cross-lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation. 7029-7039 - Yijie Chen, Yijin Liu, Fandong Meng, Yufeng Chen, Jinan Xu, Jie Zhou:
Comments as Natural Logic Pivots: Improve Code Generation via Comment Perspective. 7040-7051 - Sunhao Dai, Weihao Liu, Yuqi Zhou, Liang Pang, Rongju Ruan, Gang Wang, Zhenhua Dong, Jun Xu, Ji-Rong Wen:
Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration. 7052-7074 - Yujie Feng, Bo Liu, Xiaoyu Dong, Zexin Lu, Li-Ming Zhan, Xiao-Ming Wu, Albert Y. S. Lam:
Continual Dialogue State Tracking via Reason-of-Select Distillation. 7075-7087 - Yafu Li, Zhilin Wang, Leyang Cui, Wei Bi, Shuming Shi, Yue Zhang:
Spotting AI's Touch: Identifying LLM-Paraphrased Spans in Text. 7088-7107 - Xinyu Lu, Bowen Yu, Yaojie Lu, Hongyu Lin, Haiyang Yu, Le Sun, Xianpei Han, Yongbin Li:
SoFA: Shielded On-the-fly Alignment via Priority Rule Following. 7108-7136 - Ariel Goldstein, Gabriel Stanovsky:
Do Zombies Understand? A Choose-Your-Own-Adventure Exploration of Machine Cognition. 7137-7143 - Lukas Christ, Shahin Amiriparian, Manuel Milling, Ilhan Aslan, Björn W. Schuller:
Modeling Emotional Trajectories in Written Stories Utilizing Transformers and Weakly-Supervised Learning. 7144-7159 - Meng Cao, Haoran Tang, Jinfa Huang, Peng Jin, Can Zhang, Ruyang Liu, Long Chen, Xiaodan Liang, Li Yuan, Ge Li:
RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter. 7160-7174 - Longyue Wang, Zefeng Du, Wenxiang Jiao, Chenyang Lyu, Jianhui Pang, Leyang Cui, Kaiqiang Song, Derek F. Wong, Shuming Shi, Zhaopeng Tu:
Benchmarking and Improving Long-Text Translation with Large Language Models. 7175-7187 - Shixuan Fan, Wei Wei, Xiaofei Wen, Xian-Ling Mao, Jixiong Chen, Dangyang Chen:
Personalized Topic Selection Model for Topic-Grounded Dialogue. 7188-7202 - Lvxue Li, Jiaqi Chen, Xinyu Lu, Yaojie Lu, Hongyu Lin, Shuheng Zhou, Huijia Zhu, Weiqiang Wang, Zhongyi Liu, Xianpei Han, Le Sun:
Debiasing In-Context Learning by Instructing LLMs How to Follow Demonstrations. 7203-7215 - Christos Vlachos, Themos Stafylakis, Ion Androutsopoulos:
Comparing Data Augmentation Methods for End-to-End Task-Oriented Dialog Systems. 7216-7240 - Jian Ma, Wenguan Wang, Yi Yang, Feng Zheng:
MS2SL: Multimodal Spoken Data-Driven Continuous Sign Language Production. 7241-7254 - Xueliang Zhao, Xinting Huang, Tingchen Fu, Qintong Li, Shansan Gong, Lemao Liu, Wei Bi, Lingpeng Kong:
BBA: Bi-Modal Behavioral Alignment for Reasoning with Large Vision-Language Models. 7255-7279 - Tong Zheng, Bei Li, Huiwen Bao, Jiale Wang, Weiqiao Shan, Tong Xiao, JingBo Zhu:
PartialFormer: Modeling Part Instead of Whole for Machine Translation. 7280-7294 - Jieyong Kim, Ryang Heo, Yongsik Seo, SeongKu Kang, Jinyoung Yeo, Dongha Lee:
Self-Consistent Reasoning-based Aspect-Sentiment Quad Prediction with Extract-Then-Assign Strategy. 7295-7303 - Yihong Dong, Kangcheng Luo, Xue Jiang, Zhi Jin, Ge Li:
PACE: Improving Prompt with Actor-Critic Editing for Large Language Model. 7304-7323 - Huatao Xu, Liying Han, Qirui Yang, Mo Li, Mani B. Srivastava:
Penetrative AI: Making LLMs Comprehend the Physical World. 7324-7341 - Miaoran Zhang, Vagrant Gautam, Mingyang Wang, Jesujoba Alabi, Xiaoyu Shen, Dietrich Klakow, Marius Mosbach:
The Impact of Demonstrations on Multilingual In-Context Learning: A Multidimensional Analysis. 7342-7371 - Ming Dong, Yujing Chen, Miao Zhang, Hao Sun, Tingting He:
Rich Semantic Knowledge Enhanced Large Language Models for Few-shot Chinese Spell Checking. 7372-7383 - Pranjal A. Chitale, Jay P. Gala, Raj Dabre:
An Empirical Study of In-context Learning in LLMs for Machine Translation. 7384-7406 - Xinpeng Wang, Bolei Ma, Chengzhi Hu, Leon Weber-Genzel, Paul Röttger, Frauke Kreuter, Dirk Hovy, Barbara Plank:
"My Answer is C": First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models. 7407-7416 - Lei Sun, Zhengwei Tao, Youdi Li, Hiroshi Arakawa:
ODA: Observation-Driven Agent for integrating LLMs and Knowledge Graphs. 7417-7431 - Zihao Xu, Yi Liu, Gelei Deng, Yuekang Li, Stjepan Picek:
A Comprehensive Study of Jailbreak Attack versus Defense for Large Language Models. 7432-7449 - Panagiotis Kaliosis, John Pavlopoulos, Foivos Charalampakos, Georgios Moschovis, Ion Androutsopoulos:
A Data-Driven Guided Decoding Mechanism for Diagnostic Captioning. 7450-7466 - Hengyuan Zhang, Yanru Wu, Dawei Li, Sak Yang, Rui Zhao, Yong Jiang, Fei Tan:
Balancing Speciality and Versatility: a Coarse to Fine Framework for Supervised Fine-tuning Large Language Model. 7467-7509 - Ting Xu, Haiqin Yang, Fei Zhao, Zhen Wu, Xinyu Dai:
A Two-Agent Game for Zero-shot Relation Triplet Extraction. 7510-7527 - Naibin Gu, Peng Fu, Xiyu Liu, Bowen Shen, Zheng Lin, Weiping Wang:
Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning. 7528-7541 - Manuel Lardelli, Giuseppe Attanasio, Anne Lauscher:
Building Bridges: A Dataset for Evaluating Gender-Fair Machine Translation into German. 7542-7550 - Shichao Sun, Ruifeng Yuan, Ziqiang Cao, Wenjie Li, Pengfei Liu:
Prompt Chaining or Stepwise Prompt? Refinement in Text Summarization. 7551-7558 - Xinwei Long, Jiali Zeng, Fandong Meng, Jie Zhou, Bowen Zhou:
Trust in Internal or External Knowledge? Generative Multi-Modal Entity Linking with Knowledge Retriever. 7559-7569 - Taichi Aida, Danushka Bollegala:
A Semantic Distance Metric Learning approach for Lexical Semantic Change Detection. 7570-7584 - Yafu Li, Huajian Zhang, Jianhao Yan, Yongjing Yin, Yue Zhang:
What Have We Achieved on Non-autoregressive Translation? 7585-7606 - Tal Reiss, George Kour, Naama Zwerdling, Ateret Anaby-Tavor, Yedid Hoshen:
From Zero to Hero: Cold-Start Anomaly Detection. 7607-7617 - Runcong Zhao, Qinglin Zhu, Hainiu Xu, Jiazheng Li, Yuxiang Zhou, Yulan He, Lin Gui:
Large Language Models Fall Short: Understanding Complex Relationships in Detective Narratives. 7618-7638 - Shanbao Qiao, Xuebing Liu, Seung-Hoon Na:
DistillMIKE: Editing Distillation of Massive In-Context Knowledge Editing in Large Language Models. 7639-7654 - Heming Xia, Zhe Yang, Qingxiu Dong, Peiyi Wang, Yongqi Li, Tao Ge, Tianyu Liu, Wenjie Li, Zhifang Sui:
Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding. 7655-7671 - Gibaeg Kim, Sanghun Im, Heung-Seon Oh:
Hierarchy-aware Biased Bound Margin Loss Function for Hierarchical Text Classification. 7672-7682 - Zhuo Chen, Xinyu Wang, Yong Jiang, Pengjun Xie, Fei Huang, Kewei Tu:
Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts. 7683-7694 - Korbinian Randl, John Pavlopoulos, Aron Henriksson, Tony Lindgren:
CICLe: Conformal In-Context Learning for Largescale Multi-Class Food Risk Classification. 7695-7715 - Ruikang Liu, Haoli Bai, Haokun Lin, Yuening Li, Han Gao, Zhengzhuo Xu, Lu Hou, Jun Yao, Chun Yuan:
IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact. 7716-7741 - Tomoe Taniguchi, Daichi Mochihashi, Ichiro Kobayashi:
Learning Adverbs with Spectral Mixture Kernels. 7742-7752 - Jinchang Hou, Chang Ao, Haihong Wu, Xiangtao Kong, Zhigang Zheng, Daijia Tang, Chengming Li, Xiping Hu, Ruifeng Xu, Shiwen Ni, Min Yang:
E-EVAL: A Comprehensive Chinese K-12 Education Evaluation Benchmark for Large Language Models. 7753-7774 - Fanqing Meng, Wenqi Shao, Quanfeng Lu, Peng Gao, Kaipeng Zhang, Yu Qiao, Ping Luo:
ChartAssistant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning. 7775-7803 - Xiang Li, Shizhu He, Fangyu Lei, JunYang JunYang, Tianhuang Su, Kang Liu, Jun Zhao:
Teaching Small Language Models to Reason for Knowledge-Intensive Multi-Hop Question Answering. 7804-7816 - Yuhang Lai, Siyuan Wang, Shujun Liu, Xuanjing Huang, Zhongyu Wei:
ALaRM: Align Language Models via Hierarchical Rewards Modeling. 7817-7831 - Haoxin Liu, Zhiyuan Zhao, Jindong Wang, Harshavardhan Kamarthi, B. Aditya Prakash:
LSTPrompt: Large Language Models as Zero-Shot Time Series Forecasters by Long-Short-Term Prompting. 7832-7840 - Zhenyi Lu, Jie Tian, Wei Wei, Xiaoye Qu, Yu Cheng, Wenfeng Xie, Dangyang Chen:
Mitigating Boundary Ambiguity and Inherent Bias for Text Classification in the Era of Large Language Models. 7841-7864 - Wei Du, Peixuan Li, Haodong Zhao, Tianjie Ju, Ge Ren, Gongshen Liu:
UOR: Universal Backdoor Attacks on Pre-trained Language Models. 7865-7877 - Patrick Haller, Lena S. Bolliger, Lena Ann Jäger:
Language models emulate certain cognitive profiles: An investigation of how predictability measures interact with individual differences. 7878-7892 - Erica Cai, Brendan T. O'Connor:
The State of Relation Extraction Data Quality: Is Bigger Always Better? 7893-7906 - Shudan Zhang, Hanlin Zhao, Xiao Liu, Qinkai Zheng, Zehan Qi, Xiaotao Gu, Yuxiao Dong, Jie Tang:
NaturalCodeBench: Examining Coding Performance Mismatch on HumanEval and Natural User Queries. 7907-7928 - Weizhe Yuan, Pengfei Liu, Matthias Gallé:
LLMCrit: Teaching Large Language Models to Use Criteria. 7929-7960 - Leonardo Ranaldi, Giulia Pucci, André Freitas:
Empowering cross-lingual abilities of instruction-tuned large language models by translation-following demonstrations. 7961-7973 - Nitesh Kumar, Usashi Chatterjee, Steven Schockaert:
Ranking Entities along Conceptual Space Dimensions with LLMs: An Analysis of Fine-Tuning Strategies. 7974-7989 - Yan Gao, Zhiwei Cao, Zhongjian Miao, Baosong Yang, Shiyu Liu, Min Zhang, Jinsong Su:
Efficient k-Nearest-Neighbor Machine Translation with Dynamic Retrieval. 7990-8001 - Martin Courtois, Malte Ostendorff, Leonhard Hennig, Georg Rehm:
Symmetric Dot-Product Attention for Efficient Training of BERT Language Models. 8002-8011 - Fanyou Wu, Weijie Xu, Chandan K. Reddy, Srinivasan Sengamedu:
Synthesizing Conversations from Unlabeled Documents using Automatic Response Segmentation. 8012-8026 - Marcio Fonseca, Shay B. Cohen:
Can Large Language Models Follow Concept Annotation Guidelines? A Case Study on Scientific and Financial Domains. 8027-8042 - Armin Sarhangzadeh, Taro Watanabe:
Alignment-Based Decoding Policy for Low-Latency and Anticipation-Free Neural Japanese Input Method Editors. 8043-8054 - Zhunheng Wang, Xiaoyi Liu, Mengting Hu, Rui Ying, Ming Jiang, Jianfeng Wu, Yalan Xie, Hang Gao, Renhong Cheng:
ECoK: Emotional Commonsense Knowledge Graph for Mining Emotional Gold. 8055-8074 - Jiashu Yao, Heyan Huang, Zeming Liu, Yuhang Guo:
Deterministic Reversible Data Augmentation for Neural Machine Translation. 8075-8089 - Anlai Zhou, Sunshine Jiang, Yifei Liu, Yiquan Wu, Kun Kuang, Jun Xiao:
Latent Learningscape Guided In-context Learning. 8090-8101 - Biqing Qi, Junqi Gao, Kaiyan Zhang, Dong Li, Jianxing Liu, Ligang Wu, Bowen Zhou:
SMR: State Memory Replay for Long Sequence Modeling. 8102-8116 - Aditi Mishra, Sajjadur Rahman, Kushan Mitra, Hannah Kim, Estevam Hruschka:
Characterizing Large Language Models as Rationalizers of Knowledge-intensive Tasks. 8117-8139 - Chenxi Li, Yuanhe Tian, Zhaxi Zerong, Yan Song, Fei Xia:
Challenging Large Language Models with New Tasks: A Study on their Adaptability and Robustness. 8140-8162 - Oleg Vasilyev, Fumika Isono, John Bohannon:
Linear Cross-Lingual Mapping of Sentence Embeddings. 8163-8171 - Xinliang Frederick Zhang, Carter Wood Blum, Temma Choji, Shalin Shah, Alakananda Vempala:
ULTRA: Unleash LLMs' Potential for Event Argument Extraction through Hierarchical Modeling and Pair-wise Self-Refinement. 8172-8185 - Wen Lai, Mohsen Mesgar, Alexander Fraser:
LLMs Beyond English: Scaling the Multilingual Capability of LLMs with Cross-Lingual Feedback. 8186-8213 - Haifeng Qian, Sujan Kumar Gonugondla, Sungsoo Ha, Mingyue Shang, Sanjay Krishna Gouda, Ramesh Nallapati, Sudipta Sengupta, Xiaofei Ma, Anoop Deoras:
BASS: Batched Attention-optimized Speculative Sampling. 8214-8224 - Dekun Wu, Haochen Shi, Zhiyuan Sun, Bang Liu:
Deciphering Digital Detectives: Understanding LLM Behaviors and Capabilities in Multi-Agent Mystery Games. 8225-8291 - Sagi Shaier, Lawrence Hunter, Katharina von der Wense:
It Is Not About What You Say, It Is About How You Say It: A Surprisingly Simple Approach for Improving Reading Comprehension. 8292-8305 - Michelle Lo, Fazl Barez, Shay B. Cohen:
Large Language Models Relearn Removed Concepts. 8306-8323 - Xinyu Wang, Hainiu Xu, Lin Gui, Yulan He:
Towards Unified Task Embeddings Across Multiple Models: Bridging the Gap for Prompt-Based Large Language Models and Beyond. 8324-8340 - Yinhong Liu, Yimai Fang, David Vandyke, Nigel Collier:
TOAD: Task-Oriented Automatic Dialogs with Diverse Response Styles. 8341-8356 - Zhongping Zhang, Wenda Qin, Bryan A. Plummer:
Machine-Generated Text Localization. 8357-8371 - Fabrice Lamarche, Philippe Langlais:
BenchIE⌃FL: A Manually Re-Annotated Fact-Based Open Information Extraction Benchmark. 8372-8394 - Ishan Agrawal, Zhijing Jin, Ehsan Mokhtarian, Siyuan Guo, Yuen Chen, Mrinmaya Sachan, Bernhard Schölkopf:
CausalCite: A Causal Formulation of Paper Citations. 8395-8410 - Wenhao Zhu, Shujian Huang, Fei Yuan, Shuaijie She, Jiajun Chen, Alexandra Birch:
Question Translation Training for Better Multilingual Reasoning. 8411-8423 - Ante Wang, Linfeng Song, Baolin Peng, Lifeng Jin, Ye Tian, Haitao Mi, Jinsong Su, Dong Yu:
Improving LLM Generations via Fine-Grained Self-Endorsement. 8424-8436 - Wanqiu Long, Siddharth Narayanaswamy, Bonnie Webber:
Multi-Label Classification for Implicit Discourse Relation Recognition. 8437-8451 - Hannah McLean Babe, Sydney Nguyen, Yangtian Zi, Arjun Guha, Molly Q. Feldman, Carolyn Jane Anderson:
StudentEval: A Benchmark of Student-Written Prompts for Large Language Models of Code. 8452-8474 - Xuanming Zhang, Zixun Chen, Zhou Yu:
ProLex: A Benchmark for Language Proficiency-oriented Lexical Substitution. 8475-8493 - Yuu Jinnai, Ukyo Honda, Tetsuro Morimura, Peinan Zhang:
Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding. 8494-8525 - Spencer Rarrick, Ranjita Naik, Sundar Poudel, Vishal Chowdhary:
GATE X-E : A Challenge Set for Gender-Fair Translations from Weakly-Gendered Languages. 8526-8546 - Yuu Jinnai, Kaito Ariu:
Hyperparameter-Free Approach for Faster Minimum Bayes Risk Decoding. 8547-8566 - Masashi Oshika, Makoto Morishita, Tsutomu Hirao, Ryohei Sasano, Koichi Takeda:
Simplifying Translations for Children: Iterative Simplification Considering Age of Acquisition with LLMs. 8567-8577 - Shuqi Liu, Bowei He, Linqi Song:
Bi-Chainer: Automated Large Language Models Reasoning with Bidirectional Chaining. 8578-8598 - Marcio Fonseca, Shay B. Cohen:
Can Large Language Model Summarizers Adapt to Diverse Scientific Communication Goals? 8599-8618 - Guangqian Yang, Yi Liu, Lei Zhang, Licheng Zhang, Hongtao Xie, Zhendong Mao:
Knowledge Context Modeling with Pre-trained Language Models for Contrastive Knowledge Graph Completion. 8619-8630 - Tao Zhang, Chenwei Zhang, Xian Li, Jingbo Shang, Hoang Nguyen, Philip S. Yu:
Stronger, Lighter, Better: Towards Life-Long Attribute Value Extraction for E-Commerce Products. 8631-8643 - Hyuk Namgoong, Jeesu Jung, Sangkeun Jung, Yoon-Hyung Roh:
Exploring Domain Robust Lightweight Reward Models based on Router Mechanism. 8644-8652 - Wenbin An, Wenkai Shi, Feng Tian, Haonan Lin, Qianying Wang, Yaqiang Wu, Mingxiang Cai, Luyan Wang, Yan Chen, Haiping Zhu, Ping Chen:
Generalized Category Discovery with Large Language Models in the Loop. 8653-8665 - Zhenyi Wang, Haiyan Ning, Qing Ling, Dan Wang:
VAEGPT-Sim: Improving Sentence Representation with Limited Corpus Using Gradually-Denoising VAE. 8666-8681 - Yiduo Guo, Zekai Zhang, Yaobo Liang, Dongyan Zhao, Nan Duan:
PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion. 8682-8701 - Xinran Zhao, Hongming Zhang, Xiaoman Pan, Wenlin Yao, Dong Yu, Tongshuang Wu, Jianshu Chen:
Fact-and-Reflection (FaR) Improves Confidence Calibration of Large Language Models. 8702-8718 - Hong Chen, Chengtao Lv, Liang Ding, Haotong Qin, Xiabin Zhou, Yifu Ding, Xuebo Liu, Min Zhang, Jinyang Guo, Xianglong Liu, Dacheng Tao:
DB-LLM: Accurate Dual-Binarization for Efficient LLMs. 8719-8730 - Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, Lei Li, Sishuo Chen, Xu Sun, Lu Hou:
TempCompass: Do Video LLMs Really Understand Videos? 8731-8772 - Shengqi Zhu, Jeffrey M. Rzeszotarski:
"Get Their Hands Dirty, Not Mine": On Researcher-Annotator Collaboration and the Agency of Annotators. 8773-8782 - Chen Zhang, Xiao Liu, Jiuheng Lin, Yansong Feng:
Teaching Large Language Models an Unseen Language on the Fly. 8783-8800 - Qingyu Lu, Baopu Qiu, Liang Ding, Kanjian Zhang, Tom Kocmi, Dacheng Tao:
Error Analysis Prompting Enables Human-Like Translation Evaluation in Large Language Models. 8801-8816 - Yi Zong, Xipeng Qiu:
GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation. 8817-8825 - Jiapeng Wang, Chengyu Wang, Tingfeng Cao, Jun Huang, Lianwen Jin:
DiffChat: Learning to Chat with Text-to-Image Synthesis Models for Interactive Image Creation. 8826-8840 - Kejuan Yang, Xiao Liu, Kaiwen Men, Aohan Zeng, Yuxiao Dong, Jie Tang:
Revisiting Parallel Context Windows: A Frustratingly Simple Alternative and Chain-of-Thought Deterioration. 8841-8852 - Yidan Zhang, Mingfeng Xue, Dayiheng Liu, Zhenan He:
Rationales for Answers to Simple Math Word Problems Confuse Large Language Models. 8853-8869 - Shuhua Shi, Shaohan Huang, Minghui Song, Zhoujun Li, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang:
ResLoRA: Identity Residual Mapping in Low-Rank Adaption. 8870-8884 - Chenxu Wang, Bin Dai, Huaping Liu, Baoyuan Wang:
Towards Objectively Benchmarking Social Intelligence of Language Agents at the Action Level. 8885-8897 - Huiyao Chen, Xinxin Li, Meishan Zhang, Min Zhang:
Semantic Role Labeling from Chinese Speech via End-to-End Learning. 8898-8911 - Zhengwei Tao, Zhi Jin, Junqiang Huang, Xiancai Chen, Xiaoying Bai, Yifan Zhang, Chongyang Tao:
MEEL: Multi-Modal Event Evolution Learning. 8912-8925 - Tingting Liang, Chenxin Jin, Lingzhi Wang, Wenqi Fan, Congying Xia, Kai Chen, Yuyu Yin:
LLM-REDIAL: A Large-Scale Dataset for Conversational Recommender Systems Created from User Behaviors with LLMs. 8926-8939 - Mahammed Kamruzzaman, Md. Minul Islam Shovon, Gene Louis Kim:
Investigating Subtler Biases in LLMs: Ageism, Beauty, Institutional, and Nationality Bias in Generative Models. 8940-8965 - Zhengwei Tao, Xiancai Chen, Zhi Jin, Xiaoying Bai, Haiyan Zhao, Yiwei Lou:
EVIT: Event-Oriented Instruction Tuning for Event Reasoning. 8966-8979 - Juseon-Do, Hidetaka Kamigaito, Manabu Okumura, Jingun Kwon:
InstructCMP: Length Control in Sentence Compression through Instruction-based Large Language Models. 8980-8996 - Karan Goyal, Mayank Goel, Vikram Goyal, Mukesh K. Mohania:
SymTax: Symbiotic Relationship and Taxonomy Fusion for Effective Citation Recommendation. 8997-9008 - Yejun Yoon, Seunghyun Yoon, Kunwoo Park:
Assessing News Thumbnail Representativeness: Counterfactual text can enhance the cross-modal matching ability. 9009-9024 - Zijin Hong, Jian Liu:
Towards Better Question Generation in QA-based Event Extraction. 9025-9038 - Yuanhang Zheng, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu:
Budget-Constrained Tool Learning with Planning. 9039-9052 - Huayang Li, Siheng Li, Deng Cai, Longyue Wang, Lemao Liu, Taro Watanabe, Yujiu Yang, Shuming Shi:
TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wild. 9053-9076 - Shichao Sun, Junlong Li, Weizhe Yuan, Ruifeng Yuan, Wenjie Li, Pengfei Liu:
The Critique of Critique. 9077-9096 - Xinbei Ma, Zhuosheng Zhang, Hai Zhao:
CoCo-Agent: A Comprehensive Cognitive MLLM Agent for Smartphone GUI Automation. 9097-9110 - Yue Fan, Hu Zhang, Ru Li, Yujie Wang, Hongye Tan, Jiye Liang:
FRVA: Fact-Retrieval and Verification Augmented Entailment Tree Generation for Explainable Question Answering. 9111-9128 - Yuansen Zhang, Xiao Wang, Tianze Chen, Jiayi Fu, Tao Gui, Qi Zhang:
P4: Plug-and-Play Discrete Prompting for Large Language Models Personalization. 9129-9144 - Yiduo Guo, Yaobo Liang, Dongyan Zhao, Nan Duan:
Large Language Models Can Learn Representation in Natural Language. 9145-9154 - Qingkai Fang, Zhengrui Ma, Yan Zhou, Min Zhang, Yang Feng:
CTC-based Non-autoregressive Textless Speech-to-Speech Translation. 9155-9161 - Yongqi Fan, Yansha Zhu, Kui Xue, Jingping Liu, Tong Ruan:
RRNorm: A Novel Framework for Chinese Disease Diagnoses Normalization via LLM-Driven Terminology Component Recognition and Reconstruction. 9162-9175 - Weiyan Zhang, Wanpeng Lu, Jiacheng Wang, Yating Wang, Lihan Chen, Haiyun Jiang, Jingping Liu, Tong Ruan:
Unexpected Phenomenon: LLMs' Spurious Associations in Information Extraction. 9176-9190 - Yongheng Zhang, Qiguang Chen, Min Li, Wanxiang Che, Libo Qin:
AutoCAP: Towards Automatic Cross-lingual Alignment Planning for Zero-shot Chain-of-Thought. 9191-9200 - Zengkui Sun, Yijin Liu, Fandong Meng, Jinan Xu, Yufeng Chen, Jie Zhou:
LCS: A Language Converter Strategy for Zero-Shot Neural Machine Translation. 9201-9214 - Xiao Liu, Zirui Wu, Xueqing Wu, Pan Lu, Kai-Wei Chang, Yansong Feng:
Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data. 9215-9235 - Jingwei Yi, Rui Ye, Qisi Chen, Bin Zhu, Siheng Chen, Defu Lian, Guangzhong Sun, Xing Xie, Fangzhao Wu:
On the Vulnerability of Safety Alignment in Open-Access LLMs. 9236-9260 - Pan Yang, Dandan Song, Zhijing Wu, Yanru Zhou:
PEK: A Parameter-Efficient Framework for Knowledge-Grounded Dialogue Generation. 9261-9273 - Liwen Zheng, Chaozhuo Li, Xi Zhang, Yuming Shang, Feiran Huang, Haoran Jia:
Evidence Retrieval is almost All You Need for Fact Verification. 9274-9281 - Zengkui Sun, Yijin Liu, Jiaan Wang, Fandong Meng, Jinan Xu, Yufeng Chen, Jie Zhou:
Outdated Issue Aware Decoding for Factual Knowledge Editing. 9282-9293 - Maximilian Spliethöver, Sai Nikhil Menon, Henning Wachsmuth:
Disentangling Dialect from Social Bias via Multitask Learning to Improve Fairness. 9294-9313 - Stephen Meisenbacher, Maulik Chevli, Juraj Vladika, Florian Matthes:
DP-MLM: Differentially Private Text Rewriting Using Masked Language Models. 9314-9328 - David Mogrovejo, Thamar Solorio:
Question-Instructed Visual Descriptions for Zero-Shot Video Answering. 9329-9339 - Huanhuan Ma, Weizhi Xu, Yifan Wei, Liuji Chen, Liang Wang, Qiang Liu, Shu Wu:
EX-FEVER: A Dataset for Multi-hop Explainable Fact Verification. 9340-9353 - Zehui Chen, Kuikun Liu, Qiuchen Wang, Wenwei Zhang, Jiangning Liu, Dahua Lin, Kai Chen, Feng Zhao:
Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models. 9354-9366 - Ekaterina Fadeeva, Aleksandr Rubashevskii, Artem Shelmanov, Sergey Petrakov, Haonan Li, Hamdy Mubarak, Evgenii Tsymbalov, Gleb Kuzmin, Alexander Panchenko, Timothy Baldwin, Preslav Nakov, Maxim Panov:
Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification. 9367-9385 - Yang Zhao, Li Du, Xiao Ding, Kai Xiong, Zhouhao Sun, Shi Jun, Ting Liu, Bing Qi:
Deciphering the Impact of Pretraining Data on Large Language Models through Machine Unlearning. 9386-9406 - Everlyn Chimoto, Jay Gala, Orevaoghene Ahia, Julia Kreutzer, Bruce A. Bassett, Sara Hooker:
Critical Learning Periods: Leveraging Early Training Dynamics for Efficient Data Pruning. 9407-9426 - Luca Ragazzi, Paolo Italiani, Gianluca Moro, Mattia Panni:
What Are You Token About? Differentiable Perturbed Top-k Token Selection for Scientific Document Summarization. 9427-9440 - Gabriele Picco, Leopold Fuchs, Marcos Martínez Galindo, Alberto Purpura, Vanessa López, Hoang Thanh Lam:
Description Boosting for Zero-Shot Entity and Relation Classification. 9441-9457 - Zhexuan Wang, Shudong Liu, Xuebo Liu, Miao Zhang, Derek F. Wong, Min Zhang:
Domain-Aware k-Nearest-Neighbor Knowledge Distillation for Machine Translation. 9458-9469 - Wanlong Liu, Li Zhou, Dingyi Zeng, Yichen Xiao, Shaohuan Cheng, Chen Zhang, Grandee Lee, Malu Zhang, Wenyu Chen:
Beyond Single-Event Extraction: Towards Efficient Document-Level Multi-Event Argument Extraction. 9470-9487 - Chen Xu, Jie Wang, Xiaoqian Liu, Qian Dong, Chunliang Zhang, Tong Xiao, JingBo Zhu, Dapeng Man, Wu Yang:
Revisiting Interpolation Augmentation for Speech-to-Text Generation. 9488-9499 - Dennis Ulmer, Elman Mansimov, Kaixiang Lin, Lijia Sun, Xibin Gao, Yi Zhang:
Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk. 9500-9522 - Renzhi Wang, Piji Li:
Semantic are Beacons: A Semantic Perspective for Unveiling Parameter-Efficient Fine-Tuning in Knowledge Learning. 9523-9537 - Gili Lior, Yoav Goldberg, Gabriel Stanovsky:
Leveraging Collection-Wide Similarities for Unsupervised Document Structure Extraction. 9538-9550 - Yinuo Jiang, Xiang Zhuang, Keyan Ding, Qiang Zhang, Huajun Chen:
Enhancing Cross Text-Molecule Learning by Self-Augmentation. 9551-9565 - Erxin Yu, Jing Li, Chunpu Xu:
RePALM: Popular Quote Tweet Generation via Auto-Response Augmentation. 9566-9579 - Anton Schäfer, Thomas Hofmann, Imanol Schlag, Tiago Pimentel:
On the Effect of (Near) Duplicate Subwords in Language Modelling. 9580-9597 - Frank Wildenburg, Michael Hanna, Sandro Pezzelle:
Do Pre-Trained Language Models Detect and Understand Semantic Underspecification? Ask the DUST! 9598-9613 - Wen Huang, Hongbin Liu, Minxin Guo, Neil Gong:
Visual Hallucinations of Multi-modal Large Language Models. 9614-9631 - Ran Liu, Ming Liu, Min Yu, He Zhang, Jianguo Jiang, Gang Li, Weiqing Huang:
SumSurvey: An Abstractive Dataset of Scientific Survey Papers for Long Document Summarization. 9632-9651 - Joan Santoso, Patrick Sutanto, Billy Cahyadi, Esther Irawati Setiawan:
Pushing the Limits of Low-Resource NER Using LLM Artificial Data Generation. 9652-9667 - Zhaoyi Li, Gangwei Jiang, Hong Xie, Linqi Song, Defu Lian, Ying Wei:
Understanding and Patching Compositional Reasoning in LLMs. 9668-9688 - Elena Chistova:
Bilingual Rhetorical Structure Parsing with Large Parallel Annotations. 9689-9706 - Junling Wang, Jakub Macina, Nico Daheim, Sankalan Pal Chowdhury, Mrinmaya Sachan:
Book2Dial: Generating Teacher Student Interactions from Textbooks for Cost-Effective Development of Educational Chatbots. 9707-9731 - Wenxin Liang, Tingyu Zhang, Han Liu, Feng Zhang:
SELP: A Semantically-Driven Approach for Separated and Accurate Class Prototypes in Few-Shot Text Classification. 9732-9741 - Eric Chamoun, Michael Schlichtkrull, Andreas Vlachos:
Automated Focused Feedback Generation for Scientific Writing Assistance. 9742-9763 - Zihan Chen, Song Wang, Cong Shen, Jundong Li:
FastGAS: Fast Graph-based Annotation Selection for In-Context Learning. 9764-9780 - Bowen Shen, Zheng Lin, Daren Zha, Wei Liu, Jian Luan, Bin Wang, Weiping Wang:
Pruning Large Language Models to Intra-module Low-rank Architecture with Transitional Activations. 9781-9793 - Langlin Huang, Yang Feng:
Integrating Multi-scale Contextualized Information for Byte-based Neural Machine Translation. 9794-9801 - Afra Feyza Akyürek, Ekin Akyürek, Leshem Choshen, Derry Wijaya, Jacob Andreas:
Deductive Closure Training of Language Models for Coherence, Accuracy, and Updatability. 9802-9818 - Ruiqi Li, Rongjie Huang, Yongqi Wang, Zhiqing Hong, Zhou Zhao:
Self-Supervised Singing Voice Pre-Training towards Speech-to-Singing Conversion. 9819-9831 - Andy Liu, Mona Diab, Daniel Fried:
Evaluating Large Language Model Biases in Persona-Steered Generation. 9832-9850 - Yanghai Zhang, Ye Liu, Shiwei Wu, Kai Zhang, Xukai Liu, Qi Liu, Enhong Chen:
Leveraging Entity Information for Cross-Modality Correlation Learning: The Entity-Guided Multimodal Summarization. 9851-9862 - Qian Lou, Xin Liang, Jiaqi Xue, Yancheng Zhang, Rui Xie, Mengxin Zheng:
CR-UTP: Certified Robustness against Universal Text Perturbations on Large Language Models. 9863-9875 - Rachel Wicks, Matt Post, Philipp Koehn:
Recovering document annotations for sentence-level bitext. 9876-9890 - Rui Mao, Kai He, Claudia Ong, Qian Liu, Erik Cambria:
MetaPro 2.0: Computational Metaphor Processing on the Effectiveness of Anomalous Language Modeling. 9891-9908 - Shenzhi Wang, Chang Liu, Zilong Zheng, Siyuan Qi, Shuo Chen, Qisen Yang, Andrew Zhao, Chaofei Wang, Shiji Song, Gao Huang:
Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling. 9909-9953 - Afra Amini, Tim Vieira, Ryan Cotterell:
Direct Preference Optimization with an Offset. 9954-9972 - Xize Cheng, Rongjie Huang, Linjun Li, Zehan Wang, Tao Jin, Aoxiong Yin, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao:
TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation. 9973-9986 - Jiaxu Zhao, Zijing Shi, Yitong Li, Yulong Pei, Ling Chen, Meng Fang, Mykola Pechenizkiy:
More than Minorities and Majorities: Understanding Multilateral Bias in Language Generation. 9987-10001 - Huimin Zeng, Zhenrui Yue, Yang Zhang, Lanyu Shang, Dong Wang:
Fair Federated Learning with Biased Vision-Language Models. 10002-10017 - Raghuveer Peri, Sai Muralidhar Jayanthi, Srikanth Ronanki, Anshu Bhatia, Karel Mundnich, Saket Dingliwal, Nilaksh Das, Zejiang Hou, Goeric Huybrechts, Srikanth Vishnubhotla, Daniel Garcia-Romero, Sundararajan Srinivasan, Kyu J. Han, Katrin Kirchhoff:
SpeechGuard: Exploring the Adversarial Robustness of Multi-modal Large Language Models. 10018-10035 - David Wan, Koustuv Sinha, Srini Iyer, Asli Celikyilmaz, Mohit Bansal, Ramakanth Pasunuru:
ACUEval: Fine-grained Hallucination Evaluation and Correction for Abstractive Summarization. 10036-10056 - Xiongtao Zhou, Jie He, Yuhua Ke, Guangyao Zhu, Víctor Gutiérrez-Basulto, Jeff Z. Pan:
An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models. 10057-10084 - Arda Uzunoglu, Gözde Gül Sahin, Abdulfattah Safa:
PARADISE: Evaluating Implicit Planning Skills of Language Models with Procedural Warnings and Tips Dataset. 10085-10102 - Gökçe Uludogan, Zeynep Yirmibesoglu Balal, Salih Furkan Akkurt, Meliksah Türker, Onur Güngör, Susan Üsküdarli:
TURNA: A Turkish Encoder-Decoder Language Model for Enhanced Understanding and Generation. 10103-10117 - Sirou Chen, Sakiko Yahata, Shuichiro Shimizu, Zhengdong Yang, Yihang Li, Chenhui Chu, Sadao Kurohashi:
MELD-ST: An Emotion-aware Speech Translation Dataset. 10118-10126 - Rishabh Adiga, Lakshmi Subramanian, Varun Chandrasekaran:
Designing Informative Metrics for Few-Shot Example Selection. 10127-10135 - Yiquan Wu, Anlai Zhou, Yuhang Liu, Yifei Liu, Adam Jatowt, Weiming Lu, Jun Xiao, Kun Kuang:
Chain-of-Quizzes: Pedagogy-inspired Example Selection in In-Context-Learning. 10136-10142 - Nishant Balepur, Shramay Palta, Rachel Rudinger:
It's Not Easy Being Wrong: Large Language Models Struggle with Process of Elimination Reasoning. 10143-10166 - Feng Zhang, Wei Chen, Fei Ding, Meng Gao, Tengjiao Wang, Jiahui Yao, Jiabin Zheng:
From Discrimination to Generation: Low-Resource Intent Detection with Language Model Instruction Tuning. 10167-10183 - Yong Xie, Karan Aggarwal, Aitzaz Ahmad:
Efficient Continual Pre-training for Building Domain Specific Large Language Models. 10184-10201 - Yufei Li, Xiao Yu, Yanghong Guo, Yanchi Liu, Haifeng Chen, Cong Liu:
Distantly-Supervised Joint Extraction with Noise-Robust Learning. 10202-10217 - Jinwen He, Yujia Gong, Zijin Lin, Cheng'an Wei, Yue Zhao, Kai Chen:
LLM Factoscope: Uncovering LLMs' Factual Discernment through Measuring Inner States. 10218-10230 - YiQiu Guo, Yuchen Yang, Ya Zhang, Yu Wang, Yanfeng Wang:
DictLLM: Harnessing Key-Value Data Structures with Large Language Models for Enhanced Medical Diagnostics. 10231-10241 - Huimin Wang, Yutian Zhao, Xian Wu, Yefeng Zheng:
imapScore: Medical Fact Evaluation Made Easy. 10242-10257 - Xin Zhou, Yi Lu, Ruotian Ma, Yujian Wei, Tao Gui, Qi Zhang, Xuanjing Huang:
Making Harmful Behaviors Unlearnable for Large Language Models. 10258-10273 - Congda Ma, Tianyu Zhao, Manabu Okumura:
Debiasing Large Language Models with Structured Knowledge. 10274-10287 - Tianyi Yan, Fei Wang, James Y. Huang, Wenxuan Zhou, Fan Yin, Aram Galstyan, Wenpeng Yin, Muhao Chen:
Contrastive Instruction Tuning. 10288-10302 - Yubao Tang, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Yixing Fan, Xueqi Cheng:
Bootstrapped Pre-training with Dynamic Identifier Prediction for Generative Retrieval. 10303-10317 - Haining Wang, Kang He, Bobo Li, Lei Chen, Fei Li, Xu Han, Chong Teng, Donghong Ji:
Refining and Synthesis: A Simple yet Effective Data Augmentation Framework for Cross-Domain Aspect-based Sentiment Analysis. 10318-10329 - Haibin Wu, Ho-Lam Chung, Yi-Cheng Lin, Yuan-Kuei Wu, Xuanjun Chen, Yu-Chi Pai, Hsiu-Hsuan Wang, Kai-Wei Chang, Alexander H. Liu, Hung-yi Lee:
Codec-SUPERB: An In-Depth Analysis of Sound Codec Models. 10330-10348 - Sirry Chen, Shuo Feng, Songsong Liang, Chen-Chen Zong, Jing Li, Piji Li:
CACL: Community-Aware Heterogeneous Graph Contrastive Learning for Social Media Bot Detection. 10349-10360 - Soumya Sanyal, Tianyi Xiao, Jiacheng Liu, Wenya Wang, Xiang Ren:
Are Machines Better at Complex Reasoning? Unveiling Human-Machine Inference Gaps in Entailment Verification. 10361-10386 - Ahmed Masry, Mehrad Shahmohammadi, Md. Rizwan Parvez, Enamul Hoque, Shafiq Joty:
ChartInstruct: Instruction Tuning for Chart Comprehension and Reasoning. 10387-10409 - Mengyu Bu, Shuhao Gu, Yang Feng:
Improving Multilingual Neural Machine Translation by Utilizing Semantic and Linguistic Features. 10410-10423 - Ganesh Jawahar, Haichuan Yang, Yunyang Xiong, Zechun Liu, Dilin Wang, Fei Sun, Meng Li, Aasish Pappu, Barlas Oguz, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan, Raghuraman Krishnamoorthi, Vikas Chandra:
Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts. 10424-10443 - Hyeseon Ahn, Youngwook Kim, Jungin Kim, Yo-Sub Han:
SharedCon: Implicit Hate Speech Detection using Shared Semantics. 10444-10455 - Dheeraj Mekala, Alex Nguyen, Jingbo Shang:
Smaller Language Models are capable of selecting Instruction-Tuning Training Data for Larger Language Models. 10456-10470 - Qiusi Zhan, Zhixiang Liang, Zifan Ying, Daniel Kang:
InjecAgent: Benchmarking Indirect Prompt Injections in Tool-Integrated Large Language Model Agents. 10471-10506 - Xiaohu Du, Ming Wen, Jiahao Zhu, Zifan Xie, Bin Ji, Huijun Liu, Xuanhua Shi, Hai Jin:
Generalization-Enhanced Code Vulnerability Detection via Multi-Task Instruction Fine-Tuning. 10507-10521 - Wenhui Liao, Jiapeng Wang, Zening Lin, Longfei Xiong, Lianwen Jin:
PPTSER: A Plug-and-Play Tag-guided Method for Few-shot Semantic Entity Recognition on Visually-rich Documents. 10522-10539 - Ganesh Jawahar, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan, Dujian Ding:
LLM Performance Predictors are good initializers for Architecture Search. 10540-10560 - Chen Gong, Dexin Kong, Suxian Zhao, Xingyu Li, Guohong Fu:
MODDP: A Multi-modal Open-domain Chinese Dataset for Dialogue Discourse Parsing. 10561-10573 - Wei Zhai, Hongzhi Qi, Qing Zhao, Jianqiang Li, Ziqi Wang, Han Wang, Bing Yang, Guanghui Fu:
Chinese MentalBERT: Domain-Adaptive Pre-training on Social Media for Chinese Mental Health Text Analysis. 10574-10585 - Zhanhui Zhou, Jie Liu, Jing Shao, Xiangyu Yue, Chao Yang, Wanli Ouyang, Yu Qiao:
Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization. 10586-10613 - Lirong Gao, Ru Peng, Yiming Zhang, Junbo Zhao:
DORY: Deliberative Prompt Recovery for LLM. 10614-10632 - Yue Chen, Chen Huang, Yang Deng, Wenqiang Lei, Dingnan Jin, Jia Liu, Tat-Seng Chua:
STYLE: Improving Domain Transferability of Asking Clarification Questions in Large Language Model Powered Conversational Agents. 10633-10649 - Xuming Hu, Xiaochuan Li, Junzhe Chen, Yinghui Li, Yangning Li, Xiaoguang Li, Yasheng Wang, Qun Liu, Lijie Wen, Philip S. Yu, Zhijiang Guo:
Evaluating Robustness of Generative Search Engine on Adversarial Factoid Questions. 10650-10671 - Cho-Jui Hsieh, Si Si, Felix Yu, Inderjit S. Dhillon:
Automatic Engineering of Long Prompts. 10672-10685 - Nuwa Xi, Yuhan Chen, Sendong Zhao, Haochun Wang, GongZhang GongZhang, Bing Qin, Ting Liu:
AS-ES Learning: Towards efficient CoT learning in small models. 10686-10697 - Jihyung Kil, Farideh Tavazoee, Dongyeop Kang, Joo-Kyung Kim:
II-MMR: Identifying and Improving Multi-modal Multi-hop Reasoning in Visual Question Answering. 10698-10709 - Pooja Guhan, Uttaran Bhattacharya, Somdeb Sarkhel, Vahid Azizi, Xiang Chen, Saayan Mitra, Aniket Bera, Dinesh Manocha:
TAME-RD: Text Assisted Replication of Image Multi-Adjustments for Reverse Designing. 10710-10727 - Kaiyi Zhang, Ang Lv, Yuhan Chen, Hansen Ha, Tao Xu, Rui Yan:
Batch-ICL: Effective, Efficient, and Order-Agnostic In-Context Learning. 10728-10739 - Tahir Javed, Janki Nawale, Eldho Ittan George, Sakshi Joshi, Kaushal Santosh Bhogale, Deovrat Mehendale, Ishvinder Virender Sethi, Aparna Ananthanarayanan, Hafsah Faquih, Pratiti Palit, Sneha Ravishankar, Saranya Sukumaran, Tripura Panchagnula, Sunjay Murali, Kunal Sharad Gandhi, Ambujavalli R, Manickam K. M, C. Venkata Vaijayanthi, Krishnan Srinivasa Raghavan Karunganni, Pratyush Kumar, Mitesh M. Khapra:
IndicVoices: Towards building an Inclusive Multilingual Speech Dataset for Indian Languages. 10740-10782 - Kaiwen Zhou, Kwonjoon Lee, Teruhisa Misu, Xin Wang:
ViCor: Bridging Visual Understanding and Commonsense Reasoning with Large Language Models. 10783-10795 - Yuanzhen Xie, Xinzhou Jin, Tao Xie, Matrixmxlin Matrixmxlin, Liang Chen, Chenyun Yu, Cheng Lei, Chengxiang Zhuo, Bo Hu, Zang Li:
Decomposition for Enhancing Attention: Improving LLM-based Text-to-SQL through Workflow Paradigm. 10796-10816 - Linlin Zong, Jiahui Zhou, Wenmin Lin, Xinyue Liu, Xianchao Zhang, Bo Xu:
Unveiling Opinion Evolution via Prompting and Diffusion for Short Video Fake News Detection. 10817-10826 - Abhinav Joshi, Romit Mohanty, Mounika Kanakanti, Andesha Mangla, Sudeep Choudhary, Monali Barbate, Ashutosh Modi:
iSign: A Benchmark for Indian Sign Language Processing. 10827-10844 - Wentao Ye, Jiaqi Hu, Liyao Li, Haobo Wang, Gang Chen, Junbo Zhao:
Data Contamination Calibration for Black-box LLMs. 10845-10861 - Tian Yu, Shaolei Zhang, Yang Feng:
Truth-Aware Context Selection: Mitigating Hallucinations of Large Language Models Being Misled by Untruthful Contexts. 10862-10884 - Menglong Cui, Jiangcun Du, Shaolin Zhu, Deyi Xiong:
Efficiently Exploring Large Language Models for Document-Level Machine Translation with In-context Learning. 10885-10897 - Yixuan Wang, Baoxin Wang, Yijun Liu, Qingfu Zhu, Dayong Wu, Wanxiang Che:
Improving Grammatical Error Correction via Contextual Data Augmentation. 10898-10910 - Qi Zhang, Yiming Zhang, Haobo Wang, Junbo Zhao:
RECOST: External Knowledge Guided Data-efficient Instruction Tuning. 10911-10921 - Katharina Hämmerl, Jindrich Libovický, Alexander Fraser:
Understanding Cross-Lingual Alignment - A Survey. 10922-10943 - Chenyuan Wu, Gangwei Jiang, Defu Lian:
Mitigate Negative Transfer with Similarity Heuristic Lifelong Prompt Tuning. 10944-10959 - An Liu, Zonghan Yang, Zhenhe Zhang, Qingyuan Hu, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu:
PANDA: Preference Adaptation for Enhancing Domain-Specific Abilities of LLMs. 10960-10977 - Albert Sawczyn, Katsiaryna Viarenich, Konrad Wojtasik, Aleksandra Domogala, Marcin Oleksy, Maciej Piasecki, Tomasz Kajdanowicz:
Developing PUGG for Polish: A Modern Approach to KBQA, MRC, and IR Dataset Construction. 10978-10996 - Zijin Hong, Zheng Yuan, Hao Chen, Qinggang Zhang, Feiran Huang, Xiao Huang:
Knowledge-to-SQL: Enhancing SQL Generation with Data Expert LLM. 10997-11008 - Hiroyuki Deguchi, Yusuke Sakai, Hidetaka Kamigaito, Taro Watanabe, Hideki Tanaka, Masao Utiyama:
Centroid-Based Efficient Minimum Bayes Risk Decoding. 11009-11018 - Han-Cheng Yu, Yu-An Shih, Kin-Man Law, Kai-Yu Hsieh, Yu-Chen Cheng, Hsin-Chih Ho, Zih-An Lin, Wen-Chuan Hsu, Yao-Chung Fan:
Enhancing Distractor Generation for Multiple-Choice Questions with Retrieval Augmented Pretraining and Knowledge Graph Integration. 11019-11029 - Andrew Parry, Sean MacAvaney, Debasis Ganguly:
Exploiting Positional Bias for Query-Agnostic Generative Content in Search. 11030-11047 - Moran Yanuka, Morris Alper, Hadar Averbuch-Elor, Raja Giryes:
ICC : Quantifying Image Caption Concreteness for Multimodal Dataset Curation. 11048-11064 - Lin Long, Rui Wang, Ruixuan Xiao, Junbo Zhao, Xiao Ding, Gang Chen, Haobo Wang:
On LLMs-Driven Synthetic Data Generation, Curation, and Evaluation: A Survey. 11065-11082 - Li Du, Holden Lee, Jason Eisner, Ryan Cotterell:
When is a Language Process a Language Model? 11083-11094 - Jimin Hong, Gibbeum Lee, Jaewoong Cho:
Accelerating Multilingual Language Model for Excessively Tokenized Languages. 11095-11111 - Yi Han, Ryohei Sasano, Koichi Takeda:
Definition Generation for Automatically Induced Semantic Frame. 11112-11118 - Yongqi Li, Zhen Zhang, Wenjie Wang, Liqiang Nie, Wenjie Li, Tat-Seng Chua:
Distillation Enhanced Generative Retrieval. 11119-11129 - Krishanu Maity, Poornash Sangeetha, Sriparna Saha, Pushpak Bhattacharyya:
ToxVidLM: A Multimodal Framework for Toxicity Detection in Code-Mixed Videos. 11130-11142 - Zhicheng Guo, Sijie Cheng, Hao Wang, Shihao Liang, Yujia Qin, Peng Li, Zhiyuan Liu, Maosong Sun, Yang Liu:
StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models. 11143-11156 - Weixiang Zhao, Zhuojun Li, Shilong Wang, Yang Wang, Yulin Hu, Yanyan Zhao, Chen Wei, Bing Qin:
Both Matter: Enhancing the Emotional Intelligence of Large Language Models without Compromising the General Intelligence. 11157-11176 - Jiyoung Lee, Minwoo Kim, Seungho Kim, Junghwan Kim, Seunghyun Won, Hwaran Lee, Edward Choi:
KorNAT: LLM Alignment Benchmark for Korean Social Values and Common Knowledge. 11177-11213 - Pranab Sahoo, Ayush Kumar Singh, Sriparna Saha, Aman Chadha, Samrat Mondal:
Enhancing Adverse Drug Event Detection with Multimodal Dataset: Corpus Creation and Model Development. 11214-11226 - Wuttikorn Ponwitayarat, Peerat Limkonchotiwat, Ekapol Chuangsuwanich, Sarana Nutanong:
Space Decomposition for Sentence Embedding. 11227-11239 - Camilla Casula, Elisa Leonardelli, Sara Tonelli:
Don't Augment, Rewrite? Assessing Abusive Language Detection with Synthetic Data. 11240-11247 - Francis Zheng, Edison Marrese-Taylor, Yutaka Matsuo:
Improving Low-Resource Machine Translation for Formosan Languages Using Bilingual Lexical Resources. 11248-11259 - Haonan Li, Yixuan Zhang, Fajri Koto, Yifei Yang, Hai Zhao, Yeyun Gong, Nan Duan, Timothy Baldwin:
CMMLU: Measuring massive multitask language understanding in Chinese. 11260-11285 - Seongyun Lee, Seungone Kim, Sue Hyun Park, Geewook Kim, Minjoon Seo:
Prometheus-Vision: Vision-Language Model as a Judge for Fine-Grained Evaluation. 11286-11315 - Xiaoyuan Li, Wenjie Wang, Moxin Li, Junrong Guo, Yang Zhang, Fuli Feng:
Evaluating Mathematical Reasoning of Large Language Models: A Focus on Error Identification and Correction. 11316-11360 - Michele Mastromattei, Fabio Massimo Zanzotto:
Less is KEN: a Universal and Simple Non-Parametric Pruning Algorithm for Large Language Models. 11361-11374 - Shiyu Ni, Keping Bi, Jiafeng Guo, Xueqi Cheng:
When Do LLMs Need Retrieval Augmentation? Mitigating LLMs' Overconfidence Helps Retrieval Augmentation. 11375-11388 - Chenglong Wang, Hang Zhou, Kaiyan Chang, Bei Li, Yongyu Mu, Tong Xiao, Tongran Liu, JingBo Zhu:
Hybrid Alignment Training for Large Language Models. 11389-11403 - Zhuocheng Gong, Jiahao Liu, Ziyue Wang, Pengfei Wu, Jingang Wang, Xunliang Cai, Dongyan Zhao, Rui Yan:
Graph-Structured Speculative Decoding. 11404-11415 - Chaoyi Zhu, Jeroen Galjaard, Pin-Yu Chen, Lydia Y. Chen:
Duwak: Dual Watermarks in Large Language Models. 11416-11436 - Qibing Ren, Chang Gao, Jing Shao, Junchi Yan, Xin Tan, Wai Lam, Lizhuang Ma:
CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion. 11437-11452 - Qingyan Guo, Rui Wang, Junliang Guo, Xu Tan, Jiang Bian, Yujiu Yang:
Mitigating Reversal Curse in Large Language Models via Semantic-aware Permutation Training. 11453-11464 - Biao Fu, Kai Fan, Minpeng Liao, Yidong Chen, Xiaodong Shi, Zhongqiang Huang:
wav2vec-S: Adapting Pre-trained Speech Models for Streaming. 11465-11480 - Anirudh Phukan, Shwetha Somasundaram, Apoorv Saxena, Koustava Goswami, Balaji Vasan Srinivasan:
Peering into the Mind of Language Models: An Approach for Attribution in Contextual Question Answering. 11481-11495 - Martin Gubri, Dennis Ulmer, Hwaran Lee, Sangdoo Yun, Seong Joon Oh:
TRAP: Targeted Random Adversarial Prompt Honeypot for Black-Box Identification. 11496-11517 - Jianing Zhou, Ziheng Zeng, Hongyu Gong, Suma Bhat:
CLASP: Cross-modal Alignment Using Pre-trained Unimodal Models. 11518-11531 - Guiyang Hou, Wenqi Zhang, Yongliang Shen, Linjuan Wu, Weiming Lu:
TimeToM: Temporal Space is the Key to Unlocking the Door of Large Language Models' Theory-of-Mind. 11532-11547 - Sitiporn Sae Lim, Can Udomcharoenchaikit, Peerat Limkonchotiwat, Ekapol Chuangsuwanich, Sarana Nutanong:
Identifying and Mitigating Annotation Bias in Natural Language Understanding using Causal Mediation Analysis. 11548-11563 - Ruchit Rawal, Mariya Toneva:
Perturbed examples reveal invariances shared by language models. 11564-11584 - Yiwei Li, Fei Mi, Yitong Li, Yasheng Wang, Bin Sun, Shaoxiong Feng, Kan Li:
Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation. 11585-11596 - Yang Sun, Guanrong Chen, Caihua Yang, Jianzhu Bao, Bin Liang, Xi Zeng, Min Yang, Ruifeng Xu:
Discourse Structure-Aware Prefix for Generation-Based End-to-End Argumentation Mining. 11597-11613 - Peiwen Yuan, Shaoxiong Feng, Yiwei Li, Xinglin Wang, Boyuan Pan, Heda Wang, Yao Hu, Kan Li:
Poor-Supervised Evaluation for SuperLLM via Mutual Consistency. 11614-11627 - Tian Liang, Xing Wang, Mingming Yang, Yujiu Yang, Shuming Shi, Zhaopeng Tu:
Addressing Entity Translation Problem via Translation Difficulty and Context Diversity. 11628-11638 - Chongyang Tao, Chang Liu, Tao Shen, Can Xu, Xiubo Geng, Binxing Jiao, Daxin Jiang:
ADAM: Dense Retrieval Distillation with Adaptive Dark Examples. 11639-11651 - Yijin Liu, Xianfeng Zeng, Chenze Shao, Fandong Meng, Jie Zhou:
Instruction Position Matters in Sequence Generation with Large Language Models. 11652-11663 - Yuanhang Yang, Shiyi Qi, Wenchao Gu, Chaozheng Wang, Cuiyun Gao, Zenglin Xu:
XMoE: Sparse Models with Fine-grained and Adaptive Expert Selection. 11664-11674 - Yijin Liu, Xianfeng Zeng, Fandong Meng, Jie Zhou:
BranchNorm: Robustly Scaling Extremely Deep Transformers. 11675-11687 - Tingyi Zhang, Jiaan Wang, Zhixu Li, Jianfeng Qu, An Liu, Zhigang Chen, Hongping Zhi:
MusTQ: A Temporal Knowledge Graph Question Answering Dataset for Multi-Step Temporal Reasoning. 11688-11699 - Anthony Sicilia, Hyunwoo Kim, Khyathi Raghavi Chandu, Malihe Alikhani, Jack Hessel:
Deal, or no deal (or who knows)? Forecasting Uncertainty in Conversations using Large Language Models. 11700-11726 - Guodong Du, Jing Li, Hanting Liu, Runhua Jiang, Shuyang Yu, Yifei Guo, Sim Kuan Goh, Ho-Kin Tang:
Knowledge Fusion By Evolving Weights of Language Models. 11727-11742 - Markus Frohmann, Carolin Holtermann, Shahed Masoudian, Anne Lauscher, Navid Rekabsaz:
ScaLearn: Simple and Highly Parameter-Efficient Task Transfer by Learning to Scale. 11743-11776 - Chang-Sheng Kao, Yun-Nung Chen:
Visualizing Dialogues: Enhancing Image Selection through Dialogue Understanding with Large Language Models. 11777-11788 - Zhiyu Yang, Zihan Zhou, Shuo Wang, Xin Cong, Xu Han, Yukun Yan, Zhenghao Liu, Zhixing Tan, Pengyuan Liu, Dong Yu, Zhiyuan Liu, Xiaodong Shi, Maosong Sun:
MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization. 11789-11804 - Jianpeng Hu, Chengxiang Tan, Jiacheng Xu, Xiangyun Kong:
Continual Few-shot Relation Extraction via Adaptive Gradient Correction and Knowledge Decomposition. 11805-11816 - Linhao Yu, Yongqi Leng, Yufei Huang, Shang Wu, Haixin Liu, Xinmeng Ji, Jiahui Zhao, Jinwang Song, Tingting Cui, Xiaoqing Cheng, Liutao Liutao, Deyi Xiong:
CMoralEval: A Moral Evaluation Benchmark for Chinese Large Language Models. 11817-11837 - Guillem Ramírez, Matthias Lindemann, Alexandra Birch, Ivan Titov:
Cache & Distil: Optimising API Calls to Large Language Models. 11838-11853 - Sara Marjanovic, Isabelle Augenstein, Christina Lioma:
Investigating the Impact of Model Instability on Explanations and Uncertainty. 11854-11879 - Longhui Zhang, Yanzhao Zhang, Dingkun Long, Pengjun Xie, Meishan Zhang, Min Zhang:
A Two-Stage Adaptation of Large Language Models for Text Ranking. 11880-11891 - Daniela Occhipinti, Michele Marchi, Irene Mondella, Huiyuan Lai, Felice Dell'Orletta, Malvina Nissim, Marco Guerini:
Fine-tuning with HED-IT: The impact of human post-editing for dialogical language models. 11892-11907 - Xinran Chen, Xuanang Chen, Ben He, Tengfei Wen, Le Sun:
Analyze, Generate and Refine: Query Expansion with LLMs for Zero-Shot Open-Domain QA. 11908-11922 - Siddhant Arora, Ankita Pasad, Chung-Ming Chien, Jionghao Han, Roshan S. Sharma, Jee-weon Jung, Hira Dhamyal, William Chen, Suwon Shon, Hung-yi Lee, Karen Livescu, Shinji Watanabe:
On the Evaluation of Speech Foundation Models for Spoken Language Understanding. 11923-11938 - Xianfeng Zeng, Yijin Liu, Fandong Meng, Jie Zhou:
Towards Multiple References Era - Addressing Data Leakage and Limited Reference Diversity in Machine Translation Evaluation. 11939-11951 - Christopher Davis, Andrew Caines, Øistein E. Andersen, Shiva Taslimipoor, Helen Yannakoudakis, Zheng Yuan, Christopher Bryant, Marek Rei, Paula Buttery:
Prompting open-source and commercial language models for grammatical error correction of English learner text. 11952-11967 - Christin Kreutz, Fabian Haak, Björn Engelmann, Philipp Schaer:
BATS: BenchmArking Text Simplicity �. 11968-11989 - Pia Pachinger, Janis Goldzycher, Anna Maria Planitzer, Wojciech Kusa, Allan Hanbury, Julia Neidhardt:
AustroTox: A Dataset for Target-Based Austrian German Offensive Language Detection. 11990-12001 - Megan Ayers, Luke Sanford, Margaret E. Roberts, Eddie Yang:
Discovering influential text using convolutional neural networks. 12002-12027 - Mengna Zhu, Kaisheng Zeng, Jibing Wu, Lihua Liu, Hongbin Huang, Lei Hou, Juanzi Li:
LC4EE: LLMs as Good Corrector for Event Extraction. 12028-12038 - Yihong Dong, Xue Jiang, Huanyu Liu, Zhi Jin, Bin Gu, Mengfei Yang, Ge Li:
Generalization or Memorization: Data Contamination and Trustworthy Evaluation for Large Language Models. 12039-12050 - Ashutosh Sathe, Sunita Sarawagi:
Efficient Training of Language Models with Compact and Consistent Next Token Distributions. 12051-12064 - Yang Chi, Fausto Giunchiglia, Chuntao Li, Hao Xu:
Ancient Chinese Glyph Identification Powered by Radical Semantics. 12065-12074 - Settaluri Lakshmi Sravanthi, Meet Doshi, Pavan Tankala, V. Rudra Murthy, Raj Dabre, Pushpak Bhattacharyya:
PUB: A Pragmatics Understanding Benchmark for Assessing LLMs' Pragmatics Capabilities. 12075-12097 - Huan Zhao, Xupeng Zha, Zixing Zhang:
EmoTransKG: An Innovative Emotion Knowledge Graph to Reveal Emotion Transformation. 12098-12110 - Fei Yuan, Shuai Yuan, Zhiyong Wu, Lei Li:
How Vocabulary Sharing Facilitates Multilingualism in LLaMA? 12111-12130 - Runzhe Zhan, Xinyi Yang, Derek F. Wong, Lidia S. Chao, Yue Zhang:
Prefix Text as a Yarn: Eliciting Non-English Alignment in Foundation Language Model. 12131-12145 - Sishi Xiong, Yu Zhao, Jie Zhang, Mengxiang Li, Zhongjiang He, Xuelong Li, Shuangyong Song:
Dual Prompt Tuning based Contrastive Learning for Hierarchical Text Classification. 12146-12158 - Hetong Wang, Pasquale Minervini, Edoardo M. Ponti:
Probing the Emergence of Cross-lingual Alignment during LLM Training. 12159-12173 - Wenhua Nie, Lin Deng, Chang-Bo Liu, Jialing Wei, Ruitong Han, Haoran Zheng:
STSPL-SSC: Semi-Supervised Few-Shot Short Text Clustering with Semantic text similarity Optimized Pseudo-Labels. 12174-12185 - Renren Jin, Jiangcun Du, Wuwei Huang, Wei Liu, Jian Luan, Bin Wang, Deyi Xiong:
A Comprehensive Evaluation of Quantization Strategies for Large Language Models. 12186-12215 - Abudurexiti Reheman, Yingfeng Luo, Junhao Ruan, Chunliang Zhang, Anxiang Ma, Tong Xiao, JingBo Zhu:
Exploiting Target Language Data for Neural Machine Translation Beyond Back Translation. 12216-12228 - Francesco Tonolini, Nikolaos Aletras, Jordan Massiah, Gabriella Kazai:
Bayesian Prompt Ensembles: Model Uncertainty Estimation for Black-Box Large Language Models. 12229-12272 - Qian Wang, Jia-Chen Gu, Zhen-Hua Ling:
X-ACE: Explainable and Multi-factor Audio Captioning Evaluation. 12273-12287 - Weiwen Xu, Deng Cai, Zhisong Zhang, Wai Lam, Shuming Shi:
Reasons to Reject? Aligning Language Models with Judgments. 12288-12304 - Yuhang He, Jianzhu Bao, Yang Sun, Bin Liang, Min Yang, Bing Qin, Ruifeng Xu:
Decomposing Argumentative Essay Generation via Dialectical Planning of Complex Reasoning. 12305-12322 - Tariq Alhindi, Smaranda Muresan, Preslav Nakov:
Large Language Models are Few-Shot Training Example Generators: A Case Study in Fallacy Recognition. 12323-12334 - Michal Stefánik, Marek Kadlcík, Petr Sojka:
Concept-aware Data Construction Improves In-context Learning of Language Models. 12335-12352 - Gerard Yeo, Shaz Furniturewala, Kokil Jaidka:
Beyond Text: Leveraging Multi-Task Learning and Cognitive Appraisal Theory for Post-Purchase Intention Analysis. 12353-12360 - Haoran Li, Zhanming Jie, Wei Lu:
Non-Autoregressive Machine Translation as Constrained HMM. 12361-12372 - Bin Liang, Ang Li, Jingqian Zhao, Lin Gui, Min Yang, Yue Yu, Kam-Fai Wong, Ruifeng Xu:
Multi-modal Stance Detection: New Datasets and Model. 12373-12387 - Farima Fatahi Bayat, Xin Liu, H. V. Jagadish, Lu Wang:
Enhanced Language Model Truthfulness with Learnable Intervention and Uncertainty Expression. 12388-12400 - Duzhen Zhang, Yahan Yu, Jiahua Dong, Chenxing Li, Dan Su, Chenhui Chu, Dong Yu:
MM-LLMs: Recent Advances in MultiModal Large Language Models. 12401-12430 - Yizhi Li, Ge Zhang, Xingwei Qu, Jiali Li, Zhaoqun Li, Noah Wang, Hao Li, Ruibin Yuan, Yinghao Ma, Kai Zhang, Wangchunshu Zhou, Yiming Liang, Lei Zhang, Lei Ma, Jiajun Zhang, Zuowen Li, Wenhao Huang, Chenghua Lin, Jie Fu:
CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models. 12431-12446 - Mathieu Rita, Florian Strub, Rahma Chaabouni, Paul Michel, Emmanuel Dupoux, Olivier Pietquin:
Countering Reward Over-Optimization in LLM with Demonstration-Guided Reinforcement Learning. 12447-12472 - Wei He, Marco Idiart, Carolina Scarton, Aline Villavicencio:
Enhancing Idiomatic Representation in Multiple Languages via an Adaptive Contrastive Triplet Loss. 12473-12485 - Kai Lv, Hang Yan, Qipeng Guo, Haijun Lv, Xipeng Qiu:
AdaLomo: Low-memory Optimization with Adaptive Learning Rate. 12486-12502 - Wenyue Hua, Jiang Guo, Mingwen Dong, Henghui Zhu, Patrick Ng, Zhiguo Wang:
Propagation and Pitfalls: Reasoning-based Assessment of Knowledge Editing through Counterfactual Tasks. 12503-12525 - Anthony Hills, Talia Tseriotou, Xenia Miscouridou, Adam Tsakalidis, Maria Liakata:
Exciting Mood Changes: A Time-aware Hierarchical Transformer for Change Detection Modelling. 12526-12537 - Xiwen Liang, Liang Ma, Shanshan Guo, Jianhua Han, Hang Xu, Shikui Ma, Xiaodan Liang:
CorNav: Autonomous Agent with Self-Corrected Planning for Zero-Shot Vision-and-Language Navigation. 12538-12559 - Siwei Wu, Yizhi Li, Kang Zhu, Ge Zhang, Yiming Liang, Kaijing Ma, Chenghao Xiao, Haoran Zhang, Bohao Yang, Wenhu Chen, Wenhao Huang, Noura Al Moubayed, Jie Fu, Chenghua Lin:
SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval. 12560-12574 - Chinmaya Devaraj, Cornelia Fermüller, Yiannis Aloimonos:
Diving Deep into the Motion Representation of Video-Text Models. 12575-12584 - Nihal V. Nayak, Yiyang Nan, Avi Trost, Stephen H. Bach:
Learning to Generate Instruction Tuning Datasets for Zero-Shot Task Adaptation. 12585-12611 - Anirudh Som, Karan Sikka, Helen Gent, Ajay Divakaran, Andreas Kathol, Dimitra Vergyri:
Demonstrations Are All You Need: Advancing Offensive Content Paraphrasing using In-Context Learning. 12612-12627 - Khiem Phi, Noushin Salek Faramarzi, Chenlu Wang, Ritwik Banerjee:
Paying Attention to Deflections: Mining Pragmatic Nuances for Whataboutism Detection in Online Discourse. 12628-12643 - Minsu Kim, James Thorne:
Epistemology of Language Models: Do Language Models Have Holistic Knowledge? 12644-12669 - Swarnadeep Bhar, Nicholas Asher:
Strong hallucinations from negation and how to fix them. 12670-12687 - Yiqi Liu, Nafise Sadat Moosavi, Chenghua Lin:
LLMs as Narcissistic Evaluators: When Ego Inflates Evaluation Scores. 12688-12701 - Tim Franzmeyer, Aleksandar Shtedritski, Samuel Albanie, Philip Torr, João F. Henriques, Jakob N. Foerster:
HelloFresh: LLM Evalutions on Streams of Real-World Human Editorial Actions across X Community Notes and Wikipedia edits. 12702-12716 - Aswin RRV, Nemika Tyagi, Md Nayem Uddin, Neeraj Varshney, Chitta Baral:
Chaos with Keywords: Exposing Large Language Models Sycophancy to Misleading Keywords and Evaluating Defense Strategies. 12717-12733 - Yichuan Li, Kaize Ding, Jianling Wang, Kyumin Lee:
Empowering Large Language Models for Textual Data Augmentation. 12734-12751 - Lukas Garbaciauskas, Max Ploner, Alan Akbik:
Choose Your Transformer: Improved Transferability Estimation of Transformer Models on Classification Tasks. 12752-12768 - I-Hung Hsu, Zihan Xue, Nilay Pochhi, Sahil Bansal, Prem Natarajan, Jayanth Srinivasa, Nanyun Peng:
Argument-Aware Approach To Event Linking. 12769-12781 - I-Hung Hsu, Zifeng Wang, Long T. Le, Lesly Miculicich, Nanyun Peng, Chen-Yu Lee, Tomas Pfister:
CaLM: Contrasting Large and Small Language Models to Verify Grounded Generation. 12782-12803 - Kuan-Hao Huang, I-Hung Hsu, Tanmay Parekh, Zhiyu Xie, Zixuan Zhang, Prem Natarajan, Kai-Wei Chang, Nanyun Peng, Heng Ji:
TextEE: Benchmark, Reevaluation, Reflections, and Future Challenges in Event Extraction. 12804-12825 - Jay Cunningham, Su Lin Blodgett, Michael Madaio, Hal Daumé III, Christina Harrington, Hanna M. Wallach:
Understanding the Impacts of Language Technologies' Performance Disparities on African American Language Speakers. 12826-12833 - Tianyu Zheng, Ge Zhang, Tianhao Shen, Xueling Liu, Bill Yuchen Lin, Jie Fu, Wenhu Chen, Xiang Yue:
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement. 12834-12859 - Caleb Ziems, William Held, Jane Dwivedi-Yu, Diyi Yang:
Measuring and Addressing Indexical Bias in Information Retrieval. 12860-12877 - Zaid Alyafeai, Khalid Almubarak, Ahmed Ashraf, Deema Alnuhait, Saied Alshahrani, Gubran A. Q. Abdulrahman, Gamil Ahmed, Qais Gawah, Zead Saleh, Mustafa Ghaleb, Yousef Ali, Maged S. Al-Shaibani:
CIDAR: Culturally Relevant Instruction Dataset For Arabic. 12878-12901 - Jean-Benoit Delbrouck, Pierre J. Chambon, Zhihong Chen, Maya Varma, Andrew Johnston, Louis Blankemeier, Dave Van Veen, Tan Bui, Steven Quoc Hung Truong, Curtis P. Langlotz:
RadGraph-XL: A Large-Scale Expert-Annotated Dataset for Entity and Relation Extraction from Radiology Reports. 12902-12915 - H. S. V. N. S. Kowndinya Renduchintala, Sumit Bhatia, Ganesh Ramakrishnan:
SMART: Submodular Data Mixture Strategy for Instruction Tuning. 12916-12934 - Tejas Srinivasan, Jack Hessel, Tanmay Gupta, Bill Yuchen Lin, Yejin Choi, Jesse Thomason, Khyathi Raghavi Chandu:
Selective "Selective Prediction": Reducing Unnecessary Abstention in Vision-Language Reasoning. 12935-12948 - Jonne Sälevä, Constantine Lignos:
Language Model Priors and Data Augmentation Strategies for Low-resource Machine Translation: A Case Study Using Finnish to Northern Sámi. 12949-12956 - James Flemings, Murali Annavaram:
Differentially Private Knowledge Distillation via Synthetic Text Generation. 12957-12968 - Fangyuan Xu, Kyle Lo, Luca Soldaini, Bailey Kuehl, Eunsol Choi, David Wadden:
KIWI: A Dataset of Knowledge-Intensive Writing Instructions for Answering Research Questions. 12969-12990 - Faisal Tareque Shohan, Mir Tafseer Nayeem, Samsul Islam, Abu Ubaida Akash, Shafiq Joty:
XL-HeadTags: Leveraging Multimodal Retrieval Augmentation for the Multilingual Generation of News Headlines and Tags. 12991-13024 - Yiwei Qin, Kaiqiang Song, Yebowen Hu, Wenlin Yao, Sangwoo Cho, Xiaoyang Wang, Xuansheng Wu, Fei Liu, Pengfei Liu, Dong Yu:
InFoBench: Evaluating Instruction Following Ability in Large Language Models. 13025-13048 - Muhammad Shihab Rashid, Jannat Ara Meem, Yue Dong, Vagelis Hristidis:
EcoRank: Budget-Constrained Text Re-ranking Using Large Language Models. 13049-13063 - Gagan Bhatia, El Moatez Billah Nagoudi, Hasan Cavusoglu, Muhammad Abdul-Mageed:
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models. 13064-13087 - Zhiqing Sun, Sheng Shen, Shengcao Cao, Haotian Liu, Chunyuan Li, Yikang Shen, Chuang Gan, Liangyan Gui, Yu-Xiong Wang, Yiming Yang, Kurt Keutzer, Trevor Darrell:
Aligning Large Multimodal Models with Factually Augmented RLHF. 13088-13110 - Neeraj Varshney, Pavel Dolin, Agastya Seth, Chitta Baral:
The Art of Defending: A Systematic Evaluation and Analysis of LLM Defense Strategies on Safety and Over-Defensiveness. 13111-13128 - Jannat Ara Meem, Muhammad Shihab Rashid, Yue Dong, Vagelis Hristidis:
PAT-Questions: A Self-Updating Benchmark for Present-Anchored Temporal Question-Answering. 13129-13148 - Shen Gao, Hao Li, Zhengliang Shi, Chengrui Huang, Quan Tu, Shuo Shang, Zhiliang Tian, Minlie Huang:
360°REA: Towards A Reusable Experience Accumulation with 360° Assessment for Multi-Agent System. 13149-13162 - Ghazal Khalighinejad, Defne Circi, L. Catherine Brinson, Bhuwan Dhingra:
Extracting Polymer Nanocomposite Samples from Full-Length Documents. 13163-13175 - Alicia Tsai, Adam Kraft, Long Jin, Chenwei Cai, Anahita Hosseini, Taibai Xu, Zemin Zhang, Lichan Hong, Ed Huai-hsin Chi, Xinyang Yi:
Leveraging LLM Reasoning Enhances Personalized Recommender Systems. 13176-13188 - AbdelRahim A. Elmadany, Ife Adebara, Muhammad Abdul-Mageed:
Toucan: Many-to-Many Translation for 150 African Language Pairs. 13189-13206 - Zhouhang Xie, Bodhisattwa Prasad Majumder, Mengjie Zhao, Yoshinori Maeda, Keiichi Yamada, Hiromi Wakaki, Julian J. McAuley:
Few-shot Dialogue Strategy Learning for Motivational Interviewing via Inductive Reasoning. 13207-13219 - Ryoma Kumon, Daiki Matsuoka, Hitomi Yanaka:
Evaluating Structural Generalization in Neural Machine Translation. 13220-13239 - Gregorios A. Katsios, Ning Sa, Tomek Strzalkowski:
Figuratively Speaking: Authorship Attribution via Multi-Task Figurative Language Modeling. 13240-13255 - Yujun Mao, Yoon Kim, Yilun Zhou:
CHAMP: A Competition-level Dataset for Fine-Grained Analyses of LLMs' Mathematical Reasoning Capabilities. 13256-13274 - Jiali Zeng, Fandong Meng, Yongjing Yin, Jie Zhou:
Improving Machine Translation with Large Language Models: A Preliminary Study with Cooperative Decoding. 13275-13288 - Yukiya Hono, Koh Mitsuda, Tianyu Zhao, Kentaro Mitsui, Toshiaki Wakatsuki, Kei Sawada:
Integrating Pre-Trained Speech and Language Models for End-to-End Speech Recognition. 13289-13305 - Johnny Tian-Zheng Wei, Ryan Yixiang Wang, Robin Jia:
Proving membership in LLM pretraining data via data watermarks. 13306-13320 - Dongxu Zhang, Varun Gangal, Barrett Martin Lattimer, Yi Yang:
Enhancing Hallucination Detection through Perturbation-Based Synthetic Data Generation in System Responses. 13321-13332 - Jinglong Luo, Yehong Zhang, Zhuo Zhang, Jiaqi Zhang, Xin Mu, Hui Wang, Yue Yu, Zenglin Xu:
SecFormer: Fast and Accurate Privacy-Preserving Inference for Transformer Models via SMPC. 13333-13348 - Junlin Wang, Tianyi Yang, Roy Xie, Bhuwan Dhingra:
Raccoon: Prompt Extraction Benchmark of LLM-Integrated Applications. 13349-13365 - Fengran Mo, Chen Qu, Kelong Mao, Tianyu Zhu, Zhan Su, Kaiyu Huang, Jian-Yun Nie:
History-Aware Conversational Dense Retrieval. 13366-13378 - Yikai Zhang, Qianyu He, Xintao Wang, Siyu Yuan, Jiaqing Liang, Yanghua Xiao:
Light Up the Shadows: Enhance Long-Tailed Entity Grounding with Concept-Guided Vision-Language Models. 13379-13389 - Chenye Zhao, Yingjie Li, Cornelia Caragea, Yue Zhang:
ZeroStance: Leveraging ChatGPT for Open-Domain Stance Detection via Dataset Generation. 13390-13405 - Barah Fazili, Ashish Agrawal, Preethi Jyothi:
Boosting Zero-Shot Crosslingual Performance using LLM-Based Augmentations with Effective Data Selection. 13406-13422 - Ruichao Yang, Wei Gao, Jing Ma, Hongzhan Lin, Bo Wang:
Reinforcement Tuning for Detecting Stances and Debunking Rumors Jointly with Large Language Models. 13423-13439 - Zhiyuan Fan, Zhihong Chen, Benyou Wang:
Exploring the Potential of Dense Information in Multimodal Alignment. 13440-13451 - Michael Tang, Shunyu Yao, John Yang, Karthik Narasimhan:
Referral Augmentation for Zero-Shot Information Retrieval. 13452-13461 - Wenhao Wu, Wei Li, Xinyan Xiao, Jiachen Liu, Sujian Li:
InstructEval: Instruction-Tuned Text Evaluator from Human Preference. 13462-13474 - Cuong Dang, Dung D. Le, Thai Le:
A Curious Case of Searching for the Correlation between Training Data and Adversarial Robustness of Transformer Textual Models. 13475-13491 - Jianing Wang, Junda Wu, Yupeng Hou, Yao Liu, Ming Gao, Julian J. McAuley:
InstructGraph: Boosting Large Language Models via Graph-centric Instruction Tuning and Preference Alignment. 13492-13510 - Minsoo Kim, Victor S. Bursztyn, Eunyee Koh, Shunan Guo, Seung-won Hwang:
RaDA: Retrieval-augmented Web Agent Planning with LLMs. 13511-13525 - Yiming Huang, Zhenghao Lin, Xiao Liu, Yeyun Gong, Shuai Lu, Fangyu Lei, Yaobo Liang, Yelong Shen, Chen Lin, Nan Duan, Weizhu Chen:
Competition-Level Problems are Effective LLM Evaluators. 13526-13544 - Zonglin Yang, Xinya Du, Junxian Li, Jie Zheng, Soujanya Poria, Erik Cambria:
Large Language Models for Automated Open-domain Scientific Hypotheses Discovery. 13545-13565 - Zhiming Li, Yuchen Lyu:
GRADUAL: Granularity-aware Dual Prototype Learning for Better Few-Shot Relation Extraction. 13566-13577 - Chi Wei, Shaobin Huang, Rongsheng Li, Naiyu Yan, Rui Wang:
Training a Better Chinese Spelling Correction Model via Prior-knowledge Guided Teacher. 13578-13589 - Davide Caffagni, Federico Cocchi, Luca Barsellotti, Nicholas Moratelli, Sara Sarto, Lorenzo Baraldi, Marcella Cornia, Rita Cucchiara:
The Revolution of Multimodal Large Language Models: A Survey. 13590-13618 - Shuai Wang, Liang Ding, Li Shen, Yong Luo, Bo Du, Dacheng Tao:
OOP: Object-Oriented Programming Evaluation Benchmark for Large Language Models. 13619-13639 - Demin Song, Honglin Guo, Yunhua Zhou, Shuhao Xing, Yudong Wang, Zifan Song, Wenwei Zhang, Qipeng Guo, Hang Yan, Xipeng Qiu, Dahua Lin:
Code Needs Comments: Enhancing Code LLMs with Comment Augmentation. 13640-13656 - Wangjie You, Pei Guo, Juntao Li, Kehai Chen, Min Zhang:
Efficient Domain Adaptation for Non-Autoregressive Machine Translation. 13657-13670 - Pei Guo, Wangjie You, Juntao Li, Bowen Yan, Min Zhang:
Exploring Reversal Mathematical Reasoning Ability for Large Language Models. 13671-13685 - Jingtao Guo, Chunxia Zhang, Lingxi Li, Xiaojun Xue, Zhendong Niu:
A Unified Joint Approach with Topological Context Learning and Rule Augmentation for Knowledge Graph Completion. 13686-13696 - Tu Vu, Mohit Iyyer, Xuezhi Wang, Noah Constant, Jerry W. Wei, Jason Wei, Chris Tar, Yun-Hsuan Sung, Denny Zhou, Quoc V. Le, Thang Luong:
FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation. 13697-13720 - Qihuang Zhong, Liang Ding, Juhua Liu, Bo Du, Dacheng Tao:
ROSE Doesn't Do That: Boosting the Safety of Instruction-Tuned Large Language Models with Reverse Prompt Contrastive Decoding. 13721-13736 - Nianqi Li, Jingping Liu, Sihang Jiang, Haiyun Jiang, Yanghua Xiao, Jiaqing Liang, Zujie Liang, Feng Wei, Jinglei Chen, Zhenghong Hao, Bing Han:
CR-LLM: A Dataset and Optimization for Concept Reasoning of Large Language Models. 13737-13747 - Yingqian Min, Kun Zhou, Dawei Gao, Xin Zhao, He Hu, Yaliang Li:
DATA-CUBE: Data Curriculum for Instruction-based Sentence Representation Learning. 13748-13761 - Yang Lin, Xinyu Ma, Xin Gao, Ruiqing Li, Yasha Wang, Xu Chu:
Combating Label Sparsity in Short Text Topic Modeling via Nearest Neighbor Augmentation. 13762-13774 - Jianhao Yan, Yun Luo, Yue Zhang:
RefuteBench: Evaluating Refuting Instruction-Following for Large Language Models. 13775-13791 - Changyi Xiao, Yixin Cao:
Complex Logical Query Answering by Calibrating Knowledge Graph Completion Models. 13792-13803 - Chin-Yi Lin, Chung-Chi Chen, Hen-Hsen Huang, Hsin-Hsi Chen:
Argument-Based Sentiment Analysis on Forward-Looking Statements. 13804-13815 - Hongbin Zhang, Kehai Chen, Xuefeng Bai, Yang Xiang, Min Zhang:
Paying More Attention to Source Context: Mitigating Unfaithful Translations from Large Language Model. 13816-13836 - Shreyanshu Bhushan, Eun-Soo Jung, Minho Lee:
Unveiling the Power of Integration: Block Diagram Summarization through Local-Global Fusion. 13837-13856 - Chunhui Li, Yifan Wang, Zhen Wu, Zhen Yu, Fei Zhao, Shujian Huang, Xinyu Dai:
MultiSQL: A Schema-Integrated Context-Dependent Text2SQL Dataset with Diverse SQL Operations. 13857-13867 - Chen Li, Meishan Zhang, Xuebo Liu, Zhaocong Li, Derek F. Wong, Min Zhang:
Towards Demonstration-Aware Large Language Models for Machine Translation. 13868-13881 - Dohyeon Lee, Jongyoon Kim, Seung-won Hwang, Joonsuk Park:
DADA: Distribution-Aware Domain Adaptation of PLMs for Information Retrieval. 13882-13893 - Gladys Tyen, Hassan Mansoor, Victor Carbune, Peter Chen, Tony Mak:
LLMs cannot find reasoning errors, but can correct them given the error location. 13894-13908 - Federico Ranaldi, Elena Sofia Ruzzetti, Dario Onorati, Leonardo Ranaldi, Cristina Giannone, Andrea Favalli, Raniero Romagnoli, Fabio Massimo Zanzotto:
Investigating the Impact of Data Contamination of Large Language Models in Text-to-SQL translation. 13909-13920 - Mubashara Akhtar, Nikesh Subedi, Vivek Gupta, Sahar Tahmasebi, Oana Cocarascu, Elena Simperl:
ChartCheck: Explainable Fact-Checking over Real-World Chart Images. 13921-13937 - Mohanna Hoveyda, Arjen P. de Vries, Faegheh Hasibi, Maarten de Rijke:
Real World Conversational Entity Linking Requires More Than Zero-Shots. 13938-13946 - Chenhao Zhang, Renhao Li, Minghuan Tan, Min Yang, Jingwei Zhu, Di Yang, Jiahao Zhao, Guancheng Ye, Chengming Li, Xiping Hu:
CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling. 13947-13966 - Neemesh Yadav, Sarah Masud, Vikram Goyal, Md. Shad Akhtar, Tanmoy Chakraborty:
Tox-BART: Leveraging Toxicity Attributes for Explanation Generation of Implicit Hate Speech. 13967-13983 - James Enouen, Hootan Nakhost, Sayna Ebrahimi, Sercan Ö. Arik, Yan Liu, Tomas Pfister:
TextGenSHAP: Scalable Post-Hoc Explanations in Text Generation with Long Documents. 13984-14011 - Yunfan Shao, Linyang Li, Zhaoye Fei, Hang Yan, Dahua Lin, Xipeng Qiu:
Balanced Data Sampling for Language Model Training with Clustering. 14012-14023 - Jie Wang, Tao Ji, Yuanbin Wu, Hang Yan, Tao Gui, Qi Zhang, Xuanjing Huang, Xiaoling Wang:
Length Generalization of Causal Transformers without Position Encoding. 14024-14040 - Zhengsheng Guo, Zhiwei He, Wenxiang Jiao, Xing Wang, Rui Wang, Kehai Chen, Zhaopeng Tu, Yong Xu, Min Zhang:
Unsupervised Sign Language Translation and Generation. 14041-14055 - Abelardo Carlos Martinez Lorenzo, Pere-Lluís Huguet Cabot, Karim Ghonim, Lu Xu, Hee-Soo Choi, Alberte Fernández-Castro, Roberto Navigli:
Mitigating Data Scarcity in Semantic Parsing across Languages with the Multilingual Semantic Layer and its Dataset. 14056-14080 - Chaoran Zhang, Lixin Zou, Dan Luo, Xiangyang Luo, Zihao Li, Min Tang, Chenliang Li:
Efficient Sparse Attention needs Adaptive Token Release. 14081-14094 - Lei Huang, Xiaocheng Feng, Weitao Ma, Yuxuan Gu, Weihong Zhong, Xiachong Feng, Weijiang Yu, Weihua Peng, Duyu Tang, Dandan Tu, Bing Qin:
Learning Fine-Grained Grounded Citations for Attributed Large Language Models. 14095-14113 - Riccardo Orlando, Pere-Lluís Huguet Cabot, Edoardo Barba, Roberto Navigli:
ReLiK: Retrieve and LinK, Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget. 14114-14132 - Jinggui Liang, Lizi Liao, Hao Fei, Jing Jiang:
Synergizing Large Language Models and Pre-Trained Smaller Models for Conversational Intent Discovery. 14133-14147 - Alessandro Scirè, Karim Ghonim, Roberto Navigli:
FENICE: Factuality Evaluation of summarization based on Natural language Inference and Claim Extraction. 14148-14161 - Wenqing Chen, Weicheng Wang, Zhixuan Chu, Kui Ren, Zibin Zheng, Zhichao Lu:
Self-Para-Consistency: Improving Reasoning Tasks at Low Cost for Large Language Models. 14162-14167 - David Dukic, Jan Snajder:
Looking Right is Sometimes Right: Investigating the Capabilities of Decoder-only LLMs for Sequence Labeling. 14168-14181 - Yusuke Sakai, Hidetaka Kamigaito, Taro Watanabe:
mCSQA: Multilingual Commonsense Reasoning Dataset with Unified Creation Strategy by Language Models and Humans. 14182-14214 - Hongxu Liu, Xiaojie Wang, Jiashen Sun, Ke Zeng, Guanglu Wan:
Dual-Stage Multi-Task Syntax-Oriented Pre-Training for Syntactically Controlled Paraphrase Generation. 14215-14231 - Yi Su, Yunpeng Tai, Yixin Ji, Juntao Li, Yan Bowen, Min Zhang:
Demonstration Augmentation for Zero-shot In-context Learning. 14232-14244 - Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa, Marta R. Costa-jussà:
Pushing the Limits of Zero-shot End-to-End Speech Translation. 14245-14267 - Ancheng Xu, Minghuan Tan, Lei Wang, Min Yang, Ruifeng Xu:
NUMCoT: Numerals and Units of Measurement in Chain-of-Thought Reasoning using Large Language Models. 14268-14290 - Ankan Mullick, Sombit Bose, Rounak Saha, Ayan Kumar Bhowmick, Pawan Goyal, Niloy Ganguly, Prasenjit Dey, Ravi Kokku:
On The Persona-based Summarization of Domain-Specific Documents. 14291-14307 - Navreet Kaur, Monojit Choudhury, Danish Pruthi:
Evaluating Large Language Models for Health-related Queries with Presuppositions. 14308-14331 - Andrei Stefan Bejgu, Edoardo Barba, Luigi Procopio, Alberte Fernández-Castro, Roberto Navigli:
Word Sense Linking: Disambiguating Outside the Sandbox. 14332-14347 - Verna Dankers, Ivan Titov:
Generalisation First, Memorisation Second? Memorisation Localisation for Natural Language Classification Tasks. 14348-14366 - Jian Liu, Zihe Liu, Xueqiang Lyu, Peng Jin, Jinan Xu:
Towards Multi-Relational Multi-Hop Reasoning over Dense Temporal Knowledge Graphs. 14367-14378 - Weihang Su, Changyue Wang, Qingyao Ai, Yiran Hu, Zhijing Wu, Yujia Zhou, Yiqun Liu:
Unsupervised Real-Time Hallucination Detection based on the Internal States of Large Language Models. 14379-14391 - Guiyang Hou, Yongliang Shen, Weiming Lu:
Progressive Tuning: Towards Generic Sentiment Abilities for Large Language Models. 14392-14402 - Duy C. Hoang, Nguyen Hung-Quang, Saurav Manchanda, Minlong Peng, Kok-Seng Wong, Khoa D. Doan:
Fooling the Textual Fooler via Randomizing Latent Representations. 14403-14421 - Sanjeev Kumar, Preethi Jyothi, Pushpak Bhattacharyya:
Part-of-speech Tagging for Extremely Low-resource Indian Languages. 14422-14431 - Kaixin Lan, Tao Fang, Derek F. Wong, Yabo Xu, Lidia S. Chao, Cecilia G. Zhao:
FOCUS: Forging Originality through Contrastive Use in Self-Plagiarism for Language Models. 14432-14447 - Xinxin Zhang, Jun Sun, Simin Hong, Taihao Li:
Amanda: Adaptively Modality-Balanced Domain Adaptation for Multimodal Emotion Recognition. 14448-14458 - Juraj Vladika, Phillip Schneider, Florian Matthes:
MedREQAL: Examining Medical Knowledge Recall of Large Language Models via Question Answering. 14459-14469 - Sheza Munir, Wassay Sajjad, Mukeet Raza, Emaan Abbas, Abdul Hameed Azeemi, Ihsan Ayyub Qazi, Agha Ali Raza:
Deepfake Defense: Constructing and Evaluating a Specialized Urdu Deepfake Audio Dataset. 14470-14480 - Huajian Zhang, Laura Perez-Beltrachini:
Leveraging Entailment Judgements in Cross-Lingual Summarisation. 14481-14497 - Meishan Zhang, Hao Fei, Bin Wang, Shengqiong Wu, Yixin Cao, Fei Li, Min Zhang:
Recognizing Everything from All Modalities at Once: Grounded Multimodal Universal Information Extraction. 14498-14511 - Yanda Li, Chi Zhang, Gang Yu, Wanqi Yang, Zhibin Wang, Bin Fu, Guosheng Lin, Chunhua Shen, Ling Chen, Yunchao Wei:
Enhanced Visual Instruction Tuning with Synthesized Image-Dialogue Data. 14512-14531 - Akari Haga, Saku Sugawara, Akiyo Fukatsu, Miyu Oba, Hiroki Ouchi, Taro Watanabe, Yohei Oseki:
Modeling Overregularization in Children with Small Language Models. 14532-14550 - Zhu Liu, Cunliang Kong, Ying Liu, Maosong Sun:
Fantastic Semantics and Where to Find Them: Investigating Which Layers of Generative LLMs Reflect Lexical Semantics. 14551-14558 - Zhiqiang Zhong, Kuangyu Zhou, Davide Mottin:
Harnessing Large Language Models as Post-hoc Correctors. 14559-14574 - Jingcong Liang, Rong Ye, Meng Han, Ruofei Lai, Xinyu Zhang, Xuanjing Huang, Zhongyu Wei:
Debatrix: Multi-dimensional Debate Judge with Iterative Chronological Analysis Based on LLM. 14575-14595 - Jixiang Hong, Quan Tu, Changyu Chen, Gao Xing, Ji Zhang, Rui Yan:
CycleAlign: Iterative Distillation from Black-box LLM to White-box Models for Better Human Alignment. 14596-14609 - Armineh Nourbakhsh, Sameena Shah, Carolyn P. Rosé:
Towards a new research agenda for multimodal enterprise document understanding: What are we missing? 14610-14622 - Amin Abolghasemi, Zhaochun Ren, Arian Askari, Mohammad Aliannejadi, Maarten de Rijke, Suzan Verberne:
CAUSE: Counterfactual Assessment of User Satisfaction Estimation in Task-Oriented Dialogue Systems. 14623-14635 - Matteo Gabburo, Nicolaas Paul Jedema, Siddhant Garg, Leonardo F. R. Ribeiro, Alessandro Moschitti:
Measuring Retrieval Complexity in Question Answering Systems. 14636-14650 - Jiayu Song, Jenny Chim, Adam Tsakalidis, Julia Ive, Dana Atzil-Slonim, Maria Liakata:
Combining Hierachical VAEs with LLMs for clinically meaningful timeline summarisation in social media. 14651-14672 - Yintao Tai, Xiyang Liao, Alessandro Suglia, Antonio Vergari:
PIXAR: Auto-Regressive Language Modeling in Pixel Space. 14673-14695 - Da Ma, Lu Chen, Pengyu Wang, Hongshen Xu, Hanqi Li, Liangtai Sun, Su Zhu, Shuai Fan, Kai Yu:
Sparsity-Accelerated Training for Large Language Models. 14696-14707 - Rongwu Xu, Zehan Qi, Wei Xu:
Preemptive Answer "Attacks" on Chain-of-Thought Reasoning. 14708-14726 - Jaap Jumelet, Willem H. Zuidema, Arabella Sinclair:
Do Language Models Exhibit Human-like Structural Priming Effects? 14727-14742 - Noah Wang, Z. y. Peng, Haoran Que, Jiaheng Liu, Wangchunshu Zhou, Yuhan Wu, Hongcheng Guo, Ruitong Gan, Zehao Ni, Jian Yang, Man Zhang, Zhaoxiang Zhang, Wanli Ouyang, Ke Xu, Wenhao Huang, Jie Fu, Junran Peng:
RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models. 14743-14777 - Zixia Jia, Mengmeng Wang, Baichen Tong, Song-Chun Zhu, Zilong Zheng:
LangSuit·E: Planning, Controlling and Interacting with Large Language Models in Embodied Text Environments. 14778-14814 - Adil Soubki, John Murzaku, Arash Yousefi Jordehi, Peter Zeng, Magdalena Markowska, Seyed Abolghasem Mirroshandel, Owen Rambow:
Views Are My Own, but Also Yours: Benchmarking Theory of Mind Using Common Ground. 14815-14823 - Divyanshu Aggarwal, Ashutosh Sathe, Ishaan Watts, Sunayana Sitaram:
MAPLE: Multilingual Evaluation of Parameter Efficient Finetuning of Large Language Models. 14824-14867 - Xuxin Cheng, Zhihong Zhu, Xianwei Zhuang, Zhanpeng Chen, Zhiqi Huang, Yuexian Zou:
MoE-SLU: Towards ASR-Robust Spoken Language Understanding via Mixture-of-Experts. 14868-14879 - David Mueller, Mark Dredze, Nicholas Andrews:
Multi-Task Transfer Matters During Instruction-Tuning. 14880-14891 - Qi Guo, Leiyu Wang, Yidong Wang, Wei Ye, Shikun Zhang:
What Makes a Good Order of Examples in In-Context Learning. 14892-14904 - Yunye Gong, Robik Shrestha, Jared Claypoole, Michael Cogswell, Arijit Ray, Christopher Kanan, Ajay Divakaran:
BloomVQA: Assessing Hierarchical Multi-modal Comprehension. 14905-14918 - Yifei Li, Xiang Yue, Zeyi Liao, Huan Sun:
AttributionBench: How Hard is Automatic Attribution Evaluation? 14919-14935 - Justin Lovelace, Varsha Kishore, Yiwei Chen, Kilian Q. Weinberger:
Diffusion Guided Language Modeling. 14936-14952 - Xiaoqi Han, Ru Li, Xiaoli Li, Jiye Liang, Zifang Zhang, Jeff Z. Pan:
InstructEd: Soft-Instruction Tuning for Model Editing with Hops. 14953-14968 - Eunseop Yoon, Hee Suk Yoon, SooHwan Eom, Gunsoo Han, Daniel Wontae Nam, Daejin Jo, Kyoung-Woon On, Mark Hasegawa-Johnson, Sungwoong Kim, Chang Dong Yoo:
TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback. 14969-14981 - Cheng-Yu Hsieh, Yung-Sung Chuang, Chun-Liang Li, Zifeng Wang, Long T. Le, Abhishek Kumar, James R. Glass, Alexander Ratner, Chen-Yu Lee, Ranjay Krishna, Tomas Pfister:
Found in the middle: Calibrating Positional Attention Bias Improves Long Context Utilization. 14982-14995 - Sarkar Snigdha Sarathi Das, Chirag Shah, Mengting Wan, Jennifer Neville, Longqi Yang, Reid Andersen, Georg Buscher, Tara Safavi:
S3-DST: Structured Open-Domain Dialogue Segmentation and State Tracking in the Era of LLMs. 14996-15014 - Bowen Zhao, Zander Brumbaugh, Yizhong Wang, Hannaneh Hajishirzi, Noah A. Smith:
Set the Clock: Temporal Alignment of Pretrained Language Models. 15015-15040 - Beyza Ermis, Luiza Pozzobon, Sara Hooker, Patrick Lewis:
From One to Many: Expanding the Scope of Toxicity Mitigation in Language Models. 15041-15058 - Ansh Arora, Xuanli He, Maximilian Mozes, Srinibas Swain, Mark Dras, Qiongkai Xu:
Here's a Free Lunch: Sanitizing Backdoored Models with Model Merge. 15059-15075 - Arthur Scalercio, Maria José Finatto, Aline Paes:
Enhancing Sentence Simplification in Portuguese: Leveraging Paraphrases, Context, and Linguistic Features. 15076-15091 - Di Wu, Shaomu Tan, Yan Meng, David Stap, Christof Monz:
How Far can 100 Samples Go? Unlocking Zero-Shot Translation with Tiny Multi-Parallel Data. 15092-15108 - Satanu Ghosh, Neal R. Brodnik, Carolina Frey, Collin Holgate, Tresa M. Pollock, Samantha H. Daly, Samuel Carton:
Toward Reliable Ad-hoc Scientific Information Extraction: A Case Study on Two Materials Dataset. 15109-15123 - Jinwook Park, Kangil Kim:
Structural Optimization Ambiguity and Simplicity Bias in Unsupervised Neural Grammar Induction. 15124-15139 - Vincent Perot, Kai Kang, Florian Luisier, Guolong Su, Xiaoyu Sun, Ramya Sree Boppana, Zilong Wang, Zifeng Wang, Jiaqi Mu, Hao Zhang, Chen-Yu Lee, Nan Hua:
LMDX: Language Model-based Document Information Extraction and Localization. 15140-15168 - Rungsiman Nararatwong, Chung-Chi Chen, Natthawut Kertkeidkachorn, Hiroya Takamura, Ryutaro Ichise:
DBQR-QA: A Question Answering Dataset on a Hybrid of Database Querying and Reasoning. 15169-15182 - Junda Wang, Zonghai Yao, Zhichao Yang, Huixue Zhou, Rumeng Li, Xun Wang, Yucheng Xu, Hong Yu:
NoteChat: A Dataset of Synthetic Patient-Physician Conversations Conditioned on Clinical Notes. 15183-15201 - Akshat Gupta, Anurag Rao, Gopala Anumanchipalli:
Model Editing at Scale leads to Gradual and Catastrophic Forgetting. 15202-15232 - Yihao Ding, Lorenzo Vaiani, Soyeon Caren Han, Jean Lee, Paolo Garza, Josiah Poon, Luca Cagliero:
3MVRD: Multimodal Multi-task Multi-teacher Visually-Rich Form Document Understanding. 15233-15244 - Pegah Jandaghi, XiangHai Sheng, Xinyi Bai, Jay Pujara, Hakim Sidahmed:
Faithful Persona-based Conversational Dataset Generation with Large Language Models. 15245-15270 - Zhiyang Xu, Chao Feng, Rulin Shao, Trevor Ashby, Ying Shen, Di Jin, Yu Cheng, Qifan Wang, Lifu Huang:
Vision-Flan: Scaling Human-Labeled Tasks in Visual Instruction Tuning. 15271-15342 - Derek Powell, Walter Gerych, Thomas Hartvigsen:
TAXI: Evaluating Categorical Knowledge Editing for Language Models. 15343-15352 - Claire Jin, Sudha Rao, Xiangyu Peng, Portia Botchway, Jessica Quaye, Chris Brockett, Bill Dolan:
Automatic Bug Detection in LLM-Powered Text-Based Games Using LLMs. 15353-15368 - Nadine Amin, Julia Rayz:
Embodied Language Learning: Opportunities, Challenges, and Future Directions. 15369-15379 - Ian Porada, Alexandra Olteanu, Kaheer Suleman, Adam Trischler, Jackie Chi Kit Cheung:
Challenges to Evaluating the Generalization of Coreference Resolution Models: A Measurement Modeling Perspective. 15380-15395 - Sai Vallurupalli, Katrin Erk, Francis Ferraro:
SAGA: A Participant-specific Examination of Story Alternatives and Goal Applicability for a Deeper Understanding of Complex Events. 15396-15420 - Kun Zhao, Bohao Yang, Chen Tang, Chenghua Lin, Liang Zhan:
SLIDE: A Framework Integrating Small and Large Language Models for Open-Domain Dialogues Evaluation. 15421-15435 - Janghoon Han, Changho Lee, Joongbo Shin, Stanley Jungkyu Choi, Honglak Lee, Kyunghoon Bae:
Deep Exploration of Cross-Lingual Zero-Shot Generalization in Instruction Tuning. 15436-15452 - Daiki Asami, Saku Sugawara:
What Makes Language Models Good-enough? 15453-15467 - Dingyao Yu, Yang An, Wei Ye, Xiongfeng Xiao, Shaoguang Mao, Tao Ge, Shikun Zhang:
Refining Corpora from a Model Calibration Perspective for Chinese Spelling Correction. 15468-15480 - Jianrui Zhang, Mu Cai, Tengyang Xie, Yong Jae Lee:
CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples. 15481-15495 - Ran Xu, Hejie Cui, Yue Yu, Xuan Kan, Wenqi Shi, Yuchen Zhuang, May Dongmei Wang, Wei Jin, Joyce C. Ho, Carl Yang:
Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language Models. 15496-15523 - Min-Jae Hwang, Ilia Kulikov, Benjamin N. Peloquin, Hongyu Gong, Peng-Jen Chen, Ann Lee:
Textless Acoustic Model with Self-Supervised Distillation for Noise-Robust Expressive Speech-to-Speech Translation. 15524-15541 - Yang Wu, Chenghao Wang, Ece Gumusel, Xiaozhong Liu:
Knowledge-Infused Legal Wisdom: Navigating LLM Consultation through the Lens of Diagnostics and Positive-Unlabeled Reinforcement Learning. 15542-15555 - Hui Liu, Wenya Wang, Haoru Li, Haoliang Li:
TELLER: A Trustworthy Framework for Explainable, Generalizable and Controllable Fake News Detection. 15556-15583 - Shuyang Cao, Lu Wang:
Verifiable Generation with Subsentence-Level Fine-Grained Citations. 15584-15596 - Yash Kumar Lal, Li Zhang, Faeze Brahman, Bodhisattwa Prasad Majumder, Peter Clark, Niket Tandon:
Tailoring with Targeted Precision: Edit-Based Agents for Open-Domain Procedure Customization. 15597-15611 - Xinbo Wu, Lav R. Varshney:
A Meta-Learning Perspective on Transformers for Causal Language Modeling. 15612-15622 - Rongzhi Zhang, Jiaming Shen, Tianqi Liu, Haorui Wang, Zhen Qin, Feng Han, Jialu Liu, Simon Baumgartner, Michael Bendersky, Chao Zhang:
PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs. 15623-15636 - Yunxiang Zhang, Muhammad Khalifa, Lajanugen Logeswaran, Jaekyeom Kim, Moontae Lee, Honglak Lee, Lu Wang:
Small Language Models Need Strong Verifiers to Self-Correct Reasoning. 15637-15653 - Kexun Zhang, Yee Man Choi, Zhenqiao Song, Taiqi He, William Yang Wang, Lei Li:
Hire a Linguist!: Learning Endangered Languages in LLMs with In-Context Linguistic Descriptions. 15654-15669 - Ali Malik, Stephen Mayhew, Christopher Piech, Klinton Bicknell:
From Tarzan to Tolkien: Controlling the Language Proficiency Level of LLMs for Content Generation. 15670-15693 - Khaoula Chehbouni, Megha Roshan, Emmanuel Ma, Futian Andrew Wei, Afaf Taïk, Jackie Chi Kit Cheung, Golnoosh Farnadi:
From Representational Harms to Quality-of-Service Harms: A Case Study on Llama 2 Safety Safeguards. 15694-15710 - Zishan Guo, Yufei Huang, Deyi Xiong:
CToolEval: A Chinese Benchmark for LLM-Powered Agent Evaluation in Real-World API Interactions. 15711-15724 - Ben Athiwaratkun, Shiqi Wang, Mingyue Shang, Yuchen Tian, Zijian Wang, Sujan Kumar Gonugondla, Sanjay Krishna Gouda, Robert Kwiatkowski, Ramesh Nallapati, Parminder Bhatia, Bing Xiang:
Token Alignment via Character Matching for Subword Completion. 15725-15738 - Rilyn Han, Jiawen Chen, Yixin Liu, Arman Cohan:
Rethinking Efficient Multilingual Text Summarization Meta-Evaluation. 15739-15746 - Ziyang Ma, Zhisheng Zheng, Jiaxin Ye, Jinchao Li, Zhifu Gao, Shiliang Zhang, Xie Chen:
emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation. 15747-15760 - Yilin Yang, Stefan Lee, Prasad Tadepalli:
Language-Informed Beam Search Decoding for Multilingual Machine Translation. 15761-15772 - Minsoo Kim, Sihwa Lee, Wonyong Sung, Jungwook Choi:
RA-LoRA: Rank-Adaptive Parameter-Efficient Fine-Tuning for Accurate 2-bit Quantized Large Language Models. 15773-15786 - Alexander Taylor, Wei Wang:
The PGNSC Benchmark: How Do We Predict Where Information Spreads? 15787-15803 - Shreyas Basavatia, Keerthiram Murugesan, Shivam Ratnakar:
STARLING: Self-supervised Training of Text-based Reinforcement Learning Agent with Large Language Models. 15804-15819 - Dohyun Lee, Daniel Rim, Minseok Choi, Jaegul Choo:
Protecting Privacy Through Approximating Optimal Parameters for Sequence Unlearning in Language Models. 15820-15839 - Xintong Wang, Jingheng Pan, Liang Ding, Chris Biemann:
Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding. 15840-15853 - Dingmin Wang, Jinman Zhao, Hengzhi Pei, Samson Tan, Sheng Zha:
Fine-tuning Language Models for Joint Rewriting and Completion of Code with Potential Bugs. 15854-15868 - Abhinav Anand, Shweta Verma, Krishna Narasimhan, Mira Mezini:
A Critical Study of What Code-LLMs (Do Not) Learn. 15869-15889 - Yucheng Zhou, Xiang Li, Qianning Wang, Jianbing Shen:
Visual In-Context Learning for Large Vision-Language Models. 15890-15902 - Xin Cheng, Xun Wang, Tao Ge, Si-Qing Chen, Furu Wei, Dongyan Zhao, Rui Yan:
SCALE: Synergized Collaboration of Asymmetric Language Translation Engines. 15903-15918 - Gauri Naik, Sharad Chandakacherla, Shweta Yadav, Md. Shad Akhtar:
No perspective, no perception!! Perspective-aware Healthcare Answer Summarization. 15919-15932 - Tao Shen, Guodong Long, Xiubo Geng, Chongyang Tao, Yibin Lei, Tianyi Zhou, Michael Blumenstein, Daxin Jiang:
Retrieval-Augmented Retrieval: Large Language Models are Strong Zero-Shot Retriever. 15933-15946 - Preslav Nakov, Jisun An, Haewoon Kwak, Muhammad Arslan Manzoor, Zain Muhammad Mujahid, Husrev T. Sencar:
A Survey on Predicting the Factuality and the Bias of News Media. 15947-15962 - Rana Aref Salama, Abdou Youssef, Mona T. Diab:
Semantic Compression for Word and Sentence Embeddings using Discrete Wavelet Transform. 15963-15977 - Jeonghoon Kim, Heesoo Jung, Hyeju Jang, Hogun Park:
Improving Multi-hop Logical Reasoning in Knowledge Graphs with Context-Aware Query Representation Learning. 15978-15991 - Yuzhao Heng, Chunyuan Deng, Yitong Li, Yue Yu, Yinghao Li, Rongzhi Zhang, Chao Zhang:
ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models. 15992-16030 - Yihan Wang, Zhouxing Shi, Andrew Bai, Cho-Jui Hsieh:
Defending LLMs against Jailbreaking Attacks via Backtranslation. 16031-16046 - Shiki Sato, Reina Akama, Jun Suzuki, Kentaro Inui:
A Large Collection of Model-generated Contradictory Responses for Consistency-aware Dialogue Systems. 16047-16062 - Kentaro Ozeki, Risako Ando, Takanobu Morishita, Hirohiko Abe, Koji Mineshima, Mitsuhiro Okada:
Exploring Reasoning Biases in Large Language Models Through Syllogism: Insights from the NeuBAROCO Dataset. 16063-16077 - Chunyuan Deng, Yilun Zhao, Yuzhao Heng, Yitong Li, Jiannan Cao, Xiangru Tang, Arman Cohan:
Unveiling the Spectrum of Data Contamination in Language Model: A Survey from Detection to Remediation. 16078-16092 - Sneha Mondal, Ritika, Ashish Agrawal, Preethi Jyothi, Aravindan Raghuveer:
DIMSIM: Distilled Multilingual Critics for Indic Text Simplification. 16093-16109 - Dongkyu Lee, Chandana Satya Prakash, Jack FitzGerald, Jens Lehmann:
MATTER: Memory-Augmented Transformer Using Heterogeneous Knowledge Sources. 16110-16121 - Jisu Shin, Hoyun Song, Huije Lee, Soyeong Jeong, Jong Park:
Ask LLMs Directly, "What shapes your bias?": Measuring Social Bias in Large Language Models. 16122-16143 - Yuwei Xia, Ding Wang, Qiang Liu, Liang Wang, Shu Wu, Xiao-Yu Zhang:
Chain-of-History Reasoning for Temporal Knowledge Graph Forecasting. 16144-16159 - Ming Li, Jiuhai Chen, Lichang Chen, Tianyi Zhou:
Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements. 16160-16176 - Jaehoon Kim, Seungwan Jin, Sohyun Park, Someen Park, Kyungsik Han:
Label-aware Hard Negative Sampling Strategies with Momentum Contrastive Learning for Implicit Hate Speech Detection. 16177-16188 - Ming Li, Lichang Chen, Jiuhai Chen, Shwai He, Jiuxiang Gu, Tianyi Zhou:
Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning. 16189-16211 - Qiushi Huang, Xubo Liu, Tom Ko, Bo Wu, Wenwu Wang, Yu Zhang, Lilian Tang:
Selective Prompting Tuning for Personalized Conversations with LLMs. 16212-16226 - Rima Hazra, Sayan Layek, Somnath Banerjee, Soujanya Poria:
Sowing the Wind, Reaping the Whirlwind: The Impact of Editing Language Models. 16227-16239 - Honglin Lin, Siyu Li, Guoshun Nan, Chaoyue Tang, Xueting Wang, Jingxin Xu, Yankai Rong, Zhouzhili Zhouzhili, Yutong Gao, Qimei Cui, Xiaofeng Tao:
ContextBLIP: Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions. 16240-16258 - Yew Ken Chia, Vernon Toh, Deepanway Ghosal, Lidong Bing, Soujanya Poria:
PuzzleVQA: Diagnosing Multimodal Reasoning Challenges of Language Models with Abstract Visual Patterns. 16259-16273 - Jaehong Kim, Chaeyoon Jeong, Seongchan Park, Meeyoung Cha, Wonjae Lee:
How Do Moral Emotions Shape Political Participation? A Cross-Cultural Analysis of Online Petitions Using Language Models. 16274-16289 - Yubo Dong, Xukun Zhu, Zhengzhe Pan, Linchao Zhu, Yi Yang:
VillagerAgent: A Graph-Based Multi-Agent Framework for Coordinating Complex Task Dependencies in Minecraft. 16290-16314 - Yuchen Yang, Yu Wang, Yanfeng Wang:
CF-TCIR: A Compositor-Free Framework for Hierarchical Text-Conditioned Image Retrieval. 16315-16325 - Peijie Huang, Xisheng Xiao, Yuhong Xu, Jiawei Chen:
DMIN: A Discourse-specific Multi-granularity Integration Network for Conversational Aspect-based Sentiment Quadruple Analysis. 16326-16338 - Muhammad Reza Qorib, Geonsik Moon, Hwee Tou Ng:
Are Decoder-Only Language Models Better than Encoder-Only Language Models in Understanding Word Meaning? 16339-16347 - Xihang Yue, Linchao Zhu, Yi Yang:
FragRel: Exploiting Fragment-level Relations in the External Memory of Large Language Models. 16348-16361 - Shiao Meng, Xuming Hu, Aiwei Liu, Fukun Ma, Yawen Yang, Shuang Li, Lijie Wen:
On the Robustness of Document-Level Relation Extraction Models to Entity Name Variations. 16362-16374 - Bo Hu, Meng Zhang, Chenfei Xie, Yuanhe Tian, Yan Song, Zhendong Mao:
RESEMO: A Benchmark Chinese Dataset for Studying Responsive Emotion from Social Media Content. 16375-16387 - Jaehee Ryu, Seonhee Cho, Gyubok Lee, Edward Choi:
EHR-SeqSQL : A Sequential Text-to-SQL Dataset For Interactively Exploring Electronic Health Records. 16388-16407 - Yihe Wang, Jin Liu, Yao Wan, Yitong Li, Zifeng Liu, Weipeng Chen:
KEEP CHATTING! An Attractive Dataset for Continuous Conversation Agents. 16408-16414 - Yuze Zhao, Zhenya Huang, Yixiao Ma, Rui Li, Kai Zhang, Hao Jiang, Qi Liu, Linbo Zhu, Yu Su:
RePair: Automated Program Repair with Process-based Feedback. 16415-16429 - Yang Xu, Yunlong Feng, Honglin Mu, Yutai Hou, Yitong Li, Xinghao Wang, Wanjun Zhong, Zhongyang Li, Dandan Tu, Qingfu Zhu, Min Zhang, Wanxiang Che:
Concise and Precise Context Compression for Tool-Using Language Models. 16430-16441 - Mohamed Elgaar, Jiali Cheng, Nidhi Vakil, Hadi Amiri, Leo Anthony Celi:
MedDec: A Dataset for Extracting Medical Decisions from Discharge Summaries. 16442-16455
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.