An Evaluation Resource for Grounding Translation Errors

Sujin Chen; Kang Wang; Zixuan Zhou; Xiangyu Duan; Wanqun Zhang; Hao Yang; Jinsong Su; Min Zhang

doi:10.18653/v1/2025.findings-emnlp.1299

An Evaluation Resource for Grounding Translation Errors

Sujin Chen, Kang Wang, Zixuan Zhou, Xiangyu Duan, Wanqun Zhang, Hao Yang, Jinsong Su, Min Zhang

Abstract

Current fine-grained error analyses by LLMs gain more and more attention in machine translation, but these analyses do not ground the errors to the reasons why the annotated text spans are erroneous. If LLMs do not know such reasons, the corrections or refinements by LLMs will be untrustworthy.In this paper, we check whether LLMs know such reasons in translation error grounding task. We manually build an evaluation resource through a bi-directional grounding scheme. In the forward direction, we annotate the explanation of the reason for each error span. In the backward direction, we annotate the error span given its explanation, in which the error span is masked. If the error spans of both directions are consistent, we deem the explanation is valid. Such grounding process can regulate the explanation so as to avoid the subjective bias. The evaluation results on this resource show that LLMs perform significantly worse than human in both directions. Furthermore, we apply the error grounding for filtering false alarmed errors, and achieve significant improvement in translation error detection.

Anthology ID:: 2025.findings-emnlp.1299
Volume:: Findings of the Association for Computational Linguistics: EMNLP 2025
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 23900–23916
Language:
URL:: https://aclanthology.org/2025.findings-emnlp.1299/
DOI:: 10.18653/v1/2025.findings-emnlp.1299
Bibkey:
Cite (ACL):: Sujin Chen, Kang Wang, Zixuan Zhou, Xiangyu Duan, Wanqun Zhang, Hao Yang, Jinsong Su, and Min Zhang. 2025. An Evaluation Resource for Grounding Translation Errors. In Findings of the Association for Computational Linguistics: EMNLP 2025, pages 23900–23916, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: An Evaluation Resource for Grounding Translation Errors (Chen et al., Findings 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.findings-emnlp.1299.pdf
Checklist:: 2025.findings-emnlp.1299.checklist.pdf

PDF Cite Search Checklist Fix data