On Faithfulness and Factuality in Abstractive Summarization

Article Status
Published
Authors/contributors
Maynez, J., Narayan, S., Bohnet, B., & McDonald, R.
Title
On Faithfulness and Factuality in Abstractive Summarization
Abstract
It is well known that the standard likelihood training and approximate decoding objectives in neural text generation models lead to less human-like responses for open-ended tasks such as language modeling and story generation. In this paper, we analyze the limitations of these models for abstractive document summarization and find that they are highly prone to hallucinate content that is unfaithful to the input document. We conduct a large-scale human evaluation of several neural abstractive summarization systems to better understand the types of hallucinations they produce. Our human annotators found substantial amounts of hallucinated content in all model-generated summaries. However, our analysis shows that pretrained models are better summarizers not only in terms of raw metrics, i.e., ROUGE, but also in generating faithful and factual summaries, as evaluated by humans. Furthermore, we show that textual entailment measures correlate better with faithfulness than standard metrics, potentially paving the way for automatic evaluation metrics as well as training and decoding criteria.
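The entailment-based evaluation mentioned in the abstract can be sketched with an off-the-shelf NLI classifier: score how strongly the source document entails each summary. The snippet below is a minimal illustration, not the authors' implementation; the model name ("roberta-large-mnli"), the Hugging Face transformers pipeline usage, and the entailment_score helper are assumptions made here, and in practice long documents would need truncation or sentence-level scoring.

# Illustrative sketch only -- not the paper's released code.
# Assumes the Hugging Face `transformers` library and a generic MNLI classifier.
from transformers import pipeline

nli = pipeline("text-classification", model="roberta-large-mnli")

def entailment_score(document: str, summary: str) -> float:
    """Return P(document entails summary); higher suggests a more faithful summary."""
    scores = nli({"text": document, "text_pair": summary}, top_k=None)
    # roberta-large-mnli labels are ENTAILMENT / NEUTRAL / CONTRADICTION.
    return next(s["score"] for s in scores if s["label"].upper() == "ENTAILMENT")

doc = "The committee approved the budget on Tuesday after a short debate."
print(entailment_score(doc, "The budget was approved on Tuesday."))   # expected: high
print(entailment_score(doc, "The committee rejected the budget."))    # expected: low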
Repository
arXiv
Archive ID
arXiv:2005.00661
Place
Online
Date
2020
Accessed
06/05/2024, 21:10
Library Catalogue
Extra
arXiv:2005.00661 [cs] Citation Key: maynez2020 <Title>: On Faithfulness and Factuality in Abstractive Summarization <AI Smry>: It is found that neural abstractive summarization models are highly prone to hallucinate content that is unfaithful to the input document, and that textual entailment measures correlate better with faithfulness than standard metrics, potentially leading the way to automatic evaluation metrics as well as training and decoding criteria.
Citation
Maynez, J., Narayan, S., Bohnet, B., & McDonald, R. (2020). On Faithfulness and Factuality in Abstractive Summarization (arXiv:2005.00661). arXiv. https://www.aclweb.org/anthology/2020.acl-main.173