A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions

Article Status
Published
Authors/contributors
Huang, L., Yu, W., Ma, W., Zhong, W., Feng, Z., Wang, H., Chen, Q., Peng, W., Feng, X., Qin, B., & Liu, T.
Title
A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions
Abstract
The emergence of large language models (LLMs) has marked a significant breakthrough in natural language processing (NLP), fueling a paradigm shift in information acquisition. Nevertheless, LLMs are prone to hallucination, generating plausible yet nonfactual content. This phenomenon raises significant concerns over the reliability of LLMs in real-world information retrieval (IR) systems and has attracted intensive research to detect and mitigate such hallucinations. Given the open-ended, general-purpose nature of LLMs, LLM hallucinations present distinct challenges that diverge from those of prior task-specific models. This divergence highlights the urgency for a nuanced understanding and comprehensive overview of recent advances in LLM hallucinations. In this survey, we begin with an innovative taxonomy of hallucination in the era of LLMs and then delve into the factors contributing to hallucinations. Subsequently, we present a thorough overview of hallucination detection methods and benchmarks. Our discussion then turns to representative methodologies for mitigating LLM hallucinations. Additionally, we examine the current limitations faced by retrieval-augmented LLMs in combating hallucinations, offering insights for developing more robust IR systems. Finally, we highlight promising research directions on LLM hallucinations, including hallucination in large vision-language models and understanding of knowledge boundaries in LLM hallucinations.
Repository
arXiv
Archive ID
arXiv:2311.05232
Date
2025-01-24
Citation Key
huang2023
Accessed
21/03/2024, 16:05
Short Title
A Survey on Hallucination in Large Language Models
Language
en
Library Catalogue
Extra
arXiv:2311.05232 [cs]
Citation
Huang, L., Yu, W., Ma, W., Zhong, W., Feng, Z., Wang, H., Chen, Q., Peng, W., Feng, X., Qin, B., & Liu, T. (2025). A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions. ACM Transactions on Information Systems. https://dl.acm.org/doi/10.1145/3703155 (arXiv:2311.05232)