Improving Factuality by Contrastive Decoding with Factual and Hallucination Prompts.

Bojie Lv,Ao Feng,Chenlong Xie

doi:10.3390/s24217097

Abstract

Large language models have demonstrated impressive capabilities in many domains. But they sometimes generate irrelevant or nonsensical text, or produce outputs that deviate from the provided input, an occurrence commonly referred to as hallucination. To mitigate this issue, we introduce a novel decoding method that incorporates both factual and hallucination prompts (DFHP). It applies contrastive decoding to highlight the disparity in output probabilities between factual prompts and hallucination prompts. Experiments on both multiple-choice and text generation tasks show that our approach significantly improves factual accuracy of large language models without additional training. On the TruthfulQA dataset, the DFHP method significantly improves factual accuracy of the LLaMA model, with an average improvement of 6.4% for the 7B, 13B, 30B, and 65B versions. Its high accuracy in factuality makes it an ideal choice for high reliability tasks like medical diagnosis and legal cases.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Improving Factuality by Contrastive Decoding with Factual and Hallucination Prompts.

Abstract

Talk to us

Similar Papers

More From: Sensors (Basel, Switzerland)

Lead the way for us

Journal: Sensors (Basel, Switzerland)	Publication Date: Nov 4, 2024
License type: CC BY 4.0

Similar Papers

How Can IJDS Authors, Reviewers, and Editors Use (and Misuse) Generative AI?
Galit Shmueli ... Bianca Maria Colosimo
INFORMS Journal on Data Science | VOL. 2
Galit Shmueli, et. al.Galit Shmueli ... Bianca Maria Colosimo
01 Apr 2023
INFORMS Journal on Data Science | VOL. 2

Navigating GPT-4 and BERT: A Dual Perspective on Financial and Political Sentiment Analysis
Akash Ghosh ... Rahul Sarkar
International Journal for Research in Applied Science and Engineering Technology | VOL. 11
Akash Ghosh, et. al.Akash Ghosh ... Rahul Sarkar
31 Dec 2024
International Journal for Research in Applied Science and Engineering Technology | VOL. 11

Applications and Concerns of ChatGPT and Other Conversational Large Language Models in Health Care: Systematic Review.
Leyao Wang ... Zhijun Yin
Journal of medical Internet research | VOL. 26
Leyao Wang, et. al.Leyao Wang ... Zhijun Yin
07 Nov 2024
Journal of medical Internet research | VOL. 26

Use of SNOMED CT in Large Language Models: Scoping Review.
Eunsuk Chang ... Sumi Sung
JMIR medical informatics | VOL. 12
Eunsuk Chang, et. al.Eunsuk Chang ... Sumi Sung
07 Oct 2024
JMIR medical informatics | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improving Factuality by Contrastive Decoding with Factual and Hallucination Prompts.

Abstract

Talk to us

Similar Papers

More From: Sensors (Basel, Switzerland)