Evaluating the Ripple Effects of Knowledge Editing in Language Models

Roi Cohen,Eden Biran,Ori Yoran,Amir Globerson,Mor Geva

doi:10.1162/tacl_a_00644

Abstract

Abstract Modern language models capture a large body of factual knowledge. However, some facts can be incorrectly induced or become obsolete over time, resulting in factually incorrect generations. This has led to the development of various editing methods that allow updating facts encoded by the model. Evaluation of these methods has primarily focused on testing whether an individual fact has been successfully injected, and if similar predictions for other subjects have not changed. Here we argue that such evaluation is limited, since injecting one fact (e.g., “Jack Depp is the son of Johnny Depp”) introduces a “ripple effect” in the form of additional facts that the model needs to update (e.g., “Jack Depp is the sibling of Lily-Rose Depp”). To address this, we propose novel evaluation criteria that consider the implications of an edit on related facts. Using these criteria, we then construct RippleEdits, a diagnostic benchmark of 5K factual edits, capturing various types of ripple effects. We evaluate prominent editing methods on RippleEdits, showing that they fail to introduce consistent changes in the model’s knowledge. In addition, we find that a simple in-context editing baseline obtains the best scores on our benchmark, suggesting a promising research direction for model editing.1

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Transactions of the Association for Computational Linguistics	Publication Date: Apr 9, 2024
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Evaluating the Ripple Effects of Knowledge Editing in Language Models

Abstract

Talk to us

Similar Papers

More From: Transactions of the Association for Computational Linguistics

Lead the way for us

Similar Papers

Evaluating Complex Entity Knowledge Propagation for Knowledge Editing in LLMs
Wafa Shafqat ... Seung-Hoon Na
Applied Sciences | VOL. 14
Wafa Shafqat, et. al.Wafa Shafqat ... Seung-Hoon Na
13 Feb 2024
Applied Sciences | VOL. 14

Geoscience language models and their intrinsic evaluation
Christopher J.M Lawley ... Geneviève Marquis
Applied Computing and Geosciences | VOL. 14
Christopher J.M Lawley, et. al.Christopher J.M Lawley ... Geneviève Marquis
04 May 2022
Applied Computing and Geosciences | VOL. 14

Methodical Systematic Review of Abstractive Summarization and Natural Language Processing Models for Biomedical Health Informatics: Approaches, Metrics and Challenges
Praveen Kumar Katwe ... Deepak Gupta
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. -
Praveen Kumar Katwe, et. al.Praveen Kumar Katwe ... Deepak Gupta
31 May 2023
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. -

QWI: a method for improved smoothing in language modelling
G Bordel ... E Vidal
-
G Bordel, et. al.G Bordel ... E Vidal
09 May 1995
09 May 1995

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Evaluating the Ripple Effects of Knowledge Editing in Language Models

Abstract

Talk to us

Similar Papers

More From: Transactions of the Association for Computational Linguistics