Artificial intelligence-based data extraction for next generation risk assessment: Is fine-tuning of a large language model worth the effort?

Anna Sonnenburg,Dimitra Eleftheriadou,Florian Padberg,Tilmann Blümmel,Nynke I Kramer,Paul Wittkowski,Benthe Van Der Lugt,Todor Dobrikov,Ferdinand Graf,Hans Bouwmeester,Karsten Bech,Carsten Kneuer,Johannes Rehn,Carla Mereu

doi:10.1016/j.tox.2024.153933

Abstract

To underpin scientific evaluations of chemical risks, agencies such as the European Food Safety Authority (EFSA) heavily rely on the outcome of systematic reviews, which currently require extensive manual effort. One specific challenge constitutes the meaningful use of vast amounts of valuable data from new approach methodologies (NAMs) which are mostly reported in an unstructured way in the scientific literature. In the EFSA-initiated project ‘AI4NAMS’, the potential of large language models (LLMs) was explored. Models from the GPT family, where GPT refers to Generative Pre-trained Transformer, were used for searching, extracting, and integrating data from scientific publications for NAM-based risk assessment. A case study on bisphenol A (BPA), a substance of very high concern due to its adverse effects on human health, focused on the structured extraction of information on test systems measuring biologic activities of BPA. Fine-tuning of a GPT-3 model (Curie base model) for extraction tasks was tested and the performance of the fine-tuned model was compared to the performance of a ready-to-use model (text-davinci-002). To update findings from the AI4NAMS project and to check for technical progress, the fine-tuning exercise was repeated and a newer ready-to-use model (text-davinci-003) served as comparison. In both cases, the fine-tuned Curie model was found to be superior to the ready-to-use model. Performance improvement was also obvious between text-davinci-002 and the newer text-davinci-003. Our findings demonstrate how fine-tuning and the swift general technical development improve model performance and contribute to the growing number of investigations on the use of AI in scientific and regulatory tasks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Artificial intelligence-based data extraction for next generation risk assessment: Is fine-tuning of a large language model worth the effort?

Abstract

Talk to us

Similar Papers

More From: Toxicology

Lead the way for us

Journal: Toxicology	Publication Date: Aug 23, 2024
License type: cc-by

Similar Papers

Bisphenol A (BPA) hazard assessment protocol
... Detlef Wölfle
EFSA Supporting Publications | VOL. 14
, et. al. ... Detlef Wölfle
01 Dec 2017
EFSA Supporting Publications | VOL. 14

Outside Looking In: Understanding the Role of Science in Regulation
Tim Lougheed
Environmental Health Perspectives | VOL. 117
Tim LougheedTim Lougheed
01 Mar 2009
Environmental Health Perspectives | VOL. 117

Implementing New Approach Methodologies (NAMs) in food safety assessments: Strategic objectives and actions taken by the European Food Safety Authority
Irene Cattaneo ... Andrea Terron
Trends in Food Science & Technology | VOL. 133
Irene Cattaneo, et. al.Irene Cattaneo ... Andrea Terron
08 Feb 2023
Trends in Food Science & Technology | VOL. 133

Report on the two‐phase public consultation on the draft EFSA scientific opinion on bisphenol A (BPA)
-
EFSA Supporting Publications | VOL. 12
--
01 Jan 2015
EFSA Supporting Publications | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Artificial intelligence-based data extraction for next generation risk assessment: Is fine-tuning of a large language model worth the effort?

Abstract

Talk to us

Similar Papers

More From: Toxicology