CurieLM: Enhancing Large Language Models for Nuclear Domain Applications

Zakaria Bouhoun,Ahmed Allali,Riccardo Cocci,Mohamad Ali Assaad,Alexandra Plancon,Frederic Godest,Kirill Kondratenko,Julien Rodriguez,Francesco Vitillo,Olivier Malhomme,Lies Benmiloud Bechet,Robert Plana

doi:10.1051/epjconf/202430217006

Abstract

Large Language Models (LLMs), such as the Mistral model, have exhibited remarkable performance across diverse tasks. However, their efficacy in nuclear applications remains constrained by a lack of domain-specific knowledge and an inability to effectively leverage that knowledge. Nuclear-related tasks, including safety assessments and requirement analyses, pose unique challenges due to the intricate domain expertise and diverse constraints involved. To address these limitations, we introduce CurieLM, an LLM specifically tailored for the nuclear domain. CurieLM builds upon the Mistral model, enhancing its capabilities through domain-specific fine-tuning. Our team of nuclear engineers overcame the initial hurdle of accessing high-quality nuclear data, enabling CurieLM to comprehend and accurately respond to nuclear-specific instructions. This manuscript outlines the development and optimization process of CurieLM, marking a significant step toward enhancing nuclear-related natural language processing tasks. Experimental results demonstrate a 13% performance improvement over base LLMs, underscoring the effectiveness of our approach. Domain-specific LLMs like CurieLM hold a great potential across various applications, and this study sets the stage for further exploration in this emerging field.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

CurieLM: Enhancing Large Language Models for Nuclear Domain Applications

Abstract

Talk to us

Similar Papers

More From: EPJ Web of Conferences

Lead the way for us

Journal: EPJ Web of Conferences	Publication Date: Jan 1, 2024
License type: CC BY 4.0

Similar Papers

A Large and Diverse Arabic Corpus for Language Modeling
Abbas Raza Ali ... Hasan Raza Ali
Procedia Computer Science | VOL. 225
Abbas Raza Ali, et. al.Abbas Raza Ali ... Hasan Raza Ali
01 Jan 2023
Procedia Computer Science | VOL. 225

BB-GeoGPT: A framework for learning a large language model for geographic information science
Yifan Zhang ... Wenhao Yu
Information Processing and Management | VOL. 61
Yifan Zhang, et. al.Yifan Zhang ... Wenhao Yu
22 Jun 2024
Information Processing and Management | VOL. 61

Use of SNOMED CT in Large Language Models: Scoping Review.
Eunsuk Chang ... Sumi Sung
JMIR medical informatics | VOL. 12
Eunsuk Chang, et. al.Eunsuk Chang ... Sumi Sung
07 Oct 2024
JMIR medical informatics | VOL. 12

#2924 Comparison of large language models and traditional natural language processing techniques in predicting arteriovenous fistula failure
Suman Lama ... Luca Neri
Nephrology Dialysis Transplantation | VOL. 39
Suman Lama, et. al.Suman Lama ... Luca Neri
23 May 2024
Nephrology Dialysis Transplantation | VOL. 39

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

CurieLM: Enhancing Large Language Models for Nuclear Domain Applications

Abstract

Talk to us

Similar Papers

More From: EPJ Web of Conferences