Supporting the working life exposome: Annotating occupational exposure for enhanced literature search.

Paul Thompson, Karen S Galea, Nhung Nguyen, Vivi Schlünssen, Zara Ann Stokholm, Qianqian Xie, Jorunn Kirkeleit, Roberto Nuñez, Christine Cramer, Sophia Ananiadou, Martie Van Tongeren, Håkan Tinnerberg, Panagiotis Georgiadis, Evana Amir Taher, Eelco Kuijpers, Ioannis Basinas, Bendik C Brinchmann, Calvin Ge

doi:10.1371/journal.pone.0307844

Abstract

An individual's likelihood of developing non-communicable diseases is often influenced by the types, intensities and durations of exposures at work. Job exposure matrices provide exposure estimates associated with different occupations. However, due to their time-consuming expert curation process, job exposure matrices currently cover only a subset of possible workplace exposures and may not be regularly updated. Scientific literature articles describing exposure studies provide important supporting evidence for developing and updating job exposure matrices, since they report on exposures in a variety of occupational scenarios. However, the constant growth of scientific literature is increasing the challenges of efficiently identifying relevant articles and important content within them. Natural language processing methods emulate the human process of reading and understanding texts, but in a fraction of the time. Such methods can increase the efficiency of both finding relevant documents and pinpointing specific information within them, which could streamline the process of developing and updating job exposure matrices. Named entity recognition is a fundamental natural language processing method for language understanding, which automatically identifies mentions of domain-specific concepts (named entities) in documents, e.g., exposures, occupations and job tasks. State-of-the-art machine learning models typically use evidence from an annotated corpus, i.e., a set of documents in which named entities are manually marked up (annotated) by experts, to learn how to detect named entities automatically in new documents. We have developed a novel annotated corpus of scientific articles to support machine learning-based named entity recognition relevant to occupational substance exposures. Through incremental refinements to the annotation process, we demonstrate that expert annotators can attain high levels of agreement, and that the corpus can be used to train high-performance named entity recognition models. The corpus thus constitutes an important foundation for the wider development of natural language processing tools to support the study of occupational exposures.
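
To make the idea of an annotated corpus and annotator agreement concrete, the following is a minimal sketch, not the authors' method. It assumes annotations are stored as character-offset spans with a label, and it measures agreement between two annotators as exact-match F1, one common choice for named entity annotation. The example sentence, the label names (`Occupation`, `Exposure`, `JobTask`) and the helper names are hypothetical; the abstract does not specify the annotation scheme or the agreement measure used.

```python
from __future__ import annotations

from typing import NamedTuple


class Span(NamedTuple):
    """One annotated entity mention: character offsets plus a label."""
    start: int
    end: int
    label: str


def make_span(text: str, phrase: str, label: str) -> Span:
    """Hypothetical helper: locate the first occurrence of phrase in text."""
    start = text.index(phrase)
    return Span(start, start + len(phrase), label)


def exact_match_f1(a: set[Span], b: set[Span]) -> float:
    """Agreement between two annotators, treating annotator a as reference.

    A span counts as agreed only if its offsets and label match exactly.
    """
    if not a and not b:
        return 1.0  # both annotators marked nothing: trivially in agreement
    agreed = len(a & b)
    if agreed == 0:
        return 0.0
    precision = agreed / len(b)
    recall = agreed / len(a)
    return 2 * precision * recall / (precision + recall)


# Illustrative sentence and labels (assumptions, not taken from the corpus).
text = "Welders were exposed to manganese fumes while grinding steel."

annotator_a = {
    make_span(text, "Welders", "Occupation"),
    make_span(text, "manganese fumes", "Exposure"),
    make_span(text, "grinding steel", "JobTask"),
}
annotator_b = {
    make_span(text, "Welders", "Occupation"),
    make_span(text, "manganese", "Exposure"),  # boundary disagreement
    make_span(text, "grinding steel", "JobTask"),
}

agreement = exact_match_f1(annotator_a, annotator_b)
print(f"{agreement:.3f}")  # 0.667
```

Here the two annotators agree on two of three spans, and the single boundary disagreement ("manganese fumes" versus "manganese") lowers the score. Disagreements of this kind are what refinements to annotation guidelines typically aim to reduce.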

Journal: PLOS ONE
Publication Date: Jan 1, 2024
License type: CC BY

Similar Papers

O-114 Natural language processing as a tool for developing and updating job exposure matrices for chemical exposures in the general population
Ioannis Basinas ... Eelco Kuijpers
Occupational and Environmental Medicine | VOL. 80
01 Mar 2023

Occupational quantitative exposure to crystalline silica, solvents and pesticides and risk of clinical forms of systemic sclerosis
Gaël Galli ... Camille De Pous-Gerardin
Rheumatology | VOL. -
14 Nov 2023

Evaluation of the Suitability of an Existing Job-Exposure Matrix for the Assessment of Exposure of UK Biobank Participants to Dust, Fumes, and Diesel Exhaust Particulates.
Eirini Dimakakou ... George Streftaris
International Journal of Environmental Research and Public Health | VOL. 17
01 Jul 2020

Work-Relatedness.
William W Greaves ... Rajiv Das
Journal of Occupational & Environmental Medicine | VOL. 60
01 Dec 2018
