BEHRT: Transformer for Electronic Health Records

Yikuan Li, Yajie Zhu, José Roberto Ayala Solares, Dexter Canoy, Kazem Rahimi, Rema Ramakrishnan, Abdelaali Hassaine, Gholamreza Salimi-Khorshidi, Shishir Rao

doi:10.1038/s41598-020-62922-y

Open Access

https://doi.org/10.1038/s41598-020-62922-y

Journal: Scientific Reports	Publication Date: Apr 28, 2020
Citations: 272	License type: open-access

Affiliation: University of Oxford

Abstract

Today, despite decades of developments in medicine and the growing interest in precision healthcare, the vast majority of diagnoses happen only once patients begin to show noticeable signs of illness. Early indication and detection of diseases, however, can provide patients and carers with the chance of early intervention, better disease management, and efficient allocation of healthcare resources. The latest developments in machine learning (including deep learning) provide a great opportunity to address this unmet need. In this study, we introduce BEHRT: a deep neural sequence transduction model for electronic health records (EHR), capable of simultaneously predicting the likelihood of 301 conditions in one’s future visits. When trained and evaluated on the data from nearly 1.6 million individuals, BEHRT shows a striking improvement of 8.0–13.2% (in terms of average precision scores for different tasks) over the existing state-of-the-art deep EHR models. In addition to its scalability and superior accuracy, BEHRT enables personalised interpretation of its predictions; its flexible architecture enables it to incorporate multiple heterogeneous concepts (e.g., diagnosis, medication, measurements, and more) to further improve the accuracy of its predictions; and its (pre-)training results in disease and patient representations that can be useful for future studies (i.e., transfer learning).

Highlights

The field of precision healthcare aims to improve the provision of care through precise and personalised prediction, prevention, and intervention.
To further investigate BEHRT’s predictive performance, we carried out three experiments: (1) we investigated whether BEHRT can implicitly learn gender and utilise this latent understanding in subsequent visit prediction; (2) we carried out an ablation study by selectively deactivating age, segment, and/or position embeddings and observing the effect on average precision score (APS) and area under the receiver operating characteristic curve (AUROC); and (3) we assessed the model’s performance on the prediction of new instances of diseases.
We introduced a novel deep neural network model for electronic health records (EHR) called BEHRT: an interpretable personalised risk model, which scales across a range of diseases and incorporates a wide range of EHR modalities/concepts in its modular architecture.
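The two metrics named in the highlights, APS and AUROC, can be illustrated with a small sketch. The code below is not the authors' evaluation code; it is a minimal pure-Python version, assuming binary labels and untied scores, and the function names `average_precision` and `auroc` as well as the toy data are hypothetical.

```python
def average_precision(y_true, y_score):
    """Mean of the precision values measured at the rank of each positive label.

    Assumes binary labels (0/1) and no tied scores.
    """
    # Rank examples from the highest score to the lowest.
    order = sorted(range(len(y_score)), key=lambda i: y_score[i], reverse=True)
    hits = 0
    precision_sum = 0.0
    for rank, i in enumerate(order, start=1):
        if y_true[i] == 1:
            hits += 1
            precision_sum += hits / rank  # precision at this cut-off
    return precision_sum / hits if hits else 0.0


def auroc(y_true, y_score):
    """Probability that a random positive outscores a random negative.

    Ties between a positive and a negative count as half a win.
    """
    pos = [s for t, s in zip(y_true, y_score) if t == 1]
    neg = [s for t, s in zip(y_true, y_score) if t == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Toy example for one hypothetical condition across four patients.
labels = [1, 0, 1, 0]
scores = [0.9, 0.8, 0.7, 0.1]
aps_value = average_precision(labels, scores)
auroc_value = auroc(labels, scores)
```

In a multi-label setting such as predicting many conditions at once, one common choice is to compute these per condition and then average, but how the scores are aggregated in the study is not stated in this section.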

Summary

Introduction

The field of precision healthcare aims to improve the provision of care through precise and personalised prediction, prevention, and intervention. Recent developments in deep learning have provided us with models that can learn useful representations (e.g., of individuals or concepts) from raw or minimally-processed data, with minimal need for expert guidance[9]. This happens through a sequence of layers, each employing a large number of simple linear and nonlinear transformations to map their corresponding inputs to a representation; this progression across layers results in a final representation in which the data points form distinguishable patterns. Miotto et al.[12] employed a stack of denoising autoencoders (SDA) instead of RBM, and showed that it outperforms many popular feature extraction and feature transformation approaches (e.g., PCA, ICA[13] and Gaussian mixture models) for providing classifiers with useful patient representations to predict the onset of a number of diseases from EHR. These early works on the application of DL to EHR did not take into account the subtleties of EHR data (e.g., the irregularity of the inter-visit intervals, and the temporal order of events). Both these works employed some embedding techniques to map non-numeric medical concepts to an algebraic space in which the sequence models can operate.
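As a rough illustration of mapping non-numeric medical concepts to an algebraic space, the sketch below looks up a vector for each diagnosis code and optionally adds age, segment, and position vectors, with flags that mimic switching components off in an ablation. Combining the components by element-wise summation is an assumption borrowed from BERT-style models, not something this section states; the table sizes, the toy code IDs, and the names `make_table` and `embed_visits` are all hypothetical.

```python
import random


def make_table(num_rows, dim, rng):
    """Create a randomly initialised lookup table: one vector per ID."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(dim)] for _ in range(num_rows)]


def embed_visits(codes, ages, segments, tables,
                 use_age=True, use_segment=True, use_position=True):
    """Return one vector per token: the code vector plus any enabled extras.

    codes, ages and segments are equal-length lists of integer IDs.
    Summation of the components is an illustrative assumption.
    """
    vectors = []
    for position, (code, age, segment) in enumerate(zip(codes, ages, segments)):
        vec = list(tables["code"][code])  # copy so the table is not mutated
        extras = []
        if use_age:
            extras.append(tables["age"][age])
        if use_segment:
            extras.append(tables["segment"][segment])
        if use_position:
            extras.append(tables["position"][position])
        for extra in extras:
            vec = [a + b for a, b in zip(vec, extra)]
        vectors.append(vec)
    return vectors


rng = random.Random(0)
dim = 4
tables = {
    "code": make_table(10, dim, rng),      # hypothetical vocabulary of 10 codes
    "age": make_table(120, dim, rng),      # one row per age in years
    "segment": make_table(2, dim, rng),    # alternating visit marker
    "position": make_table(16, dim, rng),  # maximum sequence length of 16
}

# Three diagnosis tokens over two visits for one hypothetical patient.
codes, ages, segments = [3, 7, 1], [54, 54, 56], [0, 0, 1]
full = embed_visits(codes, ages, segments, tables)
codes_only = embed_visits(codes, ages, segments, tables,
                          use_age=False, use_segment=False, use_position=False)
```

In a trained model these tables would be learned parameters rather than random values, and the resulting vectors would feed the sequence model; the flags here only show how a component can be dropped from the input without changing anything else.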

Objectives

Methods

Results

Conclusion
