COVIDSum: A linguistically enriched SciBERT-based summarization model for COVID-19 scientific papers

Xiaoyan Cai, Sen Liu, Libin Yang, Yan Lu, Jintao Zhao, Dinggang Shen, Tianming Liu

doi:10.1016/j.jbi.2022.103999

Abstract

The coronavirus disease (COVID-19) has claimed the lives of over 350,000 people and infected more than 173 million people worldwide, prompting researchers from diverse fields to accelerate their work on diagnostics, therapies, and vaccines. Researchers also publish their recent progress in scientific papers. However, manually writing the abstract of a paper is time-consuming and adds to the researchers' writing burden. Abstractive summarization, which automatically provides researchers with reliable draft abstracts, can alleviate this problem. In this work, we propose a linguistically enriched SciBERT-based summarization model for COVID-19 scientific papers, named COVIDSum. Specifically, we first extract salient sentences from source papers and construct word co-occurrence graphs. Then, we adopt a SciBERT-based sequence encoder and a Graph Attention Networks-based graph encoder to encode the sentences and the word co-occurrence graphs, respectively. Finally, we fuse the two encodings and generate an abstractive summary of each scientific paper. When evaluated on the publicly available COVID-19 Open Research Dataset, our proposed model achieves significant improvement over other document summarization models.
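The graph-construction step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say how the word co-occurrence graphs are built, so everything here is an assumption made for illustration: the regular-expression tokenizer, the sliding-window definition of co-occurrence, the `window` parameter, and the function names `tokenize` and `build_cooccurrence_graph`. It is not the authors' implementation.

```python
import re
from collections import defaultdict


def tokenize(sentence):
    """Lowercase a sentence and split it into simple word tokens (assumed tokenizer)."""
    return re.findall(r"[a-z0-9]+(?:-[a-z0-9]+)*", sentence.lower())


def build_cooccurrence_graph(sentences, window=3):
    """Count how often two distinct words appear within `window` tokens of each other.

    Returns an undirected weighted edge map {(word_a, word_b): count},
    where each key is sorted alphabetically. The window size is a
    hypothetical choice, not a value taken from the paper.
    """
    edges = defaultdict(int)
    for sentence in sentences:
        tokens = tokenize(sentence)
        for i, left in enumerate(tokens):
            # Pair the current token with the next (window - 1) tokens.
            for right in tokens[i + 1 : i + window]:
                if left != right:
                    edges[tuple(sorted((left, right)))] += 1
    return dict(edges)


# Toy "salient sentences"; in the described pipeline these would be
# extracted from the source paper first.
sentences = [
    "SciBERT encodes the salient sentences.",
    "A graph encoder encodes the word graph.",
]
graph = build_cooccurrence_graph(sentences, window=2)
```

With `window=2` only adjacent words are linked, so the pair `("encodes", "the")` receives weight 2 because it occurs in both sentences. In a pipeline like the one described, such a graph would be the input to the graph encoder, while the sentences themselves go to the sequence encoder.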

Journal: Journal of Biomedical Informatics
Publication Date: Jan 30, 2022
Citations: 18
License type: publisher-specific-oa

Similar Papers

Predictors of intrahospital mortality in patients with coronavirus disease 2019 and cerebrovascular diseases: rapid systematic review and meta-analysis protocol
Iván Pérez-Neri ... Philippe Tadger
Archivos de Neurociencias | VOL. -
02 Feb 2023

Predictors of intrahospital mortality in patients with coronavirus disease 2019 and cerebrovascular diseases: rapid systematic review and meta-analysis protocol
Iván Pérez-Neri ... Ashutosh Kumar Singh
Archivos de Neurociencias | VOL. 29
02 Feb 2023

The Impact of the COVID-19 Pandemic on Scientific Publishing
Philip D Sloane ... Sheryl Zimmerman
Journal of the American Medical Directors Association | VOL. 22
28 Jan 2021

WITHDRAWN: Classification of covid related articles using machine learning
Deepthi Godavarthi ... Mary Sowjanya A
Materials Today: Proceedings | VOL. 383
01 Feb 2021
