BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model

Hongyi Yuan, Ruyi Gan, Zheng Yuan, Jiaxing Zhang, Yutao Xie, Sheng Yu

doi:10.18653/v1/2022.bionlp-1.9

Abstract

Pretrained language models have served as important backbones for natural language processing. Recently, in-domain pretraining has been shown to benefit various domain-specific downstream tasks. In the biomedical domain, natural language generation (NLG) tasks are of critical importance yet understudied. Approaching natural language understanding (NLU) tasks as NLG achieves satisfying performance in the general domain through constrained language generation or language prompting. We emphasize the lack of in-domain generative language models and the unsystematic generative downstream benchmarks in the biomedical domain, both of which hinder the development of the research community. In this work, we introduce the generative language model BioBART, which adapts BART to the biomedical domain. We collate various biomedical language generation tasks, including dialogue, summarization, entity linking, and named entity recognition. BioBART, pretrained on PubMed abstracts, achieves enhanced performance compared to BART and sets strong baselines on several tasks. Furthermore, we conduct ablation studies on the pretraining tasks for BioBART and find that sentence permutation has negative effects on downstream tasks.
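
For readers unfamiliar with the sentence permutation objective named in the abstract, the following is a minimal sketch of the idea as described for BART: a document is split into sentences and the sentences are shuffled, and the model is trained to reconstruct the original order. The function name `permute_sentences`, the regex-based sentence splitter, and the `seed` parameter are illustrative assumptions, not the authors' preprocessing code.

```python
import random
import re
from typing import Optional


def permute_sentences(text: str, seed: Optional[int] = None) -> str:
    """Return `text` with its sentences in random order (illustrative only)."""
    # Assumption: a naive splitter that breaks on whitespace following
    # '.', '!' or '?'. Real pipelines typically use a proper sentence tokenizer.
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    rng = random.Random(seed)  # local RNG so the global random state is untouched
    rng.shuffle(sentences)
    return " ".join(sentences)


if __name__ == "__main__":
    doc = "Aspirin reduces fever. It also thins the blood. Dosage varies by patient."
    # The shuffled text is the noised model input; `doc` is the reconstruction target.
    print(permute_sentences(doc, seed=0))
```

In a denoising setup of this kind, the shuffled text serves as the encoder input and the original text as the decoder target. The abstract reports that including this objective hurt downstream performance for BioBART.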

Publication Date: Jan 1, 2022
Citations: 37	License type: cc-by

Similar Papers

K-PLUG: Knowledge-injected Pre-trained Language Model for Natural Language Understanding and Generation in E-Commerce
Song Xu ... Haoran Li
01 Jan 2020

A Review of Current Trends, Techniques, and Challenges in Large Language Models (LLMs)
Rajvardhan Patil ... Venkat Gudivada
Applied Sciences | VOL. 14
01 Mar 2024

Language Acquisition in Language Models and Humans: A Chronological Probing Study
Ekaterina Voloshina ... Oleg Serikov
18 Jun 2022
