Turtling: a time-aware neural topic model on NIH grant data.

Ruiyi Zhang, Ziheng Duan, Martin Renqiang Min, Cheyu Lee, Jing Zhang, Dylan Riffle

doi:10.1093/bioadv/vbad096


Open Access



Abstract

Recent initiatives for federal grant transparency allow direct knowledge extraction from large volumes of grant texts, serving as a powerful alternative to traditional surveys. However, computational modeling of grant text is challenging because grants are usually multifaceted and their topics evolve constantly. We propose Turtling, a time-aware neural topic model with three unique characteristics. First, Turtling employs pretrained biomedical word embeddings to extract research topics. Second, it leverages a probabilistic time-series model to allow smooth and coherent topic evolution. Lastly, Turtling adds a topic diversity loss and a funding institute classification loss to improve topic quality and facilitate funding institute prediction. We apply Turtling to publicly available NIH grant text and show that it significantly outperforms other methods on topic quality metrics. We also demonstrate that Turtling can provide insights into research topic evolution by detecting topic trends across decades. In summary, Turtling may be a valuable tool for grant text analysis. Turtling is freely available as open-source software at https://github.com/aicb-ZhangLabs/Turtling.
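To make the roles of the loss terms concrete, here is a minimal sketch of how a diversity penalty and a temporal smoothness penalty could be combined with a reconstruction loss. It is an illustration under assumptions, not Turtling's actual objective: the abstract does not define the losses, so the function names (`diversity_penalty`, `smoothness_penalty`, `total_loss`), the use of cosine similarity, the squared-difference smoothness term (which matches a Gaussian random walk up to constants), and the weights `w_div` and `w_smooth` are all hypothetical. The classification loss is omitted.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def diversity_penalty(topics):
    """Assumed form: mean pairwise cosine similarity between topic vectors.

    Lower values mean the topics point in more distinct directions.
    Requires at least two topics.
    """
    pairs = [(i, j) for i in range(len(topics)) for j in range(i + 1, len(topics))]
    return sum(cosine(topics[i], topics[j]) for i, j in pairs) / len(pairs)


def smoothness_penalty(trajectory):
    """Assumed form: mean squared change of one topic vector between
    consecutive time steps. Requires at least two time steps.
    """
    steps = list(zip(trajectory, trajectory[1:]))
    diffs = [sum((b - a) ** 2 for a, b in zip(prev, curr)) for prev, curr in steps]
    return sum(diffs) / len(diffs)


def total_loss(recon, topics, trajectory, w_div=1.0, w_smooth=1.0):
    """Hypothetical weighted sum of a reconstruction loss and both penalties."""
    return (recon
            + w_div * diversity_penalty(topics)
            + w_smooth * smoothness_penalty(trajectory))


if __name__ == "__main__":
    topics = [[1.0, 0.0], [0.0, 1.0]]                  # two orthogonal topics
    trajectory = [[0.0, 0.0], [1.0, 0.0], [1.0, 2.0]]  # one topic over three years
    print(total_loss(1.0, topics, trajectory))
```

In this toy setup, orthogonal topics incur no diversity penalty, while a topic vector that jumps sharply between years raises the smoothness term. In a real model the weights would be tuned alongside the reconstruction and classification losses.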

Journal: Bioinformatics advances
Publication Date: Jan 5, 2023
License type: CC BY 4.0
