Recent Advances in Text Analysis

Zheng Tracy Ke,Jiashun Jin,Pengsheng Ji,Wanshan Li

doi:10.1146/annurev-statistics-040522-022138

Zheng Tracy Ke, Jiashun Jin + Show 2 more

Open Access

https://doi.org/10.1146/annurev-statistics-040522-022138

Copy DOI

Abstract

Text analysis is an interesting research area in data science and has various applications, such as in artificial intelligence, biomedical research, and engineering. We review popular methods for text analysis, ranging from topic modeling to the recent neural language models. In particular, we review Topic-SCORE, a statistical approach to topic modeling, and discuss how to use it to analyze the Multi-Attribute Data Set on Statisticians (MADStat), a data set on statistical publications that we collected and cleaned. The application of Topic-SCORE and other methods to MADStat leads to interesting findings. For example, we identified 11 representative topics in statistics. For each journal, the evolution of topic weights over time can be visualized, and these results are used to analyze the trends in statistical research. In particular, we propose a new statistical model for ranking the citation impacts of 11 topics, and we also build a cross-topic citation graph to illustrate how research results on different topics spread to one another. The results on MADStat provide a data-driven picture of the statistical research from 1975 to 2015, from a text analysis perspective.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Annual Review of Statistics and Its Application	Publication Date: Nov 29, 2023
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Recent Advances in Text Analysis

Abstract

Talk to us

Similar Papers

More From: Annual Review of Statistics and Its Application

Lead the way for us

Similar Papers

Data Science in Healthcare: Implications for Early Career Investigators.
Sanjeev P Bhavnani ... Daniel Muñoz
Circulation: Cardiovascular Quality and Outcomes | VOL. 9
Sanjeev P Bhavnani, et. al.Sanjeev P Bhavnani ... Daniel Muñoz
01 Nov 2016
Circulation: Cardiovascular Quality and Outcomes | VOL. 9

Assessing perspectives on artificial intelligence applications to gastroenterology
Gursimran S Kochhar ... Shyam Thakkar
Gastrointestinal Endoscopy | VOL. 93
Gursimran S Kochhar, et. al.Gursimran S Kochhar ... Shyam Thakkar
02 Nov 2020
Gastrointestinal Endoscopy | VOL. 93

Artificial Intelligence in the American Healthcare Industry: Looking Forward to 2030
Federico R Tewes
Journal of Medical Research and Surgery | VOL. 3
Federico R TewesFederico R Tewes
06 Oct 2022
Artificial Intelligence in the American Healthcare Industry: Looking Forward to 2030
Federico R Tewes

How Can IJDS Authors, Reviewers, and Editors Use (and Misuse) Generative AI?
Galit Shmueli ... Bianca Maria Colosimo
INFORMS Journal on Data Science | VOL. 2
Galit Shmueli, et. al.Galit Shmueli ... Bianca Maria Colosimo
01 Apr 2023
INFORMS Journal on Data Science | VOL. 2

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Recent Advances in Text Analysis

Abstract

Talk to us

Similar Papers

More From: Annual Review of Statistics and Its Application