Medical documents classification using topic modeling

Maryam Nuser,Enas Al-Horani

doi:10.11591/ijeecs.v17.i3.pp1524-1530

Abstract

The number of digital medical documents is increasing continuously; several medical websites share a lot of unclassified articles. These articles have very long texts that should be read to determine the topic of each document. The classification of these documents is important so researchers can use these documents easily and the effort and time in reading and searching for a specific topic will be reduced. Therefore, an automatic way to extract latent topics from these text documents is needed. Topic modeling is one of the techniques used to deal with this problem. In this paper, a medical collection of documents is used; this collection contains documents from three types of widespread diseases (Heart Diseases, Blood Pressure and Cholesterol). LDA topic modeling technique is applied to classify these documents into the previous mentioned topics. An evaluation of the algorithm’s results is done and the LDA shows a good level of classification accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Indonesian Journal of Electrical Engineering and Computer Science	Publication Date: Mar 1, 2020
Citations: 2	License type: CC BY-NC 4.0

R Discovery Prime

R Discovery Prime

Medical documents classification using topic modeling

Abstract

Talk to us

Similar Papers

More From: Indonesian Journal of Electrical Engineering and Computer Science

Lead the way for us

Similar Papers

Topic modeling in software engineering research
Camila Costa Silva ... Matthias Galster
Empirical Software Engineering | VOL. 26
Camila Costa Silva, et. al.Camila Costa Silva ... Matthias Galster
06 Sep 2021
Empirical Software Engineering | VOL. 26

Cardiac Disease in Pregnancy
Julie Arafeh ... Yasser Y El-Sayed
NeoReviews | VOL. 5
Julie Arafeh, et. al.Julie Arafeh ... Yasser Y El-Sayed
01 Jun 2004
NeoReviews | VOL. 5

Social Determinants of Risk and Outcomes for Cardiovascular Disease: A Scientific Statement From the American Heart Association.
Edward P Havranek ... Meryl S Cohen
Circulation | VOL. 132
Edward P Havranek, et. al.Edward P Havranek ... Meryl S Cohen
03 Aug 2015
Circulation | VOL. 132

Hypertension Optimal Treatment (HOT) trial
Lennart Hansson ... Alberto Zanchetti
The Lancet | VOL. 352
Lennart Hansson, et. al.Lennart Hansson ... Alberto Zanchetti
01 Aug 1998
The Lancet | VOL. 352

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Medical documents classification using topic modeling

Abstract

Talk to us

Similar Papers

More From: Indonesian Journal of Electrical Engineering and Computer Science