An Empirical Study of Clustering Algorithms to extract Knowledge from PubMed Articles

Deepak Agnihotri ,Kesari Verma ,Priyanka Tripathi

doi:10.14738/tmlai.53.3106

Abstract

Extraction of useful information from biomedical literature is one of the thrust for the world nowadays due to availability of almost articles on the web in electronic form. Information retrieval (IR) from biomedical literature is finding useful patterns from the unstructured text corpus that satisfies information. In this paper intelligent text analysis is carried out on PubMed articles related to influenza virus. In this context, various algorithms are discussed to reveal the information from PubMed articles, like year wise count of articles containing influenza virus related terms (viz. H1N1, H5N1, and H7N1 etc.), countries with their publication count, which tells about the outbreaks of the diseases in these countries. The articles may be grouped by searching the keyword “influenza virus strain” pattern with the help of regular expressions. Automatic text categorization is another challenging issue for text mining. We applied k-means, fuzzy C-means, and fuzzy C-shell algorithm for automatic categorization of text articles. The association between words based on their co-occurrence is computed which further helps to categorize the documents based on their co-occurrences. The basic k-means clustering algorithm is first applied to cluster the documents, and then to handle the fuzzy nature of words which may belong to more than one cluster, fuzzy c-means clustering is applied to form more accurate clusters. As Fuzzy c-means method clusters the documents which are in linear spaces but not in the circle, spherical, or ellipsoidal spaces. A new method is proposed here, which considers the clusters of documents in the radius of the circle.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An Empirical Study of Clustering Algorithms to extract Knowledge from PubMed Articles

Abstract

Talk to us

Similar Papers

More From: Transactions on Machine Learning and Artificial Intelligence

Lead the way for us

Similar Papers

Viral Tropism and the Pathogenesis of Influenza in the Mammalian Host
Keith G Mansfield
The American Journal of Pathology | VOL. 171
Keith G MansfieldKeith G Mansfield
01 Oct 2007
The American Journal of Pathology | VOL. 171

Avian Influenza Virus Infections in Humans
Samson S.Y Wong ... Kwok-Yung Yuen
Chest | VOL. 129
Samson S.Y Wong, et. al.Samson S.Y Wong ... Kwok-Yung Yuen
01 Jan 2006
Chest | VOL. 129

Influenza Virus Receptor Specificity: Disease and Transmission
Adolfo García-Sastre
The American Journal of Pathology | VOL. 176
Adolfo García-SastreAdolfo García-Sastre
01 Apr 2010
The American Journal of Pathology | VOL. 176

Clinical Management of Pandemic 2009 Influenza A(H1N1) Infection
David S Hui ... Paul K.S Chan
Chest | VOL. 137
David S Hui, et. al.David S Hui ... Paul K.S Chan
01 Apr 2010
Clinical Management of Pandemic 2009 Influenza A(H1N1) Infection
David S Hui ... Paul K.S Chan

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Empirical Study of Clustering Algorithms to extract Knowledge from PubMed Articles

Abstract

Talk to us

Similar Papers

More From: Transactions on Machine Learning and Artificial Intelligence