GRiD: Gathering rich data from PubMed using one-class SVM

Junbum Cha, Sanghyun Park, Jeongwoo Kim

doi:10.1109/smc.2016.7844911

Abstract

The Medical Subject Headings (MeSH) term search is a typical data-gathering method in biomedical text mining. However, it has two problems: delay in MeSH term allocation and missed valuable literature sources. Because MeSH terms are allocated by humans, the allocation process introduces a delay. In addition, even after a literature source has been assigned MeSH terms, valuable sources can still be missed during data gathering: some sources are not indexed under the MeSH term of a keyword even though they contain valuable information related to that term, so a MeSH term search does not retrieve them. To resolve these problems, we propose a novel method for gathering rich data using a one-class support vector machine (SVM) and a relevance rule. Term frequency-inverse document frequency (TF-IDF) and paragraph vectors are examined as text vectorization methods with various parameters and relevance factors. We apply our method to lung cancer, prostate cancer, breast cancer, and Alzheimer's disease. As a result, up to 26% of keyword data and 35% of target data are gathered with high quality (a C-score of at least 0.948).
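
To make the TF-IDF vectorization step mentioned in the abstract concrete, the following is a minimal sketch in plain Python. The abstract does not specify the tokenizer, the IDF variant, or the normalization, so the choices here (alphanumeric tokenization, smoothed IDF, L2 normalization) are assumptions, and the three sample documents and all function names are hypothetical illustrations, not the authors' data or code. Vectors of this kind are what a one-class classifier would be trained on.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase and split on any non-alphanumeric character.
    cleaned = "".join(c.lower() if c.isalnum() else " " for c in text)
    return cleaned.split()


def tfidf_vectors(docs):
    """Return (idf table, list of L2-normalized sparse TF-IDF vectors)."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)

    # Document frequency: number of documents containing each term.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    # Assumption: smoothed IDF, log((1 + N) / (1 + df)) + 1.
    idf = {t: math.log((1 + n_docs) / (1 + c)) + 1.0 for t, c in df.items()}

    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        vec = {t: freq * idf[t] for t, freq in tf.items()}
        norm = math.sqrt(sum(w * w for w in vec.values())) or 1.0
        vectors.append({t: w / norm for t, w in vec.items()})
    return idf, vectors


def cosine(a, b):
    # Both inputs are already L2-normalized, so the dot product is the cosine.
    return sum(w * b.get(t, 0.0) for t, w in a.items())


# Hypothetical mini-corpus standing in for PubMed abstracts.
docs = [
    "lung cancer screening with low dose CT",
    "EGFR mutation in lung cancer patients",
    "amyloid beta plaques in Alzheimer's disease",
]
idf, vectors = tfidf_vectors(docs)
sim_related = cosine(vectors[0], vectors[1])    # two lung cancer texts
sim_unrelated = cosine(vectors[0], vectors[2])  # lung cancer vs. Alzheimer's
```

In this toy corpus the two lung cancer texts share weighted terms and the Alzheimer's text shares none with the first, which is the kind of separation a one-class model relies on when it learns a boundary around a single target class.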

Similar Papers

The impact of MeSH (Medical Subject Headings) terms on information seeking effectiveness
Ying-Hsang Liu
ACM SIGIR Forum | VOL. 43
14 Dec 2009

Not just keywords but MeSH keywords: Do mention for better visibility of your publication.
Manisha D Katikar ... Vanita Ahuja
Indian Journal of Anaesthesia | VOL. 67
01 Mar 2023

Using Link Prediction Methods to Examine Networks of Co-occurring MeSH Terms in Zika and CRISPR Research
Meng-Hao Li
01 Jan 2020

HTA Database Canadian Repository
Ashleigh Faith ... Tanja Bekhuis
Journal of the Medical Library Association : JMLA | VOL. 103
01 Oct 2015
