A new approach to distinguish migraine from stroke by mining structured and unstructured clinical data-sources

Elham Sedghi, Jens H Weber, Maximilian Bibok, Alex Thomo, Andrew M W Penn

doi:10.1007/s13721-016-0137-2

Abstract

Distinguishing migraine from stroke is challenging because the two conditions share many signs and symptoms. The cost of hospitalization and the time neurologists and stroke nurses spend visiting, diagnosing, and assigning appropriate care to patients are important considerations; new ways to distinguish stroke, migraine, and other types of mimics can therefore save time and cost and improve decision-making. In this study, we used text and data mining methods to extract the most important predictors from clinical reports, in order to build a migraine detection model that distinguishes migraine patients from stroke or other types of mimic (non-stroke) cases. The data available for this study were a heterogeneous mix of free-text fields, such as triage main complaints and specialist final impressions, and numeric patient data, such as age and blood pressure. After carefully combining these sources, we obtained a highly imbalanced dataset in which migraine cases made up only about 6% of the records. Our main challenge was tackling this imbalance: building classifiers on the dataset in its original form led to a learning bias towards the majority class and against the minority (migraine) class. We used a sampling method to address the imbalance problem. First, the different data sources were preprocessed and balanced datasets were generated; second, attribute selection algorithms were used to reduce the dimensionality of the data; third, a novel combination of data mining algorithms was employed to distinguish migraine from other cases. We achieved a sensitivity of about 80% and a specificity of about 75%, compared with a sensitivity of 15.7% and a specificity of 97% when classifiers were built on the original imbalanced data.
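The abstract does not say which sampling method was used, so the following is only a minimal sketch of one common option, random undersampling of the majority class, together with the sensitivity and specificity measures it reports. The record layout, the label names, and the functions `undersample` and `sensitivity_specificity` are hypothetical and are not taken from the paper; the data are synthetic, with 6 migraine cases out of 100 to mirror the roughly 6% minority share.

```python
import random


def undersample(records, label_key="label", minority="migraine", seed=0):
    """Randomly drop majority-class records until both classes have equal size.

    Assumption: random undersampling stands in for the unspecified
    "sampling method" mentioned in the abstract.
    """
    rng = random.Random(seed)
    minority_rows = [r for r in records if r[label_key] == minority]
    majority_rows = [r for r in records if r[label_key] != minority]
    keep = min(len(minority_rows), len(majority_rows))
    balanced = minority_rows + rng.sample(majority_rows, k=keep)
    rng.shuffle(balanced)
    return balanced


def sensitivity_specificity(y_true, y_pred, positive="migraine"):
    """Return (sensitivity, specificity) with `positive` as the target class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return sensitivity, specificity


# Synthetic, hypothetical dataset: 6 migraine records out of 100.
records = [
    {"id": i, "label": "migraine" if i < 6 else "other"} for i in range(100)
]

balanced = undersample(records)

# A degenerate classifier that always predicts the majority class shows the
# bias described above: perfect specificity, zero sensitivity.
y_true = [r["label"] for r in records]
y_pred = ["other"] * len(records)
majority_only = sensitivity_specificity(y_true, y_pred)
```

On this synthetic data, `balanced` holds 12 records (6 per class), and the always-"other" predictor scores a sensitivity of 0.0 and a specificity of 1.0. This is an extreme version of the low-sensitivity, high-specificity pattern the abstract reports for classifiers built on the imbalanced data.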

Journal: Network Modeling Analysis in Health Informatics and Bioinformatics
Publication Date: Oct 6, 2016
Citations: 2

