Abstract

Chronic obstructive pulmonary disease (COPD) is a chronic respiratory disease that seriously endangers human health and has high incidence and mortality worldwide. Therefore, an effective predictive model is required for COPD diagnosis. Given the limited data samples available in current COPD studies, we propose a method for diagnosing COPD based on transfer learning called balanced probability distribution (BPD) algorithm; this algorithm integrates instance- and feature-based transfers to improve the prediction accuracy of the model. First, instance-based cascaded transfer learning was used to initialize the weight distribution of the training data and obtain instances closer to the target domain. Second, the cross-domain feature filtering algorithm was adopted to filter irrelevant features, eliminate redundant features, and obtain the co-occurrence features of the source and target domains. Moreover, the remaining features were assigned different weights and transformed into the same space to reduce the distribution difference between the domains. Third, the BPD algorithm was used to balance the examples and the co-occurrence features from multiple disease source domains and construct a more suitable classification model of the target domain. Finally, the elastic network was used to further improve the generalization performance of the model. The experimental results show that the prediction effect of the BPD model is better than that of state-of-the-art methods and has strong generalization ability and robustness. We proved that our proposed BPD method works well in the COPD prediction model when the sample size is small.

Highlights

  • Chronic obstructive pulmonary disease (COPD) is a preventable and treatable common disease characterized by persistent respiratory symptoms and restricted airflow

  • To verify the effectiveness of the balanced probability distribution (BPD) algorithm based on the instance and feature transfers, we performed experiments on the following two datasets: the COPD dataset provided by the Clinical Medical Science Data Center and the COPD dataset extracted from the electronic medical records of a partner medical system [42]

  • To verify the effectiveness of the model proposed in this paper, we first compare the BPD algorithm with the TraAdaBoost algorithm, transfer component analysis (TCA) algorithm, and multi-task learning (MTL) algorithm based on the classical transfer learning method in terms of accuracy and F1 values

Read more

Summary

INTRODUCTION

Chronic obstructive pulmonary disease (COPD) is a preventable and treatable common disease characterized by persistent respiratory symptoms and restricted airflow. In some bioinformatics fields, such as COPD diagnosis, it is very difficult to construct large-scale well-labeled datasets because of the high cost of data collection and labeling; this limits the development of deep learning in these fields. Transfer learning has become a new learning framework to solve many knowledge transfer problems [10] In this work, it has a great positive impact on performance improvement despite limited training data. In this paper, we propose a novel transfer learning method based on instance-based and feature-based transfer called the balanced probability distribution (BPD) method, which solves the problem of having a small sample size. (2) By adopting cascading instance-based transfer in our BPD, we solved the problem of sparse data in the COPD prediction because the effective examples learned from multiple disease source domains were retained. The BPD prediction effect is better than other methods, and it has a strong generalization ability, which proves that the BPD method improves the prediction ability for small sample data

RELATED WORK
EXPERIMENTAL RESULTS
Findings
CONCLUSION
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.