Abstract

Cold pathogenic disease is a widespread disease category in traditional Chinese medicine that includes influenza and respiratory infections, which are associated with high incidence and mortality. Discovering effective core drugs in Chinese medicine prescriptions for treating the disease and relieving patients' symptoms has attracted great interest. In this paper, we explore the core drugs for curing various syndromes of cold pathogenic disease from large-scale literature. We propose a core drug discovery framework incorporating word embedding and community detection algorithms, which contains three parts: disease corpus construction, drug network generation, and core drug discovery. First, a disease corpus is established by collecting and preprocessing large-scale literature about the Chinese medicine treatment of cold pathogenic disease from China National Knowledge Infrastructure. Second, we adopt the Chinese word embedding model SSP2VEC to mine the meaning of drugs implied in the literature; a drug network is then built from the semantic similarity among drugs. Third, the community detection method COPRA, which is based on label propagation, is adopted to reveal drug communities and identify core drugs in the drug network. We compute the community size, closeness centrality, and degree distributions of the drug network to analyse the patterns of core drugs. We acquire 4681 articles from China National Knowledge Infrastructure. Twelve significant drug communities are discovered, in which the top-10 drugs in every drug community are recognized as core drugs with high accuracy, and four classical prescriptions for treating different syndromes of cold pathogenic disease are discovered. The proposed framework can identify effective core drugs for curing cold pathogenic disease, and the research can help doctors to verify the compatibility laws of Chinese medicine prescriptions.
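The second and third parts of the framework (a drug network built from embedding similarity, then label-propagation community detection) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the vectors are toy values rather than SSP2VEC output, the node names and the 0.9 threshold are hypothetical, and the plain non-overlapping label propagation shown here is a simplified stand-in for COPRA, which also allows overlapping communities.

```python
import math
from collections import Counter


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def build_network(vectors, threshold):
    """Link two drugs when their embedding similarity reaches the threshold."""
    names = sorted(vectors)
    graph = {name: set() for name in names}
    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if cosine(vectors[a], vectors[b]) >= threshold:
                graph[a].add(b)
                graph[b].add(a)
    return graph


def label_propagation(graph, max_rounds=100):
    """Plain label propagation (a stand-in for COPRA, no overlap).

    Each node repeatedly adopts the most common label among its
    neighbours; ties are broken by the smallest label so the result
    is deterministic.
    """
    labels = {node: node for node in graph}
    for _ in range(max_rounds):
        changed = False
        for node in sorted(graph):
            if not graph[node]:
                continue  # isolated node keeps its own label
            counts = Counter(labels[n] for n in graph[node])
            best = max(counts.values())
            new_label = min(l for l, c in counts.items() if c == best)
            if new_label != labels[node]:
                labels[node] = new_label
                changed = True
        if not changed:
            break
    communities = {}
    for node, label in labels.items():
        communities.setdefault(label, set()).add(node)
    return sorted(communities.values(), key=lambda c: sorted(c))


# Hypothetical 2-dimensional vectors forming two obvious groups.
toy_vectors = {
    "A1": [1.0, 0.0], "A2": [0.9, 0.1], "A3": [0.95, 0.05],
    "B1": [0.0, 1.0], "B2": [0.1, 0.9], "B3": [0.05, 0.95],
}
toy_graph = build_network(toy_vectors, threshold=0.9)
toy_communities = label_propagation(toy_graph)
```

On the toy data, the A nodes and the B nodes end up in separate communities because no cross-group pair reaches the similarity threshold. In a real setting the threshold and the way core drugs are ranked inside each community would have to follow the paper's own definitions, which this summary does not give.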

Highlights

  • Cold pathogenic disease (CPD, 中医伤寒) is the general term for exogenous febrile diseases in traditional Chinese medicine (TCM): a class of diseases with fever as the main clinical symptom, caused by contracting pathogenic factors and the six climatic exopathogens in TCM [1,2,3]

  • After searching China National Knowledge Infrastructure (CNKI) by keyword pairs, we collect 4681 articles about the TCM treatment of CPD and process them according to stage 1; the resulting disease corpus contains 50 million tokens

  • All of the literature is relevant to the treatment of CPD in TCM, so we can accept that semantic analysis can better capture the semantics of Chinese drugs and obtain good semantic vectors. Then, we apply the core drug discovery framework (CDDF) to the corpus to discover core drugs for treating CPD and compare it with CSG + COPRA, in which continuous skip-gram (CSG)

Summary

Introduction

Cold pathogenic disease (CPD, 中医伤寒) is the general term for exogenous febrile diseases in traditional Chinese medicine (TCM): a class of diseases with fever as the main clinical symptom, caused by contracting pathogenic factors and the six climatic exopathogens (wind, cold, heat, dampness, dryness, and fire, 六种外感病邪) in TCM [1,2,3]. Based on the core drugs discovered for treating different syndromes of CPD, doctors can optimize compatibility combinations and find more effective prescriptions, which supports accurate medication. Data mining methods analyse the compatibility rules and core drugs of TCM prescriptions in medical records by computing the frequency and co-occurrence relations of drugs in the prescriptions [27, 28]; they mainly concentrate on analysing medical records and can handle large-scale data [29, 30]. However, they cannot capture the meaning of drugs in these records. The top-10 drugs in each drug community contain the most correct core drugs for treating CPD.
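The frequency and co-occurrence statistics that the data mining methods above rely on can be sketched as follows. This is a minimal illustration under stated assumptions, not the procedure of any cited work: the function name `drug_statistics`, the placeholder drug names, and the toy prescriptions are all hypothetical.

```python
from collections import Counter
from itertools import combinations


def drug_statistics(prescriptions):
    """Count how often each drug, and each unordered drug pair, appears.

    A drug listed twice in one prescription is counted once for it.
    """
    frequency = Counter()
    cooccurrence = Counter()
    for prescription in prescriptions:
        drugs = sorted(set(prescription))
        frequency.update(drugs)
        # Pairs are stored in sorted order, e.g. ("drug_a", "drug_b").
        cooccurrence.update(combinations(drugs, 2))
    return frequency, cooccurrence


# Hypothetical prescriptions with placeholder drug names.
toy_prescriptions = [
    ["drug_a", "drug_b", "drug_c"],
    ["drug_a", "drug_b"],
    ["drug_b", "drug_d"],
]
frequency, cooccurrence = drug_statistics(toy_prescriptions)
```

Counts of this kind show which drugs are prescribed together, but they treat each drug as an opaque token, which is the limitation that motivates learning semantic vectors from the literature instead.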

Related Work
The Learning Framework
Literature acquisition
Experimental Results and Discussion
Conclusions