Abstract

Morbidity and mortality caused by infectious diseases rank first among all human illnesses. Many pathogenic mechanisms remain unclear, while misuse of antibiotics has led to the emergence of drug-resistant strains. Infectious diseases spread rapidly and pathogens mutate quickly, posing new threats to human health. However, with the increasing use of high-throughput screening of pathogen genomes, research based on big data mining and visualization analysis has gradually become a hot topic for studies of infectious disease prevention and control. In this paper, the framework was performed on four infectious pathogens (Fusobacterium, Streptococcus, Neisseria, and Streptococcus salivarius) through five functions: 1) genome annotation, 2) phylogeny analysis based on core genome, 3) analysis of structure differences between genomes, 4) prediction of virulence genes/factors with their pathogenic mechanisms, and 5) prediction of resistance genes/factors with their signaling pathways. The experiments were carried out from three angles: phylogeny (macro perspective), structure differences of genomes (micro perspective), and virulence and drug-resistance characteristics (prediction perspective). Therefore, the framework can not only provide evidence to support the rapid identification of new or unknown pathogens and thus plays a role in the prevention and control of infectious diseases, but also help to recommend the most appropriate strains for clinical and scientific research. This paper presented a new genome information visualization analysis process framework based on big data mining technology with the accommodation of the depth and breadth of pathogens in molecular level research.

Highlights

  • MATERIALS AND METHODSData published by the World Health Organization shows that the incidence and mortality of human infectious diseases rank highest among all human illness (Gilmour et al, 2013)

  • Characteristics of virulence and drug-resistance are closely associated with genomic differences

  • This paper presents a novel genome information visualization analysis framework based on big data mining technology

Read more

Summary

Introduction

MATERIALS AND METHODSData published by the World Health Organization shows that the incidence and mortality of human infectious diseases rank highest among all human illness (Gilmour et al, 2013). The emergence of drug-resistant bacterial strains poses new threats to human health, while the rapid mutation of pathogens is a huge obstacle for design and long term efficacy of vaccines. The emergence of drug-resistant Tuberculosis bacilli (TB) has allowed previously controlled TB infections to become rampant around the world (Cole, 2002). With the maturation of sequencing technology, large-scale sequencing and even whole-genome sequencing of pathogens has become an important method for research, prevention and control of infectious pathogens. Information visualization has increasingly become an important research direction but remains a challenge for the prevention and control of infectious diseases (Mao Ping and Wang, 2017; Kan et al, 2018; Chen et al, 2020b)

Objectives
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call