Abstract

In biological research, scientists often need to use the information of the species to infer the evolutionary relationship among them. The evolutionary relationships are generally represented by a labeled binary tree, called the evolutionary tree (or phylogenetic tree). The phylogeny problem is computationally intensive, and thus it is suitable for parallel computing environment. In this paper, a fast algorithm for constructing Neighbor-Joining phylogenetic trees has been developed. The CPU time is drastically reduced as compared with sequential algorithms. The new algorithm includes three techniques: Firstly, a linear array A[N] is introduced to store the sum of every row of the distance matrix (the same as SK), which can eliminate many repeated (redundancy) computations, and the value of A[i] are computed only once at the beginning of the algorithm, and are updated by three elements in the iteration. Secondly, a very compact formula for the sum of all the branch lengths of OTUs (Operational Taxonomic Units) i and j has been designed. Thirdly, multiple parallel threads are used for computation of nearest neighboring pair.

Highlights

  • An understanding of evolutionary relationships is at the heart of modern pharmaceutical research for drug discovery, and is the basis for the design of genetically enhanced organisms

  • Evolutionary history is typically represented by an evolutionary tree [13,14,15]

  • An evolutionary tree is a leaf-labeled binary tree which tracks the genetic similarities of a set of closely related species

Read more

Summary

INTRODUCTION

An understanding of evolutionary relationships is at the heart of modern pharmaceutical research for drug discovery, and is the basis for the design of genetically enhanced organisms. The task of constructing the evolutionary tree for a set of species is known as the phylogeny problem. NJ method is very popular especially in multiple sequence alignment (MSA) because it is applicable to any type of evolutionary distance data. The most popular MSA tool CLUSTALW [7] uses NJ algorithm for constructing phylogenetic trees and the progressive alignment. For a part of this reason, MUSCLE and MAFFT use UPGMA [11] to construct phylogenetic tree since UPGMA reduces the time complexity to O(N3).

SEQUENTIAL NJ ALGORITHM
TIME COMPLEXITY ANALYSIS OF NJ ALGORITHM
PARALLEL FAST ALGORITHM FOR CONSTRUCTING NJ TREE
EXPERIMENT RESULTS:
CONCLUSIONS
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.