Abstract

This paper presents an approach for classifying sequence datasets using the EGSP (Enhanced Generalized Sequential Pattern) algorithm. EGSP yields a prediction model for sequence data in the gene and protein domain. The method first generalizes both supervised and semi-supervised datasets. Generalization produces candidate sequences, from which distinct components can be extracted. The candidate sequences are pruned against a threshold value, and EGSP is then applied to the pruned sequences to extract sequential patterns. The extracted patterns are clustered using a gene clustering algorithm, and the Modified Naïve Bayes Classification (MNBC) algorithm computes the probabilistic components for each class. The accuracy obtained is reported to be higher than that of traditional classification algorithms. The resulting classification supports prediction methods for the selected domain and its applications, and the computational cost is substantially lower than that of existing methods.
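
The pipeline described above (threshold-based pruning of candidate sequences, pattern extraction, per-class probability estimation) can be illustrated with a minimal sketch. It is not the EGSP or MNBC algorithm from the paper, whose details the abstract does not give. It assumes a plain GSP-style level-wise miner over single-character items and an ordinary Bernoulli Naïve Bayes with Laplace smoothing, and it omits the clustering step. All function names, the toy sequences, and the parameters `min_support` and `max_len` are hypothetical.

```python
from collections import defaultdict
from math import log


def is_subsequence(pattern, seq):
    """True if the items of pattern occur in seq in order (gaps allowed)."""
    it = iter(seq)
    return all(ch in it for ch in pattern)


def support(pattern, sequences):
    """Fraction of sequences that contain pattern as a subsequence."""
    return sum(is_subsequence(pattern, s) for s in sequences) / len(sequences)


def mine_patterns(sequences, min_support, max_len=3):
    """Level-wise mining: keep candidates whose support meets the threshold."""
    alphabet = sorted(set("".join(sequences)))
    frequent = [a for a in alphabet if support(a, sequences) >= min_support]
    result = list(frequent)
    length = 1
    while frequent and length < max_len:
        # GSP-style join: p and q merge when p minus its first item
        # equals q minus its last item.
        candidates = {p + q[-1] for p in frequent for q in frequent
                      if p[1:] == q[:-1]}
        frequent = sorted(c for c in candidates
                          if support(c, sequences) >= min_support)
        result.extend(frequent)
        length += 1
    return result


def train_nb(sequences, labels, patterns):
    """Estimate a class prior and smoothed P(pattern present | class)."""
    by_class = defaultdict(list)
    for s, y in zip(sequences, labels):
        by_class[y].append(s)
    model = {}
    for y, seqs in by_class.items():
        prior = len(seqs) / len(sequences)
        probs = {p: (sum(is_subsequence(p, s) for s in seqs) + 1)
                    / (len(seqs) + 2)
                 for p in patterns}
        model[y] = (prior, probs)
    return model


def predict(model, patterns, seq):
    """Return the class with the highest log-posterior score."""
    def score(y):
        prior, probs = model[y]
        total = log(prior)
        for p in patterns:
            present = is_subsequence(p, seq)
            total += log(probs[p] if present else 1 - probs[p])
        return total
    return max(model, key=score)


# Toy data (invented for illustration, not taken from the paper).
sequences = ["ACGT", "ACGA", "ACCT", "TTGA", "TGGA", "TTCA"]
labels = ["x", "x", "x", "y", "y", "y"]

patterns = mine_patterns(sequences, min_support=0.5)
model = train_nb(sequences, labels, patterns)
print(patterns)
print(predict(model, patterns, "ACGG"), predict(model, patterns, "TTGA"))
```

Raising `min_support` prunes more candidates at each level, which is where a threshold-driven approach of this kind saves computation: fewer surviving patterns means fewer joins and fewer features for the classifier.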
