Abstract

The objective of this paper is to design and implement machine learning based ensemble algorithm on dataset to fit into the models that can be understood and executed by machines. In this paper we discussed different algorithms and machine learning concepts that can be implemented on the datasets, we taken email spam filter dataset for experiment and analysis, as the Advanced persistent threat the latest threat is intruded using the emails and major intrusion is done through spam emails. Machine learning uses different datamining techniques and mechanisms and accepts the input-data and gives the output as the statistical analysis. We implemented different email classification algorithms on the datasets bsed on spam and ham emails where spear phishing methods are identified and implemented different classification and regression methods to get the accurate results. In this paper for the better results in spite of existing algorithms we introduced the ensemble methods such as boosting, bagging, stacking and voting for much accuracy and higher level of classification and combining different algorithm. This paper will measure different machine learning algorithms performance on spam email filtering on the huge datasets. The framework provides implementation of learning algorithms that you can apply to larger datasets. An obvious approach to making decisions more reliable is to combine the output of different models. We even compared the existing algorithms and proposed algorithm; comparison tables are drawn along with the statistical analysis, data and graphical analysis is given.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call