Standard Process For Data Mining Research Articles

Network Intrusion Detection System (NIDS) is an important part of Cyber safety and security. It plays a key role in all networked ICT systems in detecting rampant attacks such as Denial of Service (DoS) and ransom ware attacks. Existing methods are inadequate in terms of accuracy detection of attacks. However, the requirement for high accuracy detection of attacks using Deep Neural Network requires expensive computing resources which in turn make most organisations, and individuals shy away from it. This study therefore aims at designing a predictive model for network intrusion detection using deep neural networks with very limited computing resources. The study adopted Cross Industry Standard Process for Data Mining (CRISP-DM) as one of the formal methodologies and python was used for both testing and training, using crucial parameters such as the learning rate, number of epochs, neurons and hidden layers which greatly determined the accuracy level of the DNN algorithm. These parameters were experimented with values that are lesser compared to previous studies, training and evaluation were also done on the KDD99 data-set. The varying values of accuracy obtained from this study on four models with different numbers of layers of 50-epochs and learning rate of 0.01 achieved competitive results in comparison with the previous research of 100-1000 epochs and learning rate of 0.1. Therefore, the model with two layers attained same accuracy of 0.955 as the model with three layers from the previous study out of the four models tested in this study. Also, the models with three and four layers in this study attained an accuracy of 0.956, which is 0.001greater than the previous study's models. Keywords: Network-Based IDS, Host-Based IDS, Deep Neural Network, Denial of Service, Knowledge Discovery Dataset

Read full abstract

ObjectivesTo develop and to propose a machine learning model for predicting glaucoma and identifying its risk factors.MethodData analysis pipeline is designed for this study based on Cross-Industry Standard Process for Data Mining (CRISP-DM) methodology. The main steps of the pipeline include data sampling, preprocessing, classification and evaluation and validation. Data sampling for providing the training dataset was performed with balanced sampling based on over-sampling and under-sampling methods. Data preprocessing steps were missing value imputation and normalization. For classification step, several machine learning models were designed for predicting glaucoma including Decision Trees (DTs), K-Nearest Neighbors (K-NN), Support Vector Machines (SVM), Random Forests (RFs), Extra Trees (ETs) and Bagging Ensemble methods. Moreover, in the classification step, a novel stacking ensemble model is designed and proposed using the superior classifiers.ResultsThe data were from Shahroud Eye Cohort Study including demographic and ophthalmology data for 5190 participants aged 40-64 living in Shahroud, northeast Iran. The main variables considered in this dataset were 67 demographics, ophthalmologic, optometric, perimetry, and biometry features for 4561 people, including 4474 non-glaucoma participants and 87 glaucoma patients. Experimental results show that DTs and RFs trained based on under-sampling of the training dataset have superior performance for predicting glaucoma than the compared single classifiers and bagging ensemble methods with the average accuracy of 87.61 and 88.87, the sensitivity of 73.80 and 72.35, specificity of 87.88 and 89.10 and area under the curve (AUC) of 91.04 and 94.53, respectively. The proposed stacking ensemble has an average accuracy of 83.56, a sensitivity of 82.21, a specificity of 81.32, and an AUC of 88.54.ConclusionsIn this study, a machine learning model is proposed and developed to predict glaucoma disease among persons aged 40-64. Top predictors in this study considered features for discriminating and predicting non-glaucoma persons from glaucoma patients include the number of the visual field detect on perimetry, vertical cup to disk ratio, white to white diameter, systolic blood pressure, pupil barycenter on Y coordinate, age, and axial length.

Read full abstract

Standard Process For Data Mining Research Articles

Related Topics

Articles published on Standard Process For Data Mining

Application of CRISP-DM methodology for managing human-wildlife conflicts: an empirical case study in India

Use of a business intelligence framework in the management of the quality of the electricity supply in small and medium-sized companies

Knowledge Discovery in Engineering Applications Using Machine Learning Techniques

Overcoming the pitfalls and perils of algorithms: A classification of machine learning biases and mitigation methods

Decision support system for fish quarantine measures in Indonesia

Modeling scientometric indicators using a statistical data ontology

A Predictive Model for Network Intrusion Detection System Using Deep Neural Network

Implementasi Algoritma Naïve Bayes Classifier untuk Mendeteksi Berita Palsu pada Sosial Media

DEMONSTRATING HOW A HIGH- GROWTH FRAMEWORK COULD BE USED TO ASSIST A SOCIAL ENTERPRISE TO IDENTIFY GROWTH FACTORS AND IMPROVE SUSTAINABILITY

Machine Learning Models for Predicting Financially Vigilant Low-Income Households

The Sustainability Data Science Life Cycle for automating multi-purpose LCA workflows for the analysis of large product portfolios

Penerapan Algoritma C4.5 Pada Imbalanced Dataset Untuk Memprediksi Kegagalan Angsuran Properti

The use of artificial neural networks and big data infrastructure for predictive analytics in solar energy

Quality Improvement of NH1X36B Pre-Printed Box with QM-CRISP DM Approach at PT X

Development of glaucoma predictive model and risk factors assessment based on supervised models

Artificial intelligence for last-mile logistics - Procedures and architecture

A data mining-based cross-industry process for predicting major bleeding in mechanical circulatory support.

Implementasi Metode Moving Average Sebagai Prediksi Penjualan Perlengkapan Pertanian Pada CV. Aneka Tani

A Study on Singapore’s Ageing Population in the Context of Eldercare Initiatives Using Machine Learning Algorithms

Automated Business Goal Extraction from E-mail Repositories to Bootstrap Business Understanding

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Standard Process For Data Mining Research Articles

Related Topics

Articles published on Standard Process For Data Mining

Application of CRISP-DM methodology for managing human-wildlife conflicts: an empirical case study in India

Use of a business intelligence framework in the management of the quality of the electricity supply in small and medium-sized companies

Knowledge Discovery in Engineering Applications Using Machine Learning Techniques

Overcoming the pitfalls and perils of algorithms: A classification of machine learning biases and mitigation methods

Decision support system for fish quarantine measures in Indonesia

Modeling scientometric indicators using a statistical data ontology

A Predictive Model for Network Intrusion Detection System Using Deep Neural Network

Implementasi Algoritma Naïve Bayes Classifier untuk Mendeteksi Berita Palsu pada Sosial Media

DEMONSTRATING HOW A HIGH- GROWTH FRAMEWORK COULD BE USED TO ASSIST A SOCIAL ENTERPRISE TO IDENTIFY GROWTH FACTORS AND IMPROVE SUSTAINABILITY

Machine Learning Models for Predicting Financially Vigilant Low-Income Households

The Sustainability Data Science Life Cycle for automating multi-purpose LCA workflows for the analysis of large product portfolios

Penerapan Algoritma C4.5 Pada Imbalanced Dataset Untuk Memprediksi Kegagalan Angsuran Properti

The use of artificial neural networks and big data infrastructure for predictive analytics in solar energy

Quality Improvement of NH1X36B Pre-Printed Box with QM-CRISP DM Approach at PT X

Development of glaucoma predictive model and risk factors assessment based on supervised models

Artificial intelligence for last-mile logistics - Procedures and architecture

A data mining-based cross-industry process for predicting major bleeding in mechanical circulatory support.

Implementasi Metode Moving Average Sebagai Prediksi Penjualan Perlengkapan Pertanian Pada CV. Aneka Tani

A Study on Singapore’s Ageing Population in the Context of Eldercare Initiatives Using Machine Learning Algorithms

Automated Business Goal Extraction from E-mail Repositories to Bootstrap Business Understanding