Binary Classification Datasets Research Articles

Multi-class classification is one of the major challenges in machine learning and an ongoing research issue. Classification algorithms are generally binary, but they must be extended to multi-class problems for real-world application. Multi-class classification is more complex than binary classification. In binary classification, only the decision boundaries of one class are to be known, whereas in multiclass classification, several boundaries are involved. The objective of this investigation is to propose a metaheuristic, optimized, multi-level classification learning system for forecasting in civil and construction engineering. The proposed system integrates the firefly algorithm (FA), metaheuristic intelligence, decomposition approaches, the one-against-one (OAO) method, and the least squares support vector machine (LSSVM). The enhanced FA automatically fine-tunes the hyperparameters of the LSSVM to construct an optimized LSSVM classification model. Ten benchmark functions are used to evaluate the performance of the enhanced optimization algorithm. Two binary-class datasets related to geotechnical engineering, concerning seismic bumps and soil liquefaction, are then used to clarify the application of the proposed system to binary problems. Further, this investigation uses multi-class cases in civil engineering and construction management to verify the effectiveness of the model in the diagnosis of faults in steel plates, quality of water in a reservoir, and determining urban land cover. The results reveal that the system predicts faults in steel plates with an accuracy of 91.085%, the quality of water in a reservoir with an accuracy of 93.650%, and urban land cover with an accuracy of 87.274%. To demonstrate the effectiveness of the proposed system, its predictive accuracy is compared with that of a non-optimized baseline model, single multi-class classification algorithms (sequential minimal optimization (SMO), the Multiclass Classifier, the Naïve Bayes, the library support vector machine (LibSVM) and logistic regression) and prior studies. The analytical results show that the proposed system is promising project analytics software to help decision makers solve multi-level classification problems in engineering applications.

Read full abstract

The responses of plants to climate change are typically reflected in the changes in leaf and flowering phenology. By exploiting the strength and simplicity of repeated digital photography and color indices, a majority of the phenological studies have been successful at investigating leaf phenology, while flowering phenology is rarely studied using the automatic capture and analysis of repeated photography. In this study, we trained and tested 5 different pretrained Convolutional Neural Network (CNN) algorithms to detect flowering events from images of white colored flowering trees and analyzed the possible factors that can affect the performance of the models. We collected images from the web and processed the images into a binary classification dataset in which a positive label indicated a tree in bloom. We also installed time-lapse cameras and captured images to validate the performances of the models in the real-world. Regarding the CNN architectures, the VGG16, ResNet50, ResNet101, MobileNet, and NASNet models were adopted, and the model weights were pretrained using the ImageNet-1000 dataset. After 20 epochs of training with 16,005 images, all of the models were successfully trained, reaching over 98% test accuracy, and 4 models reached over 99% test accuracy. All the models also showed accurate and stable performances in detecting flowering in time-series datasets with a minor inconstancy at the beginning of the flowering stages. Overall, the NASNet model showed the best performance in both the test dataset and the time-series datasets. A detailed analysis of the performance revealed that the models were especially prone to misclassify images with small relative flowering areas and were affected by the number of samples in the training dataset. We concluded that the preprocessing of the images and the size of the training dataset are essential for the high performance of the models compared to the architecture of the individual models. Furthermore, in addition to the need for a larger dataset, the proper resolution is required to successfully detect flowering from repeated photography, and most current phenological networks do not meet this condition. We suggest that mid-range photography combined with CNN algorithms can be a legitimate approach to properly accumulate and automatically process the data for studying flowering phenology.

Read full abstract

Binary Classification Datasets Research Articles

Related Topics

Articles published on Binary Classification Datasets

An Investigation of SMOTE based Methods for Imbalanced Datasets with Data Complexity Analysis

MiNB: Minority Sensitive Naïve Bayesian Algorithm for Multi-Class Classification of Unbalanced Data

Online News Sentiment Classification Using DistilBERT

TwoClsBalancer: Sınıf Dengesizliği Problemi İçin Makine Öğrenmesine Dayalı Etkileşimli Bir Web Uygulaması

On Using Classification Datasets to Evaluate Graph Outlier Detection: Peculiar Observations and New Insights.

Intrusion Detection Model Based on TF.IDF and C4.5 Algorithms

Deep/Transfer Learning with Feature Space Ensemble Networks (FeatSpaceEnsNets) and Average Ensemble Networks (AvgEnsNets) for Change Detection Using DInSAR Sentinel-1 and Optical Sentinel-2 Satellite Data Fusion

Evolutionary multiple instance boosting framework for weakly supervised learning

Metaheuristic Optimized Multi-Level Classification Learning System for Engineering Management

Feature Selection on Elite Hybrid Binary Cuckoo Search in Binary Label Classification.

Minimum class variance class-specific extreme learning machine for imbalanced classification

AdaDT: An adaptive decision tree for addressing local class imbalance based on multiple split criteria

Utilizing machine learning for detecting flowering in mid-range digital repeat photography

Learning a Deep Similarity Network for Hyperspectral Image Classification

DTO-SMOTE: Delaunay Tessellation Oversampling for Imbalanced Data Sets

Hyperparameter tuning methods in automated machine learning

An ADMM Based Framework for AutoML Pipeline Configuration

Reduced Dilation-Erosion Perceptron for Binary Classification

Improving Classification Accuracy Using Hybrid of Extreme Learning Machine and Artificial Algae Algorithm with Multi-Light Source

A NSGA2-LR wrapper approach for feature selection in network intrusion detection

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Binary Classification Datasets Research Articles

Related Topics

Articles published on Binary Classification Datasets

An Investigation of SMOTE based Methods for Imbalanced Datasets with Data Complexity Analysis

MiNB: Minority Sensitive Naïve Bayesian Algorithm for Multi-Class Classification of Unbalanced Data

Online News Sentiment Classification Using DistilBERT

TwoClsBalancer: Sınıf Dengesizliği Problemi İçin Makine Öğrenmesine Dayalı Etkileşimli Bir Web Uygulaması

On Using Classification Datasets to Evaluate Graph Outlier Detection: Peculiar Observations and New Insights.

Intrusion Detection Model Based on TF.IDF and C4.5 Algorithms

Deep/Transfer Learning with Feature Space Ensemble Networks (FeatSpaceEnsNets) and Average Ensemble Networks (AvgEnsNets) for Change Detection Using DInSAR Sentinel-1 and Optical Sentinel-2 Satellite Data Fusion

Evolutionary multiple instance boosting framework for weakly supervised learning

Metaheuristic Optimized Multi-Level Classification Learning System for Engineering Management

Feature Selection on Elite Hybrid Binary Cuckoo Search in Binary Label Classification.

Minimum class variance class-specific extreme learning machine for imbalanced classification

AdaDT: An adaptive decision tree for addressing local class imbalance based on multiple split criteria

Utilizing machine learning for detecting flowering in mid-range digital repeat photography

Learning a Deep Similarity Network for Hyperspectral Image Classification

DTO-SMOTE: Delaunay Tessellation Oversampling for Imbalanced Data Sets

Hyperparameter tuning methods in automated machine learning

An ADMM Based Framework for AutoML Pipeline Configuration

Reduced Dilation-Erosion Perceptron for Binary Classification

Improving Classification Accuracy Using Hybrid of Extreme Learning Machine and Artificial Algae Algorithm with Multi-Light Source

A NSGA2-LR wrapper approach for feature selection in network intrusion detection