Binary Classification Problem Research Articles

Unveiling the source of an image is one of the most effective ways to validate originality, authenticity, and reliability in digital forensics. Source camera device identification determines the specific camera device used to take a photo under investigation. While photo-response non-uniformity (PRNU)-based methods have made great progress over the past decade, instance-level source camera device linking, which verifies whether two images in question were captured by the same camera device, remains a significant challenge. The difficulty is mainly due to the absence of auxiliary images from which to construct a clean camera fingerprint for each camera, particularly when dealing with small image sizes. To overcome this limitation, we formulate source device linking as a binary classification problem and propose a simple yet effective framework based on a context-aware deep Siamese network. The Siamese architecture extracts the intrinsic camera device-related noise patterns from a pair of image patches in parallel for comparison, without any auxiliary images. Moreover, a recurrent criss-cross group aggregates contextual information in the noise residual maps, alleviating the problem that PRNU noise maps are easily contaminated by additive noise from image content. For reliable device linking, we apply a patch-selection strategy to a pair of test images to adaptively choose suitable image patch pairs according to image content. The final decision for a pair of test images is obtained from the average similarity score of the selected image patch pairs. Compared with existing state-of-the-art methods, the proposed framework achieves better performance on both source camera identification and source device linking without any prior knowledge (i.e., reliable camera fingerprints), regardless of whether the camera devices are “seen” or “unseen” in the training stage. The experimental results on two standard image forensic datasets demonstrate that the proposed method is not only robust to different image patch sizes and image quality degradations, but also generalizes across digital camera and smartphone devices.
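The decision rule described in this abstract (averaging the similarity scores of the selected patch pairs) can be sketched roughly as follows. This is only an illustration, not the authors' implementation: the function name `link_decision`, the assumption that scores lie in [0, 1], and the 0.5 threshold are all hypothetical, and the Siamese network and patch-selection step that would produce the scores are not shown.

```python
from statistics import mean


def link_decision(patch_scores, threshold=0.5):
    """Decide whether two images come from the same device.

    patch_scores: similarity scores for the selected patch pairs,
                  assumed here to lie in [0, 1] (hypothetical convention).
    threshold:    hypothetical cut-off; the abstract does not state one.
    Returns (average_score, same_device).
    """
    if not patch_scores:
        raise ValueError("at least one patch-pair score is required")
    avg = mean(patch_scores)
    # The image-level decision is taken on the average, not on any single patch.
    return avg, avg >= threshold


if __name__ == "__main__":
    avg, same = link_decision([0.9, 0.7, 0.8])
    print(f"average similarity = {avg:.2f}, same device = {same}")
```

Averaging over several patch pairs means a single patch dominated by image content has limited influence on the final decision.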


Background: In medical device validation and verification studies, the area under the receiver operating characteristic curve (AUROC) is often used as a primary endpoint despite multiple reports showing its limitations. Hence, researchers are encouraged to consider alternative metrics as primary endpoints. A new metric called G4 is presented, which is the geometric mean of sensitivity, specificity, the positive predictive value, and the negative predictive value. G4 is part of a balanced metric family which includes the Unified Performance Measure (also known as P4) and the Matthews’ Correlation Coefficient (MCC). The purpose of this manuscript is to unveil the benefits of using G4 together with the balanced metric family when analyzing the overall performance of binary classifiers.

Results: Simulated datasets encompassing different prevalence rates of the minority class were analyzed under a multi-reader-multi-case study design. In addition, data from an independently published study that tested the performance of a unique ultrasound artificial intelligence algorithm in the context of breast cancer detection was also considered. Within each dataset, AUROC was reported alongside the balanced metric family for comparison. When the dataset prevalence and bias of the minority class approached 50%, all three balanced metrics provided equivalent interpretations of an AI’s performance. As the prevalence rate increased / decreased and the data became more imbalanced, AUROC tended to overvalue / undervalue the true classifier performance, while the balanced metric family was resistant to such imbalance. Under certain circumstances where data imbalance was strong (minority-class prevalence < 10%), MCC was preferred for standalone assessments while P4 provided a stronger effect size when evaluating between-groups analyses. G4 acted as a middle ground for maximizing both standalone assessments and between-groups analyses.

Conclusions: Use of AUROC as the primary endpoint in binary classification problems provides misleading results as the dataset becomes more imbalanced. This is explicitly noticed when incorporating AUROC in medical device validation and verification studies. G4, P4, and MCC do not share this limitation and paint a more complete picture of a medical device’s performance in a clinical setting. Therefore, researchers are encouraged to explore the balanced metric family when evaluating binary classification problems.
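As a rough illustration of the balanced metric family, the sketch below computes G4, P4, and MCC from confusion-matrix counts. G4 follows the definition given in the abstract (geometric mean of sensitivity, specificity, PPV, and NPV). The abstract does not define P4 or MCC; the code assumes their commonly used definitions (P4 as the harmonic mean of the same four quantities, MCC as the usual correlation formula). The function name `balanced_metrics` and the example counts are hypothetical.

```python
import math


def balanced_metrics(tp, fp, tn, fn):
    """Return G4, P4 and MCC for one 2x2 confusion matrix.

    Assumes every row and column total is non-zero, so that
    sensitivity, specificity, PPV and NPV are all defined.
    """
    if min(tp + fn, tn + fp, tp + fp, tn + fn) == 0:
        raise ValueError("a row or column of the confusion matrix is empty")

    sens = tp / (tp + fn)  # sensitivity (recall)
    spec = tn / (tn + fp)  # specificity
    ppv = tp / (tp + fp)   # positive predictive value (precision)
    npv = tn / (tn + fn)   # negative predictive value
    parts = [sens, spec, ppv, npv]

    # G4: geometric mean of the four quantities (as defined in the abstract).
    g4 = math.prod(parts) ** 0.25

    # P4: harmonic mean of the same four quantities (assumed standard
    # definition); taken as 0 when any component is 0.
    p4 = 0.0 if 0.0 in parts else 4.0 / sum(1.0 / p for p in parts)

    # MCC: standard Matthews correlation coefficient, ranging over [-1, 1].
    mcc = (tp * tn - fp * fn) / math.sqrt(
        (tp + fp) * (tp + fn) * (tn + fp) * (tn + fn)
    )
    return {"G4": g4, "P4": p4, "MCC": mcc}


if __name__ == "__main__":
    # Hypothetical balanced example: 50 positives, 50 negatives.
    print(balanced_metrics(tp=40, fp=10, tn=40, fn=10))
```

Note that G4 and P4 lie in [0, 1] while MCC lies in [-1, 1], so the three values are not directly comparable on a single scale without rescaling.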


Articles published on Binary Classification Problem

Complementary to Multiple Labels: A Correlation-Aware Correction Approach.

Data-Driven Insights Into Post-Earthquake Reconnaissance Findings: 2023 Türkiye Earthquake Sequence

On most informative regions for binary classification of schizophrenia based on resting state fMRI data done by selection of functionally homogeneous regions method

Predicting Employee Turnover in the Financial Company: A Comparative Study of CatBoost and XGBoost Models

Use of ResNets for HLB Disease Detection on Orange Leaves Using Terrestrial Multispectral Images

A new binary classifier robust on noisy domains based on kNN algorithm

Machine learning approach for detection of land subsidence induced by underground coal fire using multi-sensor satellite data

Financial Fraud Detection Study - Based on Logit Model

Automated Recognition of Submerged Body-like Objects in Sonar Images Using Convolutional Neural Networks

Unveiling image source: Instance-level camera device linking via context-aware deep Siamese network

OUCH: Oversampling and Undersampling Cannot Help Improve Accuracy in Our Bayesian Classifiers That Predict Preeclampsia

Enhancing Machine Learning Models Through PCA, SMOTE-ENN, and Stochastic Weighted Averaging

G4 & the balanced metric family – a novel approach to solving binary classification problems in medical device validation & verification studies

Automatic noise detection for ambulatory electrocardiogram in presence of ventricular arrhythmias through a machine learning approach

Enhanced QSVM with elitist non‐dominated sorting genetic optimisation algorithm for breast cancer diagnosis

LesionNet: an automated approach for skin lesion classification using SIFT features with customized convolutional neural network.

An Efficient Detection Mechanism of Network Intrusions in IoT Environments Using Autoencoder and Data Partitioning

RFMiD: Retinal Image Analysis for multi-Disease Detection challenge

TC-DTA: Predicting Drug-Target Binding Affinity With Transformer and Convolutional Neural Networks.

The ROC Curve Examination on Traveling Ionospheric Disturbances in FORMOSAT‐7/COSMIC‐2 IVM Ion Density Triggered by the 15 January 2022 Tonga Volcanic Eruption
