Generalizable Model Research Articles

Modern deep learning training procedures rely on model regularization techniques such as data augmentation methods, which generate training samples that increase the diversity of data and richness of label information. A popular recent method, mixup, uses convex combinations of pairs of original samples to generate new samples. However, as we show in our experiments, mixup can produce undesirable synthetic samples, where the data is sampled off the manifold and can contain incorrect labels. We propose ζ\\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{mathrsfs} \\usepackage{upgreek} \\setlength{\\oddsidemargin}{-69pt} \\begin{document}$$\\zeta $$\\end{document}-mixup, a generalization of mixup with provably and demonstrably desirable properties that allows convex combinations of T≥2\\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{mathrsfs} \\usepackage{upgreek} \\setlength{\\oddsidemargin}{-69pt} \\begin{document}$${T} \\ge 2$$\\end{document} samples, leading to more realistic and diverse outputs that incorporate information from T\\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{mathrsfs} \\usepackage{upgreek} \\setlength{\\oddsidemargin}{-69pt} \\begin{document}$${T}$$\\end{document} original samples by using a p-series interpolant. We show that, compared to mixup, ζ\\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{mathrsfs} \\usepackage{upgreek} \\setlength{\\oddsidemargin}{-69pt} \\begin{document}$$\\zeta $$\\end{document}-mixup better preserves the intrinsic dimensionality of the original datasets, which is a desirable property for training generalizable models. Furthermore, we show that our implementation of ζ\\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{mathrsfs} \\usepackage{upgreek} \\setlength{\\oddsidemargin}{-69pt} \\begin{document}$$\\zeta $$\\end{document}-mixup is faster than mixup, and extensive evaluation on controlled synthetic and 26 diverse real-world natural and medical image classification datasets shows that ζ\\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{mathrsfs} \\usepackage{upgreek} \\setlength{\\oddsidemargin}{-69pt} \\begin{document}$$\\zeta $$\\end{document}-mixup outperforms mixup, CutMix, and traditional data augmentation techniques. The code will be released at https://github.com/kakumarabhishek/zeta-mixup.

Read full abstract

Background and objectiveDeep Learning models have emerged as a significant tool in generating efficient solutions for complex problems including cancer detection, as they can analyze large amounts of data with high efficiency and performance. Recent medical studies highlight the significance of molecular subtype detection in breast cancer, aiding the development of personalized treatment plans as different subtypes of cancer respond better to different therapies. MethodsIn this work, we propose a novel lightweight dual-channel attention-based deep learning model MOB-CBAM that utilizes the backbone of MobileNet-V3 architecture with a Convolutional Block Attention Module to make highly accurate and precise predictions about breast cancer. We used the CMMD mammogram dataset to evaluate the proposed model in our study. Nine distinct data subsets were created from the original dataset to perform coarse and fine-grained predictions, enabling it to identify masses, calcifications, benign, malignant tumors and molecular subtypes of cancer, including Luminal A, Luminal B, HER-2 Positive, and Triple Negative. The pipeline incorporates several image pre-processing techniques, including filtering, enhancement, and normalization, for enhancing the model's generalization ability. ResultsWhile identifying benign versus malignant tumors, i.e., coarse-grained classification, the MOB-CBAM model produced exceptional results with 99 % accuracy, precision, recall, and F1-score values of 0.99 and MCC of 0.98. In terms of fine-grained classification, the MOB-CBAM model has proven to be highly efficient in accurately identifying mass with (benign/malignant) and calcification with (benign/malignant) classification tasks with an impressive accuracy rate of 98 %. We have also cross-validated the efficiency of the proposed MOB-CBAM deep learning architecture on two datasets: MIAS and CBIS-DDSM. On the MIAS dataset, an accuracy of 97 % was reported for the task of classifying benign, malignant, and normal images, while on the CBIS-DDSM dataset, an accuracy of 98 % was achieved for the classification of mass with either benign or malignant, and calcification with benign and malignant tumors. ConclusionThis study presents lightweight MOB-CBAM, a novel deep learning framework, to address breast cancer diagnosis and subtype prediction. The model's innovative incorporation of the CBAM enhances precise predictions. The extensive evaluation of the CMMD dataset and cross-validation on other datasets affirm the model's efficacy.

Read full abstract

Generalizable Model Research Articles

Related Topics

Articles published on Generalizable Model

Multi-Context Point Cloud Dataset and Machine Learning for Railway Semantic Segmentation

Meta-learners for few-shot weakly-supervised medical image segmentation

Trade‐Off Between Light Deprivation and Desiccation in Intertidal Seagrasses Due To Periodic Tidal Inundation and Exposure: Insights From a Data‐Calibrated Model

Lexical-Semantic Content, Not Syntactic Structure, Is the Main Contributor to ANN-Brain Similarity of fMRI Responses in the Language Network.

Effects of primary health care and socioeconomic aspects on the dispersion of COVID-19 in the Brazilian Northeast: Ecological study of the first pandemic wave.

Enhancing quality control in emulsion-type sausage production: Predicting chemical composition of intact samples with near infrared spectroscopy

A versatile, semi-automated image analysis workflow for time-lapse camera trap image classification

Likelihood-based generalization of Markov parameter estimation and multiple shooting objectives in system identification

Photonic data analysis in 2050

The common drivers of children and young people’s health and wellbeing across 13 local government areas: a systems view

Aligning open educational resources to new taxonomies: How AI technologies can help and in which scenarios

Vehicular Fuel Consumption and CO2 Emission Estimation Model Integrating Novel Driving Behavior Data Using Machine Learning

Gendered sustainability: Are public spaces designed for girls good for everyone?: Examining female participation as a strategy for inclusive public space

Integrating convolutional neural network and constitutive model for rapid prediction of stress-strain curves in fibre reinforced polymers: A generalisable approach

MOB-CBAM: A dual-channel attention-based deep learning generalizable model for breast cancer molecular subtypes prediction using mammograms

Weighing in on the average weights: Measuring corporate social performance (CSP) score using DEA

A Generalizable Decision-Making Framework for Selecting Onsite versus Send-out Clinical Laboratory Testing.

A pretrain-finetune approach for improving model generalizability in outcome prediction of acute respiratory distress syndrome patients

Optimising brain age estimation through transfer learning: A suite of pre-trained foundation models for improved performance and generalisability in a clinical setting.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Generalizable Model Research Articles

Related Topics

Articles published on Generalizable Model

Multi-Context Point Cloud Dataset and Machine Learning for Railway Semantic Segmentation

Meta-learners for few-shot weakly-supervised medical image segmentation

Trade‐Off Between Light Deprivation and Desiccation in Intertidal Seagrasses Due To Periodic Tidal Inundation and Exposure: Insights From a Data‐Calibrated Model

Lexical-Semantic Content, Not Syntactic Structure, Is the Main Contributor to ANN-Brain Similarity of fMRI Responses in the Language Network.

Effects of primary health care and socioeconomic aspects on the dispersion of COVID-19 in the Brazilian Northeast: Ecological study of the first pandemic wave.

Enhancing quality control in emulsion-type sausage production: Predicting chemical composition of intact samples with near infrared spectroscopy

A versatile, semi-automated image analysis workflow for time-lapse camera trap image classification

Likelihood-based generalization of Markov parameter estimation and multiple shooting objectives in system identification

Photonic data analysis in 2050

The common drivers of children and young people’s health and wellbeing across 13 local government areas: a systems view

Aligning open educational resources to new taxonomies: How AI technologies can help and in which scenarios

Vehicular Fuel Consumption and CO2 Emission Estimation Model Integrating Novel Driving Behavior Data Using Machine Learning

Gendered sustainability: Are public spaces designed for girls good for everyone?: Examining female participation as a strategy for inclusive public space

Integrating convolutional neural network and constitutive model for rapid prediction of stress-strain curves in fibre reinforced polymers: A generalisable approach

MOB-CBAM: A dual-channel attention-based deep learning generalizable model for breast cancer molecular subtypes prediction using mammograms

Weighing in on the average weights: Measuring corporate social performance (CSP) score using DEA

A Generalizable Decision-Making Framework for Selecting Onsite versus Send-out Clinical Laboratory Testing.

A pretrain-finetune approach for improving model generalizability in outcome prediction of acute respiratory distress syndrome patients

Optimising brain age estimation through transfer learning: A suite of pre-trained foundation models for improved performance and generalisability in a clinical setting.