Conventional machine learning methods assume that a central server holds the entire training dataset to train a global model. In many real-world situations, however, datasets are collected at individual local nodes in distributed environments, and gathering them is often infeasible owing to practical constraints such as data security and privacy concerns, storage limits, and transmission costs. When data distributions differ across local nodes, local experts trained at individual nodes generalize poorly to new instances. In this study, we propose a training-free method for constructing an ensemble of local experts without accessing the local datasets. For a query instance, we apply uncertainty quantification and out-of-distribution (OOD) detection to compute the uncertainty and OOD scores of each model's prediction. We then determine the model weights by assigning lower weights to models with higher uncertainty and OOD scores, and aggregate the individual predictions with these weights to obtain a final prediction. In contrast to existing methods that impose specific application requirements, the proposed method can be applied even when only the individual predictions of the local models are available at inference time. We evaluate the effectiveness of the proposed method on image classification benchmark datasets.
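For concreteness, the sketch below illustrates one way such a score-weighted aggregation could be implemented. It assumes predictive entropy as the uncertainty measure, the energy score (negative log-sum-exp of the logits) as the OOD score, and a softmin weighting with hypothetical trade-off coefficients alpha and beta; the abstract does not commit to these particular choices, so all three are illustrative assumptions.

```python
import numpy as np

def softmax(z):
    """Numerically stable softmax over a 1-D logit vector."""
    z = z - np.max(z)
    e = np.exp(z)
    return e / e.sum()

def entropy(p, eps=1e-12):
    """Predictive entropy (higher = more uncertain). Assumed uncertainty measure."""
    return -np.sum(p * np.log(p + eps))

def ood_score(logits):
    """Energy score: negative log-sum-exp of the logits (higher = more
    OOD-like). Assumed OOD detector, not necessarily the paper's choice."""
    m = np.max(logits)
    return -(m + np.log(np.sum(np.exp(logits - m))))

def ensemble_predict(logits_per_model, alpha=1.0, beta=1.0):
    """Weight each local expert's prediction for one query instance and
    aggregate: models with higher uncertainty/OOD scores receive lower
    weights. alpha and beta are hypothetical trade-off knobs."""
    probs = np.stack([softmax(z) for z in logits_per_model])
    u = np.array([entropy(p) for p in probs])               # uncertainty per model
    o = np.array([ood_score(z) for z in logits_per_model])  # OOD score per model
    s = alpha * u + beta * o
    w = np.exp(-(s - s.min()))   # softmin: lower combined score -> higher weight
    w /= w.sum()
    return w @ probs             # weighted average of class-probability vectors

# Example: 5 local experts, 10 classes, random logits standing in for real models.
rng = np.random.default_rng(0)
final_probs = ensemble_predict([rng.normal(size=10) for _ in range(5)])
print(final_probs.argmax(), final_probs.max())
```

Note that the sketch needs only each model's logits for the query instance, matching the setting in which no training data or model internals are shared.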