Efficient Sampling Scheme Research Articles

Ulcerative colitis is an idiopathic inflammatory disorder affecting the mucosa of the colon with superficial erosions and ulcers associated with bleeding. Severity assessment using current scoring schemes such as UCEIS and MAYO relies on the subjective interpretation of the physician and fails to take into account the size of the different lesions, their number and their distribution throughout the colon. Automatic lesion detection and grading procedures can enable fine-grained assessment of lesion severity for treatment follow-up. This work aims to learn automatic bleeding and ulcer lesion detectors. As such algorithms may need to be tuned to the characteristics of the equipment used to perform the colonoscopy, we train our detectors on a dataset obtained by the Gastroenterology group at the Bichat and Beaujon hospitals. The patients' videos were anonymized and analyzed after obtaining their consent. The study was approved by the local research study committee. To minimize expert annotation burden, only rectangular annotations are required instead of a precise delineation of lesion boundaries (Figure 1). However, the resulting dataset contains many mislabeled pixels, especially in the corners of the rectangles. This affects the evaluation of the models' performance and our ability to find correct models. Standard sensitivity and specificity cannot be used effectively on this dataset. We propose to evaluate model sensitivity at the annotation level and keep specificity at the pixel level. On the training set, we consider that a model correctly identifies a lesion if it agrees with the expert on a subset of the annotation, and count the detected annotations weighted by their area. For robustness and ease of interpretation, we explore the set of linear classifiers and propose an efficient sampling scheme that rejects trivial models. This method is evaluated on a database of 10 colonoscopy videos (5 training videos and 5 test videos).
In spite of the limited quality of the annotations, we find lesion detectors with good annotation-level sensitivity (93% specificity / 89% sensitivity for bleeding and 57% specificity / 83% sensitivity for ulcers) and visual performance (see Figure 1). The detector performance is computed reliably: we evaluated sensitivity and specificity on 20 random subsets containing 10% of the images, and obtained similar performance for the same patient for all models (cf. Figure 2). However, we observe that the inter-patient performance is variable and that the best models can fail on some patients. In Figure 2, the sensitivity is below 20% in some cases. This suggests that the models are not universal because bleeding and ulcers appear differently across patients, and that this should be corrected before automatic detection. Alternatively, models should be adapted to the patient characteristics.
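
The two evaluation measures described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the rectangle encoding `(row0, row1, col0, col1)`, and the `min_overlap` threshold are assumptions introduced here, since the abstract only says that a model must agree with the expert "on a subset of the annotation" without stating how large that subset is.

```python
import numpy as np

def annotation_sensitivity(pred, boxes, min_overlap=0.1):
    """Area-weighted share of annotated rectangles counted as detected.

    pred        : 2-D boolean array, True where the model predicts a lesion.
    boxes       : list of (row0, row1, col0, col1) half-open rectangles.
    min_overlap : hypothetical threshold (not from the source); fraction of
                  a rectangle the model must flag for it to count as detected.
    """
    pred = np.asarray(pred, dtype=bool)
    detected_area = 0
    total_area = 0
    for r0, r1, c0, c1 in boxes:
        area = (r1 - r0) * (c1 - c0)
        if area <= 0:
            continue  # skip degenerate rectangles
        total_area += area
        # Fraction of the rectangle that the model marks as lesion.
        if pred[r0:r1, c0:c1].mean() >= min_overlap:
            detected_area += area
    return detected_area / total_area if total_area else float("nan")

def pixel_specificity(pred, boxes):
    """Share of pixels outside every rectangle that are predicted non-lesion."""
    pred = np.asarray(pred, dtype=bool)
    outside = np.ones(pred.shape, dtype=bool)
    for r0, r1, c0, c1 in boxes:
        outside[r0:r1, c0:c1] = False
    n_negative = outside.sum()
    if n_negative == 0:
        return float("nan")
    return (~pred[outside]).sum() / n_negative
```

Weighting by area means a missed large rectangle costs more than a missed small one, while mislabeled corner pixels inside a rectangle do not penalize the model as long as enough of the rectangle is flagged.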

Abstract. Traditional laboratory methods for acquiring soil information remain important for assessing key soil properties, soil functions and ecosystem services over space and time. Infrared spectroscopic modeling can link and massively scale up these methods for many soil characteristics in a cost-effective and timely manner. In Switzerland, only 10 % to 15 % of agricultural soils have been mapped sufficiently to serve spatial decision support systems, presenting an urgent need for rapid quantitative soil characterization. The current Swiss soil spectral library (SSL; n = 4374) in the mid-infrared range includes soil samples from the Biodiversity Monitoring Program (BDM), arranged in a regularly spaced grid across Switzerland, and temporally resolved data from the Swiss Soil Monitoring Network (NABO). Given that less than 2 % of the samples in the SSL originate from organic soils, we aimed to develop both an efficient calibration sampling scheme and an accurate modeling strategy to estimate the soil carbon (SC) contents of heterogeneous samples between 0 and 2 m depth from 26 locations within two drained peatland regions (School of Agricultural, Forest and Food Sciences (HAFL) data set; n = 116). The focus was on minimizing the need for new reference analyses by efficiently mining the spectral information of the SSL. We used partial least squares regressions (PLSRs), together with five repetitions of a location-grouped, 10-fold cross-validation, to predict SC ranging from 1 % to 52 % in the local HAFL data set. We compared the validation performance of three calibration schemes: (1) local models; (2) models using the entire SSL combined with local samples, commonly referred to as spiking; and (3) subsets of local and SSL samples optimized for the peatland target sites using the resampling local (RS-LOCAL) algorithm.
Using local and RS-LOCAL calibrations with at least five local samples, we achieved similar validation results for predictions of SC up to 52 % (R² = 0.93 to 0.97; bias = -0.07 to 1.65; root mean square error (RMSE) = 2.71 % to 3.89 % total carbon; ratio of performance to deviation (RPD) = 3.38 to 4.86; and ratio of performance to interquartile range (RPIQ) = 4.93 to 7.09). However, calibrations using RS-LOCAL only required five or 10 local samples for very accurate models (RMSE = 3.16 % and 2.71 % total carbon, respectively), while purely local calibrations required 50 samples for similarly accurate results (RMSE < 3 % total carbon). Of the three approaches, the entire SSL spiked with local samples for model calibration led to validations with the lowest performance in terms of R², bias, RMSE, RPD and RPIQ. Hence, we show that a simple and comprehensible modeling approach, using RS-LOCAL together with an SSL, is an efficient and accurate strategy when using infrared spectroscopy. It decreases field and laboratory work, the bias of SSL spiking approaches and the uncertainty of local models. If adequately mined, the information in the SSL is sufficient to predict SC in new and independent study regions, even if the local soil characteristics are very different from the ones in the SSL. This will help to efficiently scale up the acquisition of quantitative soil information over space and time.
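
For readers unfamiliar with the validation statistics quoted in this abstract, the sketch below computes bias, RMSE, RPD and RPIQ using the definitions common in soil spectroscopy. These definitions are assumptions, not taken from the abstract: bias is taken as mean(predicted - observed), RPD as the sample standard deviation of the observations divided by RMSE, and RPIQ as the interquartile range of the observations divided by RMSE. The function name `validation_metrics` is hypothetical.

```python
import numpy as np

def validation_metrics(observed, predicted):
    """Return bias, RMSE, RPD and RPIQ for paired observations/predictions.

    Assumed conventions (the source abstract does not spell them out):
      bias = mean(predicted - observed)
      RPD  = sample standard deviation of observed / RMSE
      RPIQ = (Q3 - Q1) of observed / RMSE
    """
    obs = np.asarray(observed, dtype=float)
    pred = np.asarray(predicted, dtype=float)
    err = pred - obs
    rmse = np.sqrt(np.mean(err ** 2))
    q1, q3 = np.percentile(obs, [25, 75])
    return {
        "bias": err.mean(),
        "rmse": rmse,
        "rpd": obs.std(ddof=1) / rmse,
        "rpiq": (q3 - q1) / rmse,
    }
```

Both RPD and RPIQ scale the spread of the reference values by the prediction error, so larger values indicate better models; RPIQ is less sensitive to skewed distributions such as carbon contents spanning 1 % to 52 %.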

Articles published on Efficient Sampling Scheme

P015 Detection of endoscopic lesions from limited quality annotations in colonoscopy videos

AUTOMATIC BLEEDING AND ULCER DETECTION FROM LIMITED QUALITY ANNOTATIONS IN ULCERATIVE COLITIS

Cross-classes domain inference with network sampling for natural resource inventory

An Efficient Plane-Waves Superposition Method for Improved Spatial Correlation in Simulated Reverberation Chambers

Common Neighbors Matter: Fast Random Walk Sampling with Common Neighbor Awareness

Quasi-Monte Carlo Approximations for Exponentiated Quadratic Kernel in Latent Force Models

An improved fuzzy adaptive teaching learning-based optimisation algorithm for generating pairwise test suites

Investigation of determinism-related issues in the Sobol′ low-discrepancy sequence for producing sound global sensitivity analysis indices

Generalization of the power‐law rating curve using hydrodynamic theory and Bayesian hierarchical modeling

How Can Nursing Teams Respond to Large-Scale COVID-19 Screening?

Towards computationally efficient prediction of molecular signatures from routine histology images

Reliability-based topology optimization using stochastic gradients

Estimating viral prevalence with data fusion for adaptive two-phase pooled sampling.

Sector subsampling for basal area ratio estimation: an alternative to big BAF sampling

A Dynamic Spatial Model for Chronic Wasting Disease in Colorado

Variational Monte Carlo Calculations of A≤4 Nuclei with an Artificial Neural-Network Correlator Ansatz.

Optimal sampling of dynamical large deviations via matrix product states.

Quantifying soil carbon in temperate peatlands using a mid-IR soil spectral library

The Modified-Half-Normal distribution: Properties and an efficient sampling scheme

To Sobol or not to Sobol? The effects of sampling schemes in systems biology applications
