Class Predictions Research Articles

Recent country and continental-scale digital soil mapping efforts have used a single model to predict soil properties across large regions. However, different ecophysiographic regions within large-extent areas are likely to have different soil-landscape relationships, so models built specifically for these regions may capture these relationships more accurately than a ‘global’ model. We ask the question: Is a single ‘global’ model sufficient, or are regionally-specific models useful for accurate digital soil mapping? We address this question by modeling soil depth classes across the 432,000 km² upper Colorado River Basin in the Western USA using a single global model, multiple ecophysiographic models, and ensembles of the ecophysiographic models.

Effective soil depth class observations (n = 12,194) were derived from multiple soil databases. Fifty-seven environmental covariates were derived from a 30 m digital elevation model, climate data, satellite imagery, and aeroradiometric data. Three independent land classifications were used to stratify the area. Two expert-derived land classifications, USDA Major Land Resource Areas (MLRA) and US-EPA Level III ecoregions, divided the study area into multiple ecophysiographic regions based on vegetation and broad-scale physiographic differences. The third land classification divided the study area into broad landforms.

Soil depth observations were split into separate training (n = 10,470) and validation (n = 1,724) datasets. First, a ‘global’ random forest model was used to model soil depth classes using all training observations and covariates. ‘Global’ denotes a model built with all training data across the extent of the area, not a model at world extent. Second, the land classifications were used to subset the observations into ecophysiographic sub-datasets, and random forest models were refit for each region. Models fit by ecophysiographic region are referred to as regional models. Third, predictions from each regional model were fused into regional-ensemble models. Accuracy, Brier scores, and Shannon’s entropy were used to compare model accuracy and uncertainty. Regional ecophysiographic models were also compared to models built for geographic areas that were defined solely to be approximately equal in area. Training dataset density and the imbalance ratio were investigated to determine whether data characteristics influenced regional accuracy and uncertainty metrics.

Accuracy for the global model using the validation set was 62.8%. Regional model accuracies ranged between 56.1% and 75.0%. We found: 1) useful inter-regional differences in global model accuracy were revealed when the global model was validated by region, 2) no consistent relationship between training observation density and accuracy/uncertainty metrics, 3) no meaningful differences in accuracy and uncertainty metrics between physiographic and geographic regions, 4) ensembles of regionally-specific models were approximately as accurate as global models, and 5) both region-specific models and ensembles of regional models were less uncertain than the global model. Overall, we recommend the use of soil depth class predictions made from MLRA regional ensemble models because this prediction had higher accuracy than the ecoregion ensemble model prediction, but lower uncertainty than both the global model and the landform ensemble model predictions. We answer our question: Ensembles of regionally-specific models are approximately as accurate as global models, but result in less uncertainty.
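The abstract names Brier scores and Shannon's entropy as its uncertainty measures but does not give the exact formulas used. The following is a minimal sketch under common definitions (multiclass Brier score as the sum of squared differences from a one-hot outcome, entropy with the natural logarithm); the function names and the three-class probabilities are hypothetical and chosen only for illustration.

```python
import math

def brier_score(probs, true_index):
    """Multiclass Brier score for a single observation.

    Sum of squared differences between the predicted class probabilities
    and the one-hot vector of the observed class. 0 is a perfect prediction.
    """
    return sum(
        (p - (1.0 if k == true_index else 0.0)) ** 2
        for k, p in enumerate(probs)
    )

def shannon_entropy(probs):
    """Shannon entropy (natural log) of a predicted class distribution.

    0 means all probability is on one class; larger values mean the
    prediction is spread across classes, i.e. more uncertain.
    """
    return -sum(p * math.log(p) for p in probs if p > 0)

# Hypothetical predicted probabilities for three soil depth classes.
confident = [0.8, 0.1, 0.1]
uncertain = [1 / 3, 1 / 3, 1 / 3]

print("Brier (confident, class 0 observed):", brier_score(confident, 0))
print("Entropy (confident):", shannon_entropy(confident))
print("Entropy (uncertain):", shannon_entropy(uncertain))
```

Under these definitions, a model can match another in accuracy while still producing lower entropy, which is the kind of difference the abstract reports between the regional ensembles and the global model.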


Vehicle ownership modeling and prediction is a crucial task in transportation planning and has traditionally relied on statistical models. However, with advances in computing power and Artificial Intelligence, Machine Learning (ML) algorithms are becoming an alternative or a complement to statistical models in transportation planning. Although the application of ML algorithms to transportation planning processes such as mode choice, traffic forecasting, and demand modeling has received much attention in research and abounds in the literature, scant attention has been paid to their application to vehicle ownership modeling, especially in the context of small to medium cities in developing countries. Therefore, this study attempts to fill this gap by modeling vehicle ownership in the Greater Tamale Area (GTA), a typical small to medium city in Ghana. Data were collected between June and August 2018 using a cross-sectional survey of formal-sector workers. The study applied nine different ML classification algorithms to the dataset, using 10-fold cross-validation and the Cohen's kappa statistic to evaluate the predictive performance of each algorithm, and Permutation Feature Importance to examine the features that contribute significantly to the prediction of vehicle ownership in GTA. The results showed that the Linear Support Vector Classification (LinearSVC) classifier performed well in comparison with the other classifiers in terms of overall predictive ability. In terms of class predictions, the K-Nearest Neighbors (KNN) classifier performed well for the no-vehicle class, while the LinearSVC and GaussianNB classifiers performed well for motorcycle ownership. The LinearSVC and Logistic Regression classifiers performed well on the car ownership class. The results also indicated that travel mode choice, average monthly income, average travel distance to workplace, average monthly expenditure on transport, duration of travel to workplace, occupational rank, age, household size, and marital status were significant in predicting vehicle ownership for most of the classifiers. These findings could help policymakers devise strategies that would reduce vehicle ownership while improving personal mobility.
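As an illustration of the Cohen's kappa statistic mentioned in the abstract, the sketch below computes it from scratch as observed agreement corrected for the agreement expected by chance. The labels are invented for illustration and are not taken from the study's survey data; the function name is likewise hypothetical.

```python
from collections import Counter

def cohen_kappa(y_true, y_pred):
    """Cohen's kappa: (observed - expected) / (1 - expected).

    observed: fraction of cases where prediction matches the true label.
    expected: agreement expected by chance, from the marginal class
    frequencies of the true and predicted labels.
    """
    n = len(y_true)
    observed = sum(t == p for t, p in zip(y_true, y_pred)) / n
    true_counts = Counter(y_true)
    pred_counts = Counter(y_pred)
    expected = sum(true_counts[c] * pred_counts[c] for c in true_counts) / (n * n)
    if expected == 1.0:
        # Kappa is undefined when both label sets contain one identical class.
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1.0 - expected)

# Hypothetical ownership labels for six respondents.
y_true = ["none", "none", "motorcycle", "motorcycle", "car", "car"]
y_pred = ["none", "none", "motorcycle", "car", "car", "car"]

# Observed agreement is 5/6 and chance agreement is 1/3, so kappa is about 0.75.
print(cohen_kappa(y_true, y_pred))
```

Unlike plain accuracy, kappa discounts agreement that a classifier could reach by guessing according to class frequencies, which matters when ownership classes are unevenly represented.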


Articles published on Class Predictions

REFINE: Prediction Fusion Network for Panoptic Segmentation

Semi-Supervised Learning with Variational Bayesian Inference and Maximum Uncertainty Regularization

Condition-CNN: A hierarchical multi-label fashion image classification model

Deep Neural Network for Differentiation of Brain Tumor Tissue Displayed by Confocal Laser Endomicroscopy.

Regional ensemble modeling reduces uncertainty for digital soil mapping

EEG-Based Eye Movement Recognition Using Brain-Computer Interface and Random Forests.

A novel feature selection method for data mining tasks using hybrid Sine Cosine Algorithm and Genetic Algorithm

Modeling vehicle ownership with machine learning techniques in the Greater Tamale Area, Ghana.

A one-step Bayesian inversion framework for 3D reservoir characterization based on a Gaussian mixture model — A Norwegian Sea demonstration

Aerial Image Analysis Using Deep Learning for Electrical Overhead Line Network Asset Management

Deep Feature Representations for Variable-Sized Regions of Interest in Breast Histopathology.

Clustering Based Undersampling for Handling Class Imbalance in C4.5 Classification Algorithm

Markerless movement tracking using a machine-learning algorithm to assess arm movements during gait in children with HIV encephalopathy

The 𝑟-𝑑 class predictions in linear mixed models

Structural profile matrices for predicting structural properties of proteins.

Unimodal regularized neuron stick-breaking for ordinal classification

Denborazko serieen sailkapen goiztiarra helburu anitzeko optimizazio problema gisa aztertua [Early classification of time series studied as a multi-objective optimization problem]

Predicting and mapping site index in operational forest inventories using bitemporal airborne laser scanner data

Input representations and classification strategies for automated human gait analysis

A convolutional neural network applied to measured time series for source range and ocean seabed classification
