Context Of Machine Learning Research Articles

ContextMachine learning (ML) software systems are permeating many aspects of our life, such as healthcare, transportation, banking, and recruitment. These systems are trained with data that is often biased, resulting in biased behaviour. To address this issue, fairness testing approaches have been proposed to test ML systems for fairness, which predominantly focus on assessing classification-based ML systems. These methods are not applicable to regression-based systems, for example, they do not quantify the magnitude of the disparity in predicted outcomes, which we identify as important in the context of regression-based ML systems.Method:We conduct this study as design science research. We identify the problem instance in the context of emergency department (ED) wait-time prediction. In this paper, we develop an effective and efficient fairness testing approach to evaluate the fairness of regression-based ML systems. We propose fairness degree, which is a new fairness measure for regression-based ML systems, and a novel search-based fairness testing (SBFT) approach for testing regression-based machine learning systems. We apply the proposed solutions to ED wait-time prediction software.Results:We experimentally evaluate the effectiveness and efficiency of the proposed approach with ML systems trained on real observational data from the healthcare domain. We demonstrate that SBFT significantly outperforms existing fairness testing approaches, with up to 111% and 190% increase in effectiveness and efficiency of SBFT compared to the best performing existing approaches.Conclusion:These findings indicate that our novel fairness measure and the new approach for fairness testing of regression-based ML systems can identify the degree of fairness in predictions, which can help software teams to make data-informed decisions about whether such software systems are ready to deploy. The scientific knowledge gained from our work can be phrased as a technological rule; to measure the fairness of the regression-based ML systems in the context of emergency department wait-time prediction use fairness degree and search-based techniques to approximate it.

ABSTRACT Label noise is a commonly encountered problem in learning building extraction tasks; its presence can reduce performance and increase learning complexity. This is especially true for cases where high resolution aerial drone imagery is used, as the labels may not perfectly correspond/align with the actual objects in the imagery. In general machine learning and computer vision context, labels refer to the associated class of data, and in remote sensing-based building extraction refer to pixel-level classes. Dense label noise in building extraction tasks has rarely been formalized and assessed. We formulate a taxonomy of label noise models for building extraction tasks, which incorporates both pixel-wise and dense models. While learning dense prediction under label noise, the differences between the ground truth clean label and observed noisy label can be encoded by error matrices indicating locations and type of noisy pixel-level labels. In this work, we explicitly learn to approximate error matrices for improving building extraction performance; essentially, learning dense prediction of label noise as a subtask of a larger building extraction task. We propose two new model frameworks for learning building extraction under dense real-world label noise, and consequently two new network architectures, which approximate the error matrices as intermediate predictions. The first model learns the general error matrix as an intermediate step and the second model learns the false positive and false-negative error matrices independently, as intermediate steps. Approximating intermediate error matrices can generate label noise saliency maps, for identifying labels having higher chances of being mis-labelled. We have used ultra-high-resolution aerial images, noisy observed labels from OpenStreetMap, and clean labels obtained after careful annotation by the authors. When compared to the baseline model trained and tested using clean labels, our intermediate false positive-false negative error matrix model provides Intersection-Over-Union gain of 2.74% and F1-score gain of 1.75% on the independent test set. Furthermore, our proposed models provide much higher recall than currently used deep learning models for building extraction, while providing comparable precision. We show that intermediate false positive-false negative error matrix approximation can improve performance under label noise.

Context Of Machine Learning Research Articles

Related Topics

Articles published on Context Of Machine Learning

Deep Neural Network-Aided Soft-Demapping in Coherent Optical Systems: Regression Versus Classification

A Fuzzy Approach to Drum Cymbals Classification

Search-based fairness testing for regression-based machine learning systems

Moral dilemmas for moral machines

Manifolds of quasi-constant SOAP and ACSF fingerprints and the resulting failure to machine learn four-body interactions.

The Markov Random Field in Materials Applications: A synoptic view for signal processing and materials readers

MACHINE LEARNING IN THE FIELD OF MANUFACTURING

Dense prediction of label noise for learning building extraction from aerial drone imagery

On the classification of simple and complex biological images using Krawtchouk moments and Generalized pseudo-Zernike moments: a case study with fly wing images and breast cancer mammograms.

A machine learning based asset pricing factor model comparison on anomaly portfolios

Entropy, cross-entropy, relative entropy: Deformation theory (a)

A MACHINE LEARNING BASED ASSET PRICING FACTOR MODEL EXTENSION COMPARISON ON ANOMALY PORTFOLIOS

Response-adaptive trial designs with accelerated Thompson sampling.

Item response theory as a feature selection and interpretation tool in the context of machine learning.

Annotating affective dimensions in user-generated content

Functional analysis of generalized linear models under non-linear constraints with applications to identifying highly-cited papers

Reaction-based machine learning representations for predicting the enantioselectivity of organocatalysts.

Neutrosophic-based machine learning context for the trustworthiness of devices in the internet of things

Beyond kappa: an informational index for diagnostic agreement in dichotomous and multivalue ordered-categorical ratings

Adaptive context‐aware service optimization in mobile cloud computing accounting for security aspects

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Context Of Machine Learning Research Articles

Related Topics

Articles published on Context Of Machine Learning

Deep Neural Network-Aided Soft-Demapping in Coherent Optical Systems: Regression Versus Classification

A Fuzzy Approach to Drum Cymbals Classification

Search-based fairness testing for regression-based machine learning systems

Moral dilemmas for moral machines

Manifolds of quasi-constant SOAP and ACSF fingerprints and the resulting failure to machine learn four-body interactions.

The Markov Random Field in Materials Applications: A synoptic view for signal processing and materials readers

MACHINE LEARNING IN THE FIELD OF MANUFACTURING

Dense prediction of label noise for learning building extraction from aerial drone imagery

On the classification of simple and complex biological images using Krawtchouk moments and Generalized pseudo-Zernike moments: a case study with fly wing images and breast cancer mammograms.

A machine learning based asset pricing factor model comparison on anomaly portfolios

Entropy, cross-entropy, relative entropy: Deformation theory (a)

A MACHINE LEARNING BASED ASSET PRICING FACTOR MODEL EXTENSION COMPARISON ON ANOMALY PORTFOLIOS

Response-adaptive trial designs with accelerated Thompson sampling.

Item response theory as a feature selection and interpretation tool in the context of machine learning.

Annotating affective dimensions in user-generated content

Functional analysis of generalized linear models under non-linear constraints with applications to identifying highly-cited papers

Reaction-based machine learning representations for predicting the enantioselectivity of organocatalysts.

Neutrosophic-based machine learning context for the trustworthiness of devices in the internet of things

Beyond kappa: an informational index for diagnostic agreement in dichotomous and multivalue ordered-categorical ratings

Adaptive context‐aware service optimization in mobile cloud computing accounting for security aspects