Tree Kernel Research Articles

The Rashômon Effect, as applied in Explainable Machine Learning, refers to the disagreement between the explanations provided by different attribution explainers and to the dissimilarity across multiple explanations generated by a single explainer for the same instance of a dataset (differences between feature importances and their associated signs and ranks). This is an undesirable outcome, especially in sensitive domains such as healthcare or finance. We propose a method inspired by textual case-based reasoning for aligning explanations from various explainers in order to resolve the disagreement and dissimilarity problems. We iteratively generated 100 explanations for each instance from six popular datasets, using three prevalent feature attribution explainers: LIME, Anchors and SHAP (with the variations Tree SHAP and Kernel SHAP), and then applied a global cluster-based aggregation strategy that quantifies alignment and reveals similarities and associations between explanations. We evaluated our method on a binary classification task by weighting the $k$-NN algorithm with agreed feature overlap explanation weights and comparing it to a non-weighted $k$-NN predictor. We also compared the results of the weighted $k$-NN algorithm using aggregated feature overlap explanation weights to those of the weighted $k$-NN algorithm using weights produced by a single explanation method (either LIME, SHAP or Anchors). Our global alignment method benefited the most from a hybridization with feature importance scores (information gain), which was essential for acquiring a more accurate estimate of disagreement, for enabling explainers to reach a consensus across multiple explanations and for supporting effective model learning through improved classification performance.
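The evaluation idea in this abstract, a $k$-NN classifier whose distance is scaled by per-feature weights, can be illustrated with a minimal sketch. The abstract does not specify the distance function or how the aggregated explanation weights are computed, so the weighted Euclidean distance, the function name `weighted_knn_predict`, the `feature_weights` values and the toy data below are all assumptions made for illustration only.

```python
import math
from collections import Counter


def weighted_knn_predict(train_X, train_y, query, feature_weights, k=3):
    """Majority vote among the k nearest neighbours under a
    feature-weighted Euclidean distance (an assumed choice of metric)."""

    def dist(a, b):
        # Each squared difference is scaled by its (hypothetical) feature weight.
        return math.sqrt(
            sum(w * (x - y) ** 2 for w, x, y in zip(feature_weights, a, b))
        )

    neighbours = sorted(zip(train_X, train_y), key=lambda p: dist(p[0], query))[:k]
    votes = Counter(label for _, label in neighbours)
    return votes.most_common(1)[0][0]


# Toy data (invented): feature 0 separates the classes, feature 1 is noise.
train_X = [(0.0, 0.0), (0.1, 10.0), (0.2, 20.0), (1.0, 1.0), (0.9, 11.0), (1.1, 21.0)]
train_y = [0, 0, 0, 1, 1, 1]
query = (0.3, 10.9)

unweighted = weighted_knn_predict(train_X, train_y, query, (1.0, 1.0), k=1)
weighted = weighted_knn_predict(train_X, train_y, query, (1.0, 0.0), k=1)
```

On this toy data the non-weighted predictor is pulled towards a class-1 point by the noisy second feature, while a weight vector that suppresses that feature returns class 0. In the setting described by the abstract, the weight vector would instead come from the aggregated explanations.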

Abstract: Regression test prioritization (RTP) is an active research field, aiming at re-ordering the tests in a test suite to maximize the rate at which faults are detected. A number of RTP strategies have been proposed, leveraging different factors to reorder tests. Some techniques include an analysis of changed source code, to assign higher priority to tests stressing modified parts of the codebase. Still, most of these change-based solutions focus on simple text-level comparisons among versions. We believe that measuring source code changes in a more refined way, capable of discriminating between mere textual changes (e.g., renaming of a local variable) and more structural changes (e.g., changes in the control flow), could lead to significant benefits in RTP, under the assumption that major structural changes are also more likely to introduce faults. To this end, we propose two novel RTP techniques that leverage tree kernels (TK), a class of similarity functions largely used in Natural Language Processing on tree-structured data. In particular, we apply TKs to abstract syntax trees of source code, to more precisely quantify the extent of structural changes in the source code, and prioritize tests accordingly. We assessed the effectiveness of the proposals by conducting an empirical study on five real-world Java projects, also used in a number of RTP-related papers. We automatically generated, for each considered pair of software versions (i.e., old version, new version) in the evolution of the involved projects, 100 variations with artificially injected faults, leading to over 5k different software evolution scenarios overall. We compared the proposed prioritization approaches against well-known prioritization techniques, evaluating both their effectiveness and their execution times. Our findings show that leveraging more refined code change analysis techniques to quantify the extent of changes in source code can lead to relevant improvements in prioritization effectiveness, while typically introducing negligible overheads due to their execution.
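The distinction this abstract draws between textual and structural changes can be sketched with a simple subtree-style kernel over abstract syntax trees. This is not the authors' technique: the abstract does not say which tree kernels are used, and the study targets Java, whereas the sketch below uses Python's standard `ast` module as a stand-in. The function names and the normalisation are assumptions chosen for illustration.

```python
import ast
import math
from collections import Counter


def subtree_signatures(tree):
    """Count one structural signature per AST node's complete subtree.
    Signatures use node type names only, so identifier names are ignored."""
    counts = Counter()

    def visit(node):
        children = [visit(child) for child in ast.iter_child_nodes(node)]
        sig = type(node).__name__ + "(" + ",".join(children) + ")"
        counts[sig] += 1
        return sig

    visit(tree)
    return counts


def subtree_kernel(src_a, src_b):
    """Number of pairs of identical complete subtrees shared by two sources."""
    a = subtree_signatures(ast.parse(src_a))
    b = subtree_signatures(ast.parse(src_b))
    return sum(n * b[sig] for sig, n in a.items())


def normalized_similarity(src_a, src_b):
    """Kernel value scaled to [0, 1]; 1.0 means structurally identical."""
    return subtree_kernel(src_a, src_b) / math.sqrt(
        subtree_kernel(src_a, src_a) * subtree_kernel(src_b, src_b)
    )


old = "def f(x):\n    y = x + 1\n    return y\n"
renamed = "def f(x):\n    z = x + 1\n    return z\n"  # local variable renamed
branched = "def f(x):\n    if x > 0:\n        return x + 1\n    return 0\n"  # control flow changed

sim_renamed = normalized_similarity(old, renamed)
sim_branched = normalized_similarity(old, branched)
```

Under these assumptions, renaming a local variable leaves the similarity at 1.0, while introducing a branch lowers it. A change-aware prioritizer could use one minus this similarity as a per-unit change score, although how the paper maps such scores to test order is not stated in the abstract.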

Articles published on Tree Kernel

Clarity in complexity: how aggregating explanations resolves the disagreement problem

Platelet Metabolites as Candidate Biomarkers in Sepsis Diagnosis and Management Using the Proposed Explainable Artificial Intelligence Approach.

Detection of network false information based on artificial intelligence models

Ordinal Pattern Tree: A New Representation Method for Brain Network Analysis.

Physico-chemical and nutritional characteristics of kernels oil from two mangoes varieties (Amélie and Kent) harvested at Orodara in Burkina Faso

Regression test prioritization leveraging source code similarity with tree kernels

Predicting oil content of Australian beauty leaf tree kernel samples using near infrared spectroscopy combined with chemometrics

Paraphrasing identification Using ACV-tree kernel

Relational Analysis of College English Vocabulary - A Reflection Based on Semantic Association Network Modeling

Design of a Biogas Power Plant That Uses Olive Tree Pruning and Olive Kernels in Achaia, Western Greece

Method of Training a Kernel Tree

A novel diagnosis method for schizophrenia based on globus pallidus data

A Short-Text Similarity Model Combining Semantic and Syntactic Information

Bench-scale integrated bone and biochar bed treatment of geogenic fluoride contaminated groundwater from Bongo in Ghana

A Closer Look at the Kernels Generated by the Decision and Regression Tree Ensembles

Colour Fastness of Silk Fabrics Dyed with Extracts from Oil Palm Tree Kernel Shell and Effect of Metal Ion Mordanting on Fabric Colour

A Robust Framework for Automated Screening of Diabetic Patient Using ECG Signals

A Machine Learning Challenge: Detection of Cardiac Amyloidosis Based on Bi-Atrial and Right Ventricular Strain and Cardiac Function.

An unsupervised semantic text similarity measurement model in resource-limited scenes

Gradient Boosting over Linguistic-Pattern-Structured Trees for Learning Protein–Protein Interaction in the Biomedical Literature
