Abstract

Machine Learning (ML) prediction algorithms have made significant contributions in many domains, leading to their increasingly widespread use. However, as the adoption of ML algorithms surges, the need for transparent and interpretable models becomes essential. Visual representations have proven instrumental in addressing this issue, allowing users to grasp models’ inner workings. Despite their popularity, visualization techniques still present visual scalability limitations, mainly when applied to analyze popular and complex models such as Random Forests (RF). In this work, we propose Random Forest Similarity Map (RFMap), a scalable interactive visual analytics tool designed to analyze RF ensemble models. RFMap focuses on explaining the inner working mechanism of models through different views that describe individual data instance predictions, provide an overview of the entire forest of trees, and highlight instance input feature values. The interactive nature of RFMap allows users to visually interpret model errors and decisions, establishing the necessary confidence and user trust in RF models and helping improve their performance.

Highlights

  • Machine Learning (ML) algorithms have seen widespread usage in numerous fields over the past few years

  • We present two usage scenarios to evaluate the effectiveness of Random Forest Similarity Map (RFMap) in interpreting and visualizing Random Forest (RF) models

  • Karen uses our RFMap system to visualize and interpret an RF model she has developed to classify breast cancer diagnoses. The dataset she uses to train the RF model is from the University of Wisconsin (Wisconsin Breast Cancer Diagnostic [65]); it contains samples of solid breast masses collected from 569 patients, of which 357 were labeled Benign (B) and 212 Malignant (M). A minimal code sketch of this training scenario follows the list
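
The sketch below is a minimal, illustrative reconstruction of this scenario, not the RFMap implementation: it trains an RF classifier on scikit-learn's copy of the Wisconsin Breast Cancer Diagnostic data and collects the per-tree votes for one test instance, the kind of per-instance prediction detail that RFMap's views summarize. The train/test split and hyperparameters are assumptions made only for illustration.

from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# 569 samples: 357 benign and 212 malignant (in scikit-learn, label 0 = malignant, 1 = benign).
X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

# Illustrative hyperparameters; the usage scenario does not prescribe these values.
rf = RandomForestClassifier(n_estimators=100, random_state=0)
rf.fit(X_train, y_train)

# Per-tree predictions for a single test instance: each tree casts one vote, and the
# distribution of votes is the instance-level information an analyst inspects for
# uncertain or misclassified cases.
instance = X_test[:1]
votes = [rf.classes_[int(tree.predict(instance)[0])] for tree in rf.estimators_]
malignant_share = sum(v == 0 for v in votes) / len(votes)
print("forest prediction (0 = malignant, 1 = benign):", rf.predict(instance)[0])
print("share of trees voting malignant:", malignant_share)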

Introduction

Machine Learning (ML) algorithms have seen widespread usage in numerous fields over the past few years. The drive for better predictive performance in real-life use cases often comes with an intrinsic problem: interpreting the produced results [7]. COMPAS (Correctional Offender Management Profiling for Alternative Sanctions), software for judging the likelihood of a criminal defendant becoming a recidivist, has been widely criticized for its racially biased decisions [8]. The algorithm's results indicated that defendants of color were assigned a greater risk of recidivism than white defendants, and the reasons are unclear since race is not used for prediction. Since ML techniques have become ubiquitous, especially in crucial decision-making involving humans, there is considerable demand to explain how these complex algorithms work and to help decision-makers gain confidence and trust in them [9].