Interpretable Multiclass Models for Corporate Credit Rating Capable of Expressing Doubt

Lennart Obermann,Stephan Waack

doi:10.3389/fams.2016.00016

Lennart Obermann, Stephan Waack

Open Access

https://doi.org/10.3389/fams.2016.00016

Copy DOI

Abstract

Corporate credit rating is a process to classify commercial enterprises based on their creditworthiness. Machine learning algorithms can construct classification models, but in general they do not tend to be 100% accurate. Since they can be used as decision support for experts, interpretable models are desirable. Unfortunately, interpretable models are provided by only few machine learners. Furthermore, credit rating often is a multiclass problem with more than two rating classes. Due to this fact, multiclass classification is often achieved via meta-algorithms using multiple binary learners. However, most state-of-the-art meta-algorithms destroy the interpretability of binary models. In this study, we present Thresholder, a binary interpretable threshold-based disjunctive normal form (DNF) learning algorithm in addition to modifications of popular multiclass meta-algorithms which maintain the interpretability of our binary classifier. Furthermore, we present an approach to express doubt in the decision of our model. Performance and model size are compared with other interpretable approaches for learning DNFs (RIPPER) and decision trees (C4.5) as well as non-interpretable models like random forests, artificial neural networks, and support vector machines. We evaluate their performances on three real-life data sets divided into three rating classes. In this case study all threshold-based and interpretable models perform equally well and significantly better than other methods. Our new Thresholder algorithm builds the smallest models while its performance is as good as the best methods of our case study. Furthermore, Thresholder marks many potential misclassifications in advance with a doubt label without increasing the classification error.

Highlights

The evaluation of the economic situation of commercial enterprises is an important task because inaccurate predictions may lead to huge financial losses
We showed in a prior study [3] that interpretable models are not inferior to black-box models in insolvency prediction, by comparing Decision Tree (DT), disjunctive normal form (DNF), Random Forests (RFs), Artificial Neural Networks (ANNs), and Support Vector Machines (SVMs)
Those performed slightly better than non-interpretable models in a prior study on insolvency prediction [3]

Summary

Introduction

The evaluation of the economic situation of commercial enterprises is an important task because inaccurate predictions may lead to huge financial losses. Machine learning methods using annual accounts offer an automated and objective way to achieve high prediction rates for this task. These models cannot completely replace expensive experts. Our goal is to build objective models with a low prediction error as a helpful decision support for experts in credit rating. We focus on the interpretability of models. There are binary tasks such as insolvency prediction and multiclass tasks

Objectives

Methods

Results

Conclusion