High-stakes Applications Research Articles

Deep neural networks (DNNs) have achieved unprecedented success across many scientific and engineering fields in the last decades. Despite its empirical success, unfortunately, recent studies have shown that there are various failure modes and blindspots in DNN models which may result in unexpected serious failures and potential harms, e.g. the existence of adversarial examples and small perturbations. This is not acceptable especially for safety critical and high stakes applications in the real-world, including healthcare, self-driving cars, aircraft control systems, hiring and malware detection protocols. Moreover, it has been challenging to understand why and when DNNs will fail due to their complicated structures and black-box behaviors. Lacking interpretability is one critical issue that may seriously hinder the deployment of DNNs in high-stake applications, which need interpretability to trust the prediction, to understand potential failures, and to be able to mitigate harms and eliminate biases in the model. To make DNNs trustworthy and reliable for deployment, it is necessary and urgent to develop methods and tools that can (i) quantify and improve their robustness against adversarial and natural perturbations, and (ii) understand their underlying behaviors and further correct errors to prevent injuries and damages. These are the important first steps to enable Trustworthy AI and Trustworthy Machine Learning. In this talk, I will survey a series of research efforts in my lab contributed to tackling the grand challenges in (i) and (ii). In the first part of my talk, I will overview our research effort in Robust Machine Learning since 2017, where we have proposed the first attack-agnostic robustness evaluation metric, the first efficient robustness certification algorithms for various types of perturbations, and efficient robust learning algorithms across supervised learning to deep reinforcement learning. In the second part of my talk, I will survey a series of exciting results in my lab on accelerating interpretable machine learning and explainable AI. Specifically, I will show how we could bring interpretability into deep learning by leveraging recent advances in multi-modal models. I'll present recent works in our group on automatically dissecting neural networks with open vocabulary concepts, designing interpretable neural networks without concept labels, and briefly overview our recent efforts on demystifying black-box DNN training process, automated neuron explanations for Large Language Models and the first robustness evaluation of a family of neuron-level interpretation techniques.

Deep-learning based Automatic Essay Scoring (AES) systems are being actively used in various high-stake applications in education and testing. However, little research has been put to understand and interpret the black-box nature of deep-learning-based scoring algorithms. While previous studies indicate that scoring models can be easily fooled, in this paper, we explore the reason behind their surprising adversarial brittleness. We utilize recent advances in interpretability to find the extent to which features such as coherence, content, vocabulary, and relevance are important for automated scoring mechanisms. We use this to investigate the oversensitivity (i.e., large change in output score with a little change in input essay content) and overstability (i.e., little change in output scores with large changes in input essay content) of AES. Our results indicate that autoscoring models, despite getting trained as “end-to-end” models with rich contextual embeddings such as BERT, behave like bag-of-words models. A few words determine the essay score without the requirement of any context making the model largely overstable. This is in stark contrast to recent probing studies on pre-trained representation learning models, which show that rich linguistic features such as parts-of-speech and morphology are encoded by them. Further, we also find that the models have learnt dataset biases, making them oversensitive. The presence of a few words with high co-occurrence with a certain score class makes the model associate the essay sample with that score. This causes score changes in ∼95% of samples with an addition of only a few words. To deal with these issues, we propose detection-based protection models that can detect oversensitivity and samples causing overstability with high accuracies. We find that our proposed models are able to detect unusual attribution patterns and flag adversarial samples successfully.

High-stakes Applications Research Articles

Articles published on High-stakes Applications

The Evaluation of Machine Learning Techniques for Isotope Identification Contextualized by Training and Testing Spectral Similarity

Towards Trustworthy Deep Learning

Conservative Policy Construction Using Variational Autoencoders for Logged Data With Missing Values.

Open your black box classifier.

RESCU-SQL: Oblivious Querying for the Zero Trust Cloud

Interpreting convolutional neural network classifiers applied to laser-induced breakdown optical emission spectra

Optimal Sparse Regression Trees.

Falcon: A Privacy-Preserving and Interpretable Vertical Federated Learning System

A Review of Partial Information Decomposition in Algorithmic Fairness and Explainability.

Automatic Essay Scoring Systems Are Both Overstable And Oversensitive: Explaining Why And Proposing Defenses

AI Accountability: Approaches, Affecting Factors, and Challenges

In-Processing Modeling Techniques for Machine Learning Fairness: A Survey

Fair classification via domain adaptation: A dual adversarial learning approach.

Why did AI get this one wrong? — Tree-based explanations of machine learning model predictions

Combining physics-based and data-driven techniques for reliable hybrid analysis and modeling using the corrective source term approach

Bayesian autoencoders with uncertainty quantification: Towards trustworthy anomaly detection

Role of Human-AI Interaction in Selective Prediction

Identifying interactions in omics data for clinical biomarker discovery using symbolic regression.

Fair active learning

Deciding Fast and Slow: The Role of Cognitive Biases in AI-assisted Decision-making

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

High-stakes Applications Research Articles

Articles published on High-stakes Applications

The Evaluation of Machine Learning Techniques for Isotope Identification Contextualized by Training and Testing Spectral Similarity

Towards Trustworthy Deep Learning

Conservative Policy Construction Using Variational Autoencoders for Logged Data With Missing Values.

Open your black box classifier.

RESCU-SQL: Oblivious Querying for the Zero Trust Cloud

Interpreting convolutional neural network classifiers applied to laser-induced breakdown optical emission spectra

Optimal Sparse Regression Trees.

Falcon: A Privacy-Preserving and Interpretable Vertical Federated Learning System

A Review of Partial Information Decomposition in Algorithmic Fairness and Explainability.

Automatic Essay Scoring Systems Are Both Overstable And Oversensitive: Explaining Why And Proposing Defenses

AI Accountability: Approaches, Affecting Factors, and Challenges

In-Processing Modeling Techniques for Machine Learning Fairness: A Survey

Fair classification via domain adaptation: A dual adversarial learning approach.

Why did AI get this one wrong? — Tree-based explanations of machine learning model predictions

Combining physics-based and data-driven techniques for reliable hybrid analysis and modeling using the corrective source term approach

Bayesian autoencoders with uncertainty quantification: Towards trustworthy anomaly detection

Role of Human-AI Interaction in Selective Prediction

Identifying interactions in omics data for clinical biomarker discovery using symbolic regression.

Fair active learning

Deciding Fast and Slow: The Role of Cognitive Biases in AI-assisted Decision-making