Technologies for Reliable AI Test and Evaluation

Lei Hamilton,Sanjeev Mohindra,Garrett Botkin,Olivia Brown,Michael Yee,Vincent Mancuso,Justin Goodwin

doi:10.1609/aaaiss.v2i1.27679

Technologies for Reliable AI Test and Evaluation

Lei Hamilton, Sanjeev Mohindra + Show 5 more

Open Access

https://doi.org/10.1609/aaaiss.v2i1.27679

Copy DOI

Journal: Proceedings of the AAAI Symposium Series

Publication Date: Jan 22, 2024

#Computer Vision Dataset #Artificial Intelligence + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Artificial intelligence (AI) is revolutionizing many industries, while at the same time facing challenges to safe and reliable use such as vulnerability to adversarial attacks and data drift. Although many AI test and evaluation (T&E) tools exist, integrating them is difficult. Under a program funded by the Chief Digital and AI Office (CDAO), we are developing a library to simplify the AI T&E process by providing user- and developer-friendly interfaces for composing T&E workflows. We illustrate the effectiveness of this approach with an example that compares clean and perturbed accuracy of two models on a computer vision dataset.

Full Text