“Quasi-Metacognitive Machines: Why We Don’t Need Morally Trustworthy AI and Communicating Reliability is Enough”

John Dorsch,Ophelia Deroy

doi:10.1007/s13347-024-00752-w

John Dorsch, Ophelia Deroy

Open Access

PDF Available

https://doi.org/10.1007/s13347-024-00752-w

Copy DOI

Export

Save

Cite

Journal: Philosophy & Technology	Publication Date: May 3, 2024
Citations: 1	License type: CC BY 4.0

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

Many policies and ethical guidelines recommend developing “trustworthy AI”. We argue that developing morally trustworthy AI is not only unethical, as it promotes trust in an entity that cannot be trustworthy, but it is also unnecessary for optimal calibration. Instead, we show that reliability, exclusive of moral trust, entails the appropriate normative constraints that enable optimal calibration and mitigate the vulnerability that arises in high-stakes hybrid decision-making environments, without also demanding, as moral trust would, the anthropomorphization of AI and thus epistemically dubious behavior. The normative demands of reliability for inter-agential action are argued to be met by an analogue to procedural metacognitive competence (i.e., the ability to evaluate the quality of one’s own informational states to regulate subsequent action). Drawing on recent empirical findings that suggest providing reliability scores (e.g., F1-scores) to human decision-makers improves calibration in the AI system, we argue that reliability scores provide a good index of competence and enable humans to determine how much they wish to rely on the system.

Full Text