Securing Machine Learning: Understanding Adversarial Attacks and Bias Mitigation

Archit Lakhani,Neyah Rohit

doi:10.38124/ijisrt/ijisrt24jun1671

Abstract

This paper offers a comprehensive examination of adversarial vulnerabilities in machine learning (ML) models and strategies for mitigating fairness and bias issues. It analyses various adversarial attack vectors encompassing evasion, poisoning, model inversion, exploratory probes, and model stealing, elucidating their potential to compromise model integrity and induce misclassification or information leakage. In response, a range of defence mechanisms including adversarial training, certified defences, feature transformations, and ensemble methods are scrutinized, assessing their effectiveness and limitations in fortifying ML models against adversarial threats. Furthermore, the study explores the nuanced landscape of fairness and bias in ML, addressing societal biases, stereotypes reinforcement, and unfair treatment, proposing mitigation strategies like fairness metrics, bias auditing, de-biasing techniques, and human-in-the-loop approaches to foster fairness, transparency, and ethical AI deployment. This synthesis advocates for interdisciplinary collaboration to build resilient, fair, and trustworthy AI systems amidst the evolving technological paradigm.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal of Innovative Science and Research Technology (IJISRT)	Publication Date: Jul 11, 2024
Citations: 1	License type: cc-by-nc

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Securing Machine Learning: Understanding Adversarial Attacks and Bias Mitigation

Abstract

Talk to us