A comprehensive analysis on software vulnerability detection datasets: trends, challenges, and road ahead

Yuejun Guo, Seifeddine Bettaieb, Fran Casino

doi:10.1007/s10207-024-00888-y

Abstract

As society’s dependence on information and communication technologies (ICTs) grows, so does the need to guarantee that these systems function and are used properly. In this context, it is critical to enhance the security and robustness of the DevSecOps pipeline through timely vulnerability detection. AI-based models typically offer desirable features such as automation, performance, and efficacy. However, the quality of such models depends heavily on the datasets used during training. Building these datasets poses a series of challenges yet to be solved, such as access to extensive labelled datasets with specific properties, including well-represented and balanced samples. This article explores the current state of practice of software vulnerability datasets and provides a classification of the main challenges and issues. After an extensive analysis, it describes a set of guidelines and desirable features that datasets should guarantee. These guidelines are applied to create a new dataset that fulfils these properties, along with a descriptive comparison with the state of the art. Finally, a discussion on how to foster good practices among researchers and practitioners sets the ground for further research and continued improvement within this critical domain.

Journal: International Journal of Information Security
Publication Date: Jul 23, 2024
License type: CC BY 4.0

