Predicting CVSS Metric via Description Interpretation

Joana Cabral Costa,Hugo Proenca,Pedro R M Inacio,Joao B F Sequeiros,Tiago Roxo

doi:10.1109/access.2022.3179692

Abstract

Cybercrime affects companies worldwide, costing millions of dollars annually. The constant increase of threats and vulnerabilities raises the need to handle vulnerabilities in a prioritized manner. This prioritization can be achieved through Common Vulnerability Scoring System (CVSS), typically used to assign a score to a vulnerability. However, there is a temporal mismatch between the vulnerability finding and score assignment, which motivates the development of approaches to aid in this aspect. We explore the use of Natural Language Processing (NLP) models in CVSS score prediction given vulnerability descriptions. We start by creating a vulnerability dataset from the National Vulnerability Database (NVD). Then, we combine text pre-processing and vocabulary addition to improve the model accuracy and interpret its prediction reasoning by assessing word importance, via Shapley values. Experiments show that the combination of Lemmatization and 5,000-word addition is optimal for DistilBERT, the outperforming model in our experiments of the NLP methods, achieving state-of-the-art results. Furthermore, specific events (such as an attack on a known software) tend to influence model prediction, which may hinder CVSS prediction. Combining Lemmatization with vocabulary addition mitigates this effect, contributing to increased accuracy. Finally, binary classes benefit the most from pre-processing techniques, particularly when one class is much more prominent than the other. Our work demonstrates that DistilBERT is a state-of-the-art model for CVSS prediction, demonstrating the applicability of deep learning approaches to aid in vulnerability handling. The code and data are available at https://github.com/Joana-Cabral/.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2022
Citations: 13	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Predicting CVSS Metric via Description Interpretation

Abstract

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Vulnerability severity scoring and bounties: why the disconnect?
Nuthan Munaiah ... Andrew Meneely
-
Nuthan Munaiah, et. al.Nuthan Munaiah ... Andrew Meneely
13 Nov 2016
13 Nov 2016

Categorization of Cybersecurity Vulnerabilities Utilizing Machine Learning Methods for Vulnerability Management

-

12 Mar 2021
12 Mar 2021

A New CVSS-Based Tool to Mitigate the Effects of Software Vulnerabilities
Assad Ali ... Ron Ruhl
International Journal for Information Security Research | VOL. 2
Assad Ali, et. al.Assad Ali ... Ron Ruhl
01 Sep 2012
International Journal for Information Security Research | VOL. 2

The common vulnerability scoring system (CVSS) and its applicability to federal agency systems
Peter Mell ... Karen Scarfone
-
Peter Mell, et. al.Peter Mell ... Karen Scarfone
01 Jan 2007
01 Jan 2007

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Predicting CVSS Metric via Description Interpretation

Abstract

Talk to us

Similar Papers

More From: IEEE Access