Inconsistent measurement and incorrect detection of software names in security vulnerability reports

Hongyu Sun, Guoliang Ou, Ziqiu Zheng, Lei Liao, He Wang, Yuqing Zhang

doi:10.1016/j.cose.2023.103477

Abstract

As the number of vulnerability databases established by various nations continues to grow, they have accumulated hundreds of thousands of security vulnerability reports, which play a crucial role in protecting system security. However, many databases lack essential information, contain inaccuracies, or are inconsistent with one another. Despite these problems, the importance of vulnerability databases continues to grow. Existing research on vulnerability databases is limited to software versions and vulnerability reproduction; software names, an essential component of these databases, have not been studied extensively. Understanding how consistent software names are across vulnerability databases is crucial for improving their accuracy. This paper introduces VERNIER, an automated method for measuring inconsistencies in 789,954 sets of software names from nine security vulnerability databases (including CVE and NVD) covering 1999 to 2019. We use a named entity recognition (NER) model with high accuracy (99.5%) and F1 score (95.1%) to extract software names from unstructured Chinese and English vulnerability reports. VERNIER assesses the inconsistency of software names at the character and semantic levels. The results indicate that inconsistent software names are prevalent in vulnerability databases. The average exact matching rate between NVD and other mainstream databases, such as CVE, is only 20.3% at the character level and 43.3% at the semantic level. We also discover internal inconsistencies between the structured and unstructured software names within the same vulnerability database (e.g., NVD). To mitigate the inconsistency, we implement an alert tool that uses these inconsistencies to detect incorrect software names. This tool can effectively flag and correct incorrect software names.
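
To make the distinction between character-level and semantic-level matching concrete, the following is a minimal sketch and not VERNIER itself. The abstract does not describe how the semantic comparison is computed, so this example substitutes a simple token-set comparison with a small alias table. The names `ALIASES`, `char_match`, `semantic_match`, and `matching_rate`, as well as the sample name pairs, are hypothetical and chosen only for illustration.

```python
import re

# Hypothetical alias table (an assumption for illustration only).
ALIASES = {"ie": "internet explorer"}


def char_match(a: str, b: str) -> bool:
    """Character-level: the two names must be identical strings
    (ignoring only leading and trailing whitespace)."""
    return a.strip() == b.strip()


def canonical(name: str) -> frozenset:
    """Reduce a name to a set of lowercase alphanumeric tokens,
    expanding any token found in the alias table."""
    tokens = re.findall(r"[a-z0-9]+", name.casefold())
    expanded = []
    for tok in tokens:
        expanded.extend(ALIASES.get(tok, tok).split())
    return frozenset(expanded)


def semantic_match(a: str, b: str) -> bool:
    """Stand-in for a semantic comparison: equal canonical token sets."""
    return canonical(a) == canonical(b)


def matching_rate(pairs, match) -> float:
    """Fraction of name pairs for which match(a, b) is true."""
    if not pairs:
        return 0.0
    return sum(match(a, b) for a, b in pairs) / len(pairs)


# Made-up pairs: the same product as named in two databases.
pairs = [
    ("OpenSSL", "OpenSSL"),
    ("Internet Explorer", "IE"),
    ("Microsoft Windows", "microsoft_windows"),
    ("Apache Tomcat", "Tomcat"),
]

if __name__ == "__main__":
    print("character-level rate:", matching_rate(pairs, char_match))
    print("semantic-level rate:", matching_rate(pairs, semantic_match))
```

In this toy data, only identical strings match at the character level, while alias expansion and separator normalization let more pairs match at the looser level. That is the same direction as the gap the abstract reports between the two rates, although the numbers here carry no empirical meaning.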

Journal: Computers & Security	Publication Date: Sep 14, 2023
Citations: 1

Similar Papers

Detecting Inconsistent Vulnerable Software Version in Security Vulnerability Reports
Hansong Ren ... Liao Lei
01 Jan 2021

BiodiViz: Leveraging NER and RE for Automated Knowledge Graph Generation in Biodiversity Research
Angela Shannen Tan ... Roselyn Gabud
Biodiversity Information Science and Standards | VOL. 8
29 Oct 2024

IFVD: Design of Intelligent Fusion Framework for Vulnerability Data Based on Text Measures
Ruike Li ... Xudong Cao
01 Aug 2020

Analysis and Research on Security Vulnerability Database
Jing Fang ... Yingbo Li
01 Jan 2015
