A Comparative Study of Deep Learning based Named Entity Recognition Algorithms for Cybersecurity

Soham Dasgupta,Anupam Joshi,Aritran Piplai,Anantaa Kotal

doi:10.1109/bigdata50022.2020.9378482

Abstract

Named Entity Recognition (NER) is important in the cybersecurity domain. It helps researchers extract cyber threat information from unstructured text sources. The extracted cyber-entities or key expressions can be used to model a cyber-attack described in an open-source text. A large number of general-purpose NER algorithms have been published that work well in text analysis. These algorithms do not perform well when applied to the cybersecurity domain. In the field of cybersecurity, the open-source text available varies greatly in complexity and under-lying structure of the sentences. General-purpose NER algorithms can misrepresent domain-specific words, such as malicious and javascript. In this paper, we compare the recent deep learning-based NER algorithms on a cybersecurity dataset. We created a cybersecurity dataset collected from various sources, including Microsoft Security Bulletin and Adobe Security Updates. Some of these approaches proposed in literature were not used for Cybersecurity. Others are innovations proposed by us. This comparative study helps us identify the NER algorithms that are robust and can work well in sentences taken from a large number of cybersecurity sources. We tabulate their performance on the test set and identify the best NER algorithm for a cybersecurity corpus. We also discuss the different embedding strategies that aid in the process of NER for the chosen deep learning algorithms.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Comparative Study of Deep Learning based Named Entity Recognition Algorithms for Cybersecurity

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

The Advance of Deep Learning Based Named Entity Recognition
Wenxuan Li
Highlights in Science, Engineering and Technology | VOL. 12
Wenxuan LiWenxuan Li
26 Aug 2022
Highlights in Science, Engineering and Technology | VOL. 12

Named Entity Recognition in Clinical Domain
Maria C Martinis
Reference Module in Life Sciences | VOL. -
Maria C MartinisMaria C Martinis
01 Jan 2024
Reference Module in Life Sciences | VOL. -

Coner: A Collaborative Approach for Long-Tail Named Entity Recognition in Scientific Publications
Daniel Vliegenthart ... Sepideh Mesbah
-
Daniel Vliegenthart, et. al.Daniel Vliegenthart ... Sepideh Mesbah
01 Jan 2019
01 Jan 2019

Deep Learning based Named Entity Recognition for the Bodo Language
Sanjib Narzary ... Bidisha Som
Procedia Computer Science | VOL. 235
Sanjib Narzary, et. al.Sanjib Narzary ... Bidisha Som
01 Jan 2024
Procedia Computer Science | VOL. 235

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Comparative Study of Deep Learning based Named Entity Recognition Algorithms for Cybersecurity

Abstract

Talk to us

Similar Papers