Infer the missing facts of D3FEND using knowledge graph representation learning

Anish Khobragade,Shashikant Ghumbre,Vinod Pachghare

doi:10.1108/ijwis-03-2023-0042

Abstract

Purpose

MITRE and the National Security Agency jointly developed and maintain the D3FEND knowledge graph (KG). It represents concepts from the cybersecurity countermeasure domain, such as dynamic, emulated and file analysis, as entities. Those entities are linked by relationships such as analyze, may_contains and encrypt. A fundamental challenge for collaborative designers is to encode knowledge and efficiently interrelate the cyber-domain facts generated daily. At present, however, the designers manually update the graph contents with new or missing facts to enrich the knowledge. This paper proposes an automated approach that predicts the missing facts using the link prediction task, leveraging embeddings as representation learning.

Design/methodology/approach

D3FEND is available in the resource description framework (RDF) format. In the preprocessing step, the facts in RDF format are converted to subject–predicate–object triples, which contain 5,967 entities and 98 relationship types. Progressive distance-based, bilinear and convolutional embedding models are applied to learn the embeddings of entities and relations. This study presents a link prediction task to infer missing facts using the learned embeddings.

Findings

Experimental results show that the translational model performs well on high-rank results, whereas the bilinear model is superior in capturing the latent semantics of complex relationship types. However, the convolutional model outperforms 44% of the true facts and achieves a 3% improvement in results compared to other models.

Research limitations/implications

Despite the success of embedding models in enriching D3FEND using link prediction under the supervised learning setup, the approach has some limitations, such as not capturing the diversity and hierarchies of relations. The average node degree of the D3FEND KG is 16.85, and 12% of entities have a node degree less than 2; in particular, many entities or relations have few or no observed links. This results in sparsity and data imbalance, which affect model performance even after increasing the embedding vector size. Moreover, KG embedding models consider only existing entities and relations and may not incorporate external or contextual information, such as textual descriptions, temporal dynamics or domain knowledge, which can enhance link prediction performance.

Practical implications

Link prediction in the D3FEND KG can benefit cybersecurity countermeasure strategies in several ways. It can help to identify gaps or weaknesses in the existing defensive methods and suggest possible ways to improve or augment them; to compare and contrast different defensive methods and understand their trade-offs and synergies; to discover novel or emerging defensive methods by inferring new relations from existing data or external sources; and to generate recommendations or guidance for selecting or deploying appropriate defensive methods based on the characteristics and objectives of the system or network.

Originality/value

The representation learning approach helps to reduce incompleteness by using link prediction, which infers possible missing facts from the existing entities and relations of D3FEND.
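The link prediction step described in the abstract can be illustrated with a minimal sketch of a distance-based (translational) scoring function in the style of TransE. This is not the authors' implementation: the entity names, the 2-dimensional embedding values and the helper functions `transe_score`, `rank_tails` and `reciprocal_rank` are all hypothetical, chosen only to show how learned embeddings are used to rank candidate objects for an incomplete triple (subject, predicate, ?).

```python
import math

# Toy 2-dimensional embeddings. All names and values are invented for
# illustration; they are not taken from the paper or from D3FEND.
entity_emb = {
    "FileAnalysis": [0.2, 0.8],
    "DynamicAnalysis": [0.9, 0.1],
    "ExecutableFile": [1.0, 1.0],
    "NetworkTraffic": [-0.5, 0.3],
}
relation_emb = {
    "analyze": [0.8, 0.2],
}


def transe_score(head, relation, tail):
    """Return -||h + r - t||_2; a higher score means a more plausible triple."""
    h = entity_emb[head]
    r = relation_emb[relation]
    t = entity_emb[tail]
    return -math.sqrt(sum((hi + ri - ti) ** 2 for hi, ri, ti in zip(h, r, t)))


def rank_tails(head, relation, known_triples):
    """Rank every candidate tail entity for (head, relation, ?).

    Triples already present in the graph are filtered out so that only
    candidate missing facts are ranked.
    """
    candidates = [
        e for e in entity_emb
        if e != head and (head, relation, e) not in known_triples
    ]
    return sorted(
        candidates,
        key=lambda tail: transe_score(head, relation, tail),
        reverse=True,
    )


def reciprocal_rank(ranked, true_tail):
    """Return 1 / rank of the true tail, the per-query term of mean reciprocal rank."""
    return 1.0 / (ranked.index(true_tail) + 1)


ranked = rank_tails("FileAnalysis", "analyze", known_triples=set())
print(ranked)
print(reciprocal_rank(ranked, "ExecutableFile"))
```

In this toy setup the vector for `FileAnalysis` plus the vector for `analyze` lands exactly on `ExecutableFile`, so that entity is ranked first. Bilinear and convolutional models follow the same ranking procedure and differ only in the scoring function.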

Journal: International Journal of Web Information Systems
Publication Date: Aug 16, 2023
Citations: 2

Similar Papers

Fine-Grained Evaluation of Knowledge Graph Embedding Models in Downstream Tasks
Yuxin Zhang ... Bohan Li
01 Jan 2020

Multiple Run Ensemble Learning with Low-Dimensional Knowledge Graph Embeddings
Chengjin Xu ... Jens Lehmann
18 Jul 2021

Fine-Grained Evaluation of Knowledge Graph Embedding Model in Knowledge Enhancement Downstream Tasks
Yuxin Zhang ... Han Yang
Big Data Research | VOL. 25
02 Mar 2021

Rule-based data augmentation for knowledge graph embedding
Guangyao Li ... Wei Hu
AI Open | VOL. 2
01 Jan 2020
