Natural Language Processing Applied to Forensics Information Extraction With Transformers and Graph Visualization

Fillipe Barros Rodrigues,William Ferreira Giozza,Robson De Oliveira Albuquerque,Luis Javier García Villalba

doi:10.1109/tcss.2022.3159677

Abstract

Digital forensics analysis is a slow process mainly due to the large amount and variety of data. Some forensic tools help categorize files by type and allow automatization of tasks, like named entity recognition (NER). NER is a key component in many natural language processing (NLP) applications, such as relation extraction (RE) and information retrieval. The introduction of neural networks and transformer architectures in the last few years made it possible to develop more accurate models in different languages. This work proposes a reproducible setup to build a forensic pipeline for information extraction using NLP of texts. Our results show that it is possible to develop both NER and RE models in any language and tune its hyper-parameters to achieve state-of-art performance and build comprehensive knowledge graphs, decreasing the amount of time required for human supervision and review. We also find that solving this task in phases can further improve the performance, not only for digital investigation applications, but also for general-purpose information extraction and analysis.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Natural Language Processing Applied to Forensics Information Extraction With Transformers and Graph Visualization

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computational Social Systems

Lead the way for us

Journal: IEEE Transactions on Computational Social Systems	Publication Date: Jan 1, 2024
Citations: 6

Similar Papers

BiodiViz: Leveraging NER and RE for Automated Knowledge Graph Generation in Biodiversity Research
Angela Shannen Tan ... Roselyn Gabud
Biodiversity Information Science and Standards | VOL. 8
Angela Shannen Tan, et. al.Angela Shannen Tan ... Roselyn Gabud
29 Oct 2024
Biodiversity Information Science and Standards | VOL. 8

A Joint Learning Model to Extract Entities and Relations for Chinese Literature Based on Self-Attention
Li-Xin Liang ... Lin Lin
Mathematics | VOL. 10
Li-Xin Liang, et. al.Li-Xin Liang ... Lin Lin
24 Jun 2022
Mathematics | VOL. 10

Integrating deep learning architectures for enhanced biomedical relation extraction: a pipeline approach.
M Janina Sarol ... Halil Kilicoglu
Database : the journal of biological databases and curation | VOL. 2024
M Janina Sarol, et. al.M Janina Sarol ... Halil Kilicoglu
28 Aug 2024
Database : the journal of biological databases and curation | VOL. 2024

Development and Validation of a Model to Identify Critical Brain Injuries Using Natural Language Processing of Text Computed Tomography Reports
...
JAMA network open | VOL. 5
, et. al. ...
16 Aug 2022
JAMA network open | VOL. 5

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Natural Language Processing Applied to Forensics Information Extraction With Transformers and Graph Visualization

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computational Social Systems