REST: A Thread Embedding Approach for Identifying and Classifying User-Specified Information in Security Forums

Joobin Gharibshah, Michalis Faloutsos, Evangelos E. Papalexakis

doi:10.1609/icwsm.v14i1.7293

Abstract

How can we extract useful information from a security forum? We focus on identifying threads of interest to a security professional: (a) alerts of worrisome events, such as attacks, (b) offers of malicious services and products, (c) hacking information for performing malicious acts, and (d) useful security-related experiences. The analysis of security forums is in its infancy despite several promising recent works. Novel approaches are needed to address the challenges in this domain: (a) the difficulty of specifying the “topics” of interest efficiently, and (b) the unstructured and informal nature of the text. We propose REST, a systematic methodology to: (a) identify threads of interest based on a possibly incomplete bag of words, and (b) classify them into one of the four classes above. The key novelty of the work is a multi-step weighted embedding approach: we project words, threads, and classes into appropriate embedding spaces and establish relevance and similarity there. We evaluate our method with real data from three security forums with a total of 164K posts and 21K threads. First, REST is robust to the initial keyword selection: it can extend the user-provided keyword set and thus recover from missing keywords. Second, REST categorizes the threads into the classes of interest with superior accuracy compared to five other methods: REST exhibits an accuracy between 63.3% and 76.9%. We see our approach as a first step toward harnessing the wealth of information in online forums in a user-friendly way, since the user can loosely specify her keywords of interest.
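
The general idea of comparing words, threads, and classes in an embedding space can be illustrated with a minimal sketch. It is not the REST implementation: the abstract does not give the weighting scheme, the embedding model, or the similarity measure, so the toy 3-dimensional vectors, the weighted-average thread embedding, cosine similarity, the class labels, and the function names `expand_keywords`, `embed`, and `classify` are all assumptions made for illustration.

```python
import math

# Toy word vectors with made-up values (assumption: a real system would
# learn these from forum text with a word-embedding model).
WORD_VECS = {
    "ddos":     [0.9, 0.1, 0.0],
    "attack":   [0.8, 0.2, 0.1],
    "botnet":   [0.7, 0.3, 0.0],
    "sell":     [0.1, 0.9, 0.1],
    "price":    [0.0, 0.8, 0.2],
    "tutorial": [0.1, 0.1, 0.9],
    "guide":    [0.2, 0.0, 0.8],
}


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def expand_keywords(seeds, k=1):
    """Extend a seed keyword set with the k nearest vocabulary words per seed."""
    expanded = set(seeds)
    for seed in seeds:
        if seed not in WORD_VECS:
            continue
        neighbours = sorted(
            (w for w in WORD_VECS if w != seed),
            key=lambda w: cosine(WORD_VECS[seed], WORD_VECS[w]),
            reverse=True,
        )
        expanded.update(neighbours[:k])
    return expanded


def embed(words, weights=None):
    """Weighted average of the known word vectors (default weight 1.0)."""
    weights = weights or {}
    total, mass = [0.0, 0.0, 0.0], 0.0
    for w in words:
        if w in WORD_VECS:
            wt = weights.get(w, 1.0)
            total = [t + wt * x for t, x in zip(total, WORD_VECS[w])]
            mass += wt
    return [t / mass for t in total] if mass else total


def classify(thread_words, class_words, weights=None):
    """Return the class whose embedding is most similar to the thread's."""
    thread_vec = embed(thread_words, weights)
    return max(class_words, key=lambda c: cosine(thread_vec, embed(class_words[c])))


# Hypothetical class descriptions, loosely named after three of the four classes.
CLASSES = {
    "alerts": ["attack"],
    "services": ["sell", "price"],
    "hacks": ["tutorial", "guide"],
}

print(expand_keywords({"ddos"}))
print(classify(["ddos", "botnet"], CLASSES))
print(classify(["sell", "price", "ddos"], CLASSES))
```

In this sketch, a user who supplies only "ddos" would also pick up its nearest neighbour in the toy vocabulary, which mirrors in spirit the abstract's claim that a missing keyword can be recovered. The per-word `weights` argument is a placeholder for whatever weighting the method actually applies.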

Journal: Proceedings of the International AAAI Conference on Web and Social Media
Publication Date: May 26, 2020
Citations: 8

Similar Papers

An Empirical Study of Malicious Threads in Security Forums
Joobin Gharibshah ... Zhabiz Gharibshah
13 May 2019

Extracting actionable information from Security Forums
Joobin Gharibshah ... Michalis Faloutsos
13 May 2019

HGE: Embedding Temporal Knowledge Graphs in a Product Space of Heterogeneous Geometric Subspaces
Jiaxin Pan ... Steffen Staab
Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 38
24 Mar 2024

IKEA: Unsupervised domain-specific keyword-expansion
Joobin Gharibshah ... Jakapun Tachaiya
10 Nov 2022
