Identifying malicious social media contents using multi-view Context-Aware active learning

Sreyasee Das Bhattacharjee, William J Tolone, Ved Suhas Paranjape

doi:10.1016/j.future.2019.03.015

Abstract

This paper presents a semi-supervised, multi-view active learning method that uses an optimized set of the most informative samples, together with domain-specific context information, to identify malicious forum content on web-based social media platforms efficiently and effectively. As research shows, automated identification of malicious forum posts, which also helps detect the key suspects associated with them, faces several challenges: (1) online data, particularly social media data, originate from diverse and heterogeneous sources and are largely unstructured; (2) the characteristics of online data evolve quickly; and (3) ground truth data are too limited to support effective classification in a strictly supervised setting. To address these challenges, the proposed human–machine collaborative, semi-supervised learning method identifies harmful, provocative, or fabricated forum content from only a small number of annotated samples. The framework first trains view-dependent classifiers on a limited labeled collection. Each classifier then evolves interactively into a more sophisticated model by observing data patterns in a shared shortlist of the most informative samples, which is identified through a graph-based optimization solved by a maximum flow algorithm. By defining a context-rich metric in a data-driven manner, the framework learns a sufficiently robust classification model using only a small number of human-annotated samples, typically 1–2 orders of magnitude fewer than a fully supervised solution requires. We validate the method on a large collection of flagged words of widely varying origin that appear frequently in web-based forums and were manually verified by multiple experienced, independent domain experts.
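
The loop the abstract describes (view-dependent classifiers, a shared shortlist of informative samples, human annotation, retraining) can be illustrated with a minimal sketch. It is not the authors' implementation. As a simplifying assumption, the shortlist here is chosen by summed margin-based uncertainty across views, not by the graph-based optimization and maximum flow algorithm used in the paper, and the per-view model is a toy nearest-centroid classifier. All names (`CentroidClassifier`, `shortlist`, `active_learn`, `make_toy_views`) and the synthetic two-view data are hypothetical.

```python
import math
import random


class CentroidClassifier:
    """Toy per-view binary classifier (labels 0 and 1): nearest class centroid."""

    def fit(self, X, y):
        # Assumes both labels 0 and 1 appear in y.
        self.centroids = {}
        for label in set(y):
            rows = [x for x, t in zip(X, y) if t == label]
            self.centroids[label] = [sum(col) / len(rows) for col in zip(*rows)]
        return self

    def margin(self, x):
        # Positive: closer to the class-1 centroid. Near zero: uncertain.
        return math.dist(x, self.centroids[0]) - math.dist(x, self.centroids[1])

    def predict(self, x):
        return 1 if self.margin(x) > 0 else 0


def fit_views(views, labeled):
    """Train one classifier per view on the currently labeled indices."""
    idx = sorted(labeled)
    y = [labeled[i] for i in idx]
    return [CentroidClassifier().fit([v[i] for i in idx], y) for v in views]


def shortlist(classifiers, views, unlabeled, k):
    """Shared shortlist: the k samples with the smallest summed |margin| over views.

    Assumption: this uncertainty heuristic stands in for the paper's
    graph-based, max-flow selection, which is not reproduced here.
    """
    def score(i):
        return sum(abs(c.margin(v[i])) for c, v in zip(classifiers, views))
    return sorted(unlabeled, key=score)[:k]


def active_learn(views, oracle, seed_idx, rounds, k):
    """Query the oracle (the human annotator) for k samples per round, then retrain."""
    labeled = {i: oracle(i) for i in seed_idx}  # seed must cover both classes
    classifiers = fit_views(views, labeled)
    for _ in range(rounds):
        unlabeled = [i for i in range(len(views[0])) if i not in labeled]
        if not unlabeled:
            break
        for i in shortlist(classifiers, views, unlabeled, k):
            labeled[i] = oracle(i)
        classifiers = fit_views(views, labeled)
    return classifiers, labeled


def make_toy_views(n, seed=0):
    """Synthetic data: two feature views of the same n samples (hypothetical)."""
    rng = random.Random(seed)
    truth = [i % 2 for i in range(n)]
    view_a = [[rng.gauss(3.0 * t, 1.0), rng.gauss(3.0 * t, 1.0)] for t in truth]
    view_b = [[rng.gauss(-2.0 * t, 1.0)] for t in truth]
    return [view_a, view_b], truth


if __name__ == "__main__":
    views, truth = make_toy_views(200)
    classifiers, labeled = active_learn(
        views, lambda i: truth[i], seed_idx=[0, 1], rounds=5, k=4
    )
    print(f"labels requested: {len(labeled)} of {len(truth)}")
```

In this sketch the oracle is a lookup into known labels; in the setting the abstract describes, that role is played by a human annotator. Because the shortlist is shared, each queried label is used to retrain every view's classifier.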

Journal: Future Generation Computer Systems
Publication Date: May 11, 2019
Citations: 23

Similar Papers

Identifying extremism in social media with multi-view context-aware subset optimization
Sreyasee Das Bhattacharjee ... Bala Venkatram Balantrapu
01 Dec 2017

WLAN monopole antenna design by Siamese convolutional neural network and KNN exploiting Gaussian process
Yubo Tian ... J Zhu
MATEC Web of Conferences | VOL. 395
01 Jan 2024

Construction of image processing procedures from a small number of learning samples using the IMPRESS vision expert system
Toshihiro Hamada ... Jun‐Ichi Hasegawa
Electronics and Communications in Japan (Part II: Electronics) | VOL. 87
07 Oct 2004

A speaker‐adaptation technique for context‐dependent models represented by hidden markov networks
Jun-Ichi Takami ... Shigeki Sagayama
Systems and Computers in Japan | VOL. 27
01 Jan 1996
