Abstract

Conventionally, the Internet consists of three parts: Surface, Deep, and Dark Webs. Inthe last two decades, a massive increase in illicit activities took place on the differentplatforms of the Dark Web. Moreover, social networks on Dark Web implicateextremism dissemination on a wide scale. In this paper, we propose an approach togenerate textual patterns from discussions on Dark Web terrorist forums employingData Mining techniques. The discovered patterns help identify the influential membersand extract critical topics. We describe our system modules that perform datapreprocessing, text preprocessing with TF-IDF weighting, outlier detection, clusteringevaluation, clustering, and clustering validation, implemented with the RapidMinertool. We apply K-Means as the Clustering method with different distance metrics,evaluate the clustering process using Elbow and Silhouette methods, and validate itusing Davies-Bouldin Index. Furthermore, we investigate the effects of altering thedistance metrics for outlier detection on the Clustering results.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.