Abstract

The system for filtering spam posts on social media is preferred to obtain the relevant content and expected by users. The previous works on spam detection have been done to filter irrelevant content on email and social media based on text or image separately. Due to the social media posts are commonly in the form of image, text, or both, the multimodal data is preferred to improve the capability of system in handling filtering content on social media. In addition, a spam post containing multimodal data sometimes does not indicate spam in both data but only one. To improve the performance of system, we propose a weighted multimodal approach for filtering content from spam posts in social media using Convolutional Neural Network (CNN). The mechanism of weighted multimodal is by weighting of spam prediction results from image and text data. We also investigate the performance of CNN architectures for spam post detection that are 3-layer, 5-layer, AlexNet and VGG16. The performance of each architectures is evaluated by 8000 Indonesian posts in the form of image and text taken from Instagram posts. The results show that the highest accuracy achieves 0.9850 based on the combination of image and text by using a 5-layer architecture. The average accuracy of all CNN architectures using multimodal data is higher than only using image and text data separately.

Highlights

  • Spam is the use of electronic devices to transmit nonrelevant messages or information to a wide number of recipients

  • A multimodal data is expected to improve the accuracy of spam detection

  • The highest accuracy of image data is obtained by VGG16 architecture and the accuracy is 0.8475

Read more

Summary

Introduction

Spam is the use of electronic devices to transmit nonrelevant messages or information to a wide number of recipients. Social media is one of technology that is currently widely used by the community as a way of exchanging information and moments in the form of message, image and videos. This social media capability allowed the irrelevant content, such as ads, to be distributed. That causes a lot of irrelevant information that is not expected by social media users. An automatic application for detecting spam in the social media to obtain the information that is useful and expected by users is preferred. Spam on social media may be in the form of comments or posts which the receiver or user does not expected

Objectives
Methods
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.