Online news portals are one of the main sources of information currently most accessed by the public. With the number of Internet users increasing day by day, online news portal business entrepreneurs must find ways to remain able to compete to attract netizens’ attention to open their news portals. One of the techniques used is to implement clickbait to increase visitor traffic or pageviews. The characteristic of clickbait headlines is that they hide the facts or actual news content in the title section. Clickbait’s goal is to get curious Internet users to click on the link with the headline. This research aims to detect Indonesian-language clickbait headlines automatically by utilizing Indonesian language news headline data obtained from several online news portals with the highest amount of traffic on the Internet. This research uses the Recurrent Neural Network method based on Long Short-Term Memory. The model is built using word embedding to represent text data into vector data. The result of this research is a model with a testing accuracy of 83% and a system that is able to classify headlines whether they are clickbait or not.
Read full abstract