Abstract

Predicting the popularity of web content is widely regarded as an important but challenging task. Online news articles are typical examples of this. In particular, owing to their time-sensitive nature, it is preferable to predict the popularity of news articles before their publication. To achieve this, this study proposes a named entity topic model (NETM) to extract the textual factors that can drive popularity growth. Here, each named entity is assumed to have a popularity-gain distribution over all semantic topics. The popularity of a news article is considered as the accumulation of popularity gains generated by its named entities (NEs) over all the topics. By learning the popularity-gain matrix for each named entity, the popularity of any news article can be predicted. Experiments on two collections of news articles demonstrate that the proposed NETM can outperform existing models in terms of accuracy. Additionally, the popularity-gain matrix learned by the NETM can be used to effectively explain the popularity of specific news articles.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.