Fast topic discovery from web search streams

Di Jiang,Wilfred Ng,Kenneth Wai-Ting Leung

doi:10.1145/2566486.2567965

Abstract

Web search involves voluminous data streams that record millions of users' interactions with the search engine. Recently latent topics in web search data have been found to be critical for a wide range of search engine applications such as search personalization and search history warehousing. However, the existing methods usually discover latent topics from web search data in an offline and retrospective fashion. Hence, they are increasingly ineffective in the face of the ever-increasing web search data that accumulate in the format of online streams. In this paper, we propose a novel probabilistic topic model, the Web Search Stream Model (WSSM), which is delicately calibrated for handling two salient features of the web search data: it is in the format of streams and in massive volume. We further propose an efficient parameter inference method, the Stream Parameter Inference (SPI) to efficiently train WSSM with massive web search streams. Based on a large-scale search engine query log, we conduct extensive experiments to verify the effectiveness and efficiency of WSSM and SPI. We observe that WSSM together with SPI discovers latent topics from web search streams faster than the state-of-the-art methods while retaining a comparable topic modeling accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Fast topic discovery from web search streams

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Mining web search topics with diverse spatiotemporal patterns
Di Jiang ... Wilfred Ng
-
Di Jiang, et. al.Di Jiang ... Wilfred Ng
28 Jul 2013
28 Jul 2013

Cross-Lingual Topic Discovery From Multilingual Search Engine Query Log
Di Jiang ... Yongxin Tong
ACM Transactions on Information Systems | VOL. 35
Di Jiang, et. al.Di Jiang ... Yongxin Tong
21 Sep 2016
ACM Transactions on Information Systems | VOL. 35

Predicting Antimicrobial Drug Consumption using Web Search Data
Niels Dalum Hansen ... Ingemar J Cox
-
Niels Dalum Hansen, et. al.Niels Dalum Hansen ... Ingemar J Cox
23 Apr 2018
23 Apr 2018

Multiwave COVID-19 Prediction from Social Awareness Using Web Search and Mobility Data
Jiawei Xue ... Jianzhu Ma
-
Jiawei Xue, et. al.Jiawei Xue ... Jianzhu Ma
14 Aug 2022
14 Aug 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Fast topic discovery from web search streams

Abstract

Talk to us

Similar Papers