Abstract

The China Internet Network Information Centre (CNNIC) published that internet users around the world mostly spent 10-16 hours per week online. For effective advertising and social information publishing on the internet, how to dig out the commercial value from users' online behaviour becomes a new challenge compared with the traditional recommendation system. In this paper, we propose a novel system named 'online commercial intention (OCI) detection system' using users' global web browsing history to predict potential purchasing products on an online shopping platform. A 'commercial keyword dictionary (KD)' that reveals the relationship between user queries and product categories is firstly set up by analysing the click distribution of billion queries on the shopping platform. Footprints of millions of internet users are gathered and the raw page contents are crawled. Keywords in these pages are extracted using N-gram algorithm and commercial probabilities are estimated with query frequency (QF), inverse category frequency (ICF), etc. The page OCI is estimated by merging the KD matrices of its commercial keywords. In order to increase categories' coherence and accuracy, we provide a category similarity model to observe the distance between top N categories. The experiment results show that category prediction accuracy reaches 86% with manual evaluation.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call