Previous personalized hashtag recommendations have been able to recommend suitable hashtags for a given microblog. Despite their performance improvement, we argue that three challenges remain unexplored. <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">First</i> , prior studies capture user interests solely from user-hashtag interactions that are directly connected (i.e., first-order relations), making them unable to deal with multiple user behaviors, including user-user social and hashtag-hashtag co-occurrence, and also restrict relations from similar users that are indirectly connected (i.e., high-order relations). <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">Second</i> , previous works personalize content at the microblog level, ignoring the personalized aspects that users have for each word in the microblog. <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">Third</i> , past studies capture correlations among hashtags in the same microblog from only the left-side correlations, restricting the right-side correlations. In this paper, we propose a novel integral model for personalized hashtag recommendation named PAC-MAN, which explores high-order multiple relations to model fruitful user and hashtag representation before fusing with word representation for word-level personalization and integrating with sequenceless hashtag correlation for the recommendation. <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">First</i> , to derive fruitful user and hashtag representation from higher-order multiple relations, Multi-relational Attentive Network (MAN) applies GNN to jointly capture relations on three communities: (1) user-hashtag interaction; (2) user-user social; and (3) hashtag-hashtag co-occurrence. <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">Second</i> , to personalize content at a word level, Person-And-Content based BERT (PAC) extends BERT to input not only word representations from the microblog but also the fruitful user representation from MAN, allowing each word to be fused with user aspects. <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">Finally</i> , to capture sequenceless hashtag correlations, the fruitful hashtag representations from MAN that contain the hashtag’s community perspectives are inserted into BERT to integrate with the hashtag’s word-semantic perspectives, and a hashtag prediction task is then conducted under the mask concept, enabling hashtag correlations to be obtained from both left and right sides without sequence constraints. Extensive experiments on the Twitter dataset demonstrate that PAC-MAN consistently outperforms state-of-the-art methods, including neural network based and traditional graph based methods, over precision, recall, and F1-score metrics.
Read full abstract