This work is inspired by high-definition (HD) image generation techniques. When the user's interests are viewed as different frames of varying clarity, the unclear parts of one interest frame can be clarified by other interest frames. The user's overall HD interest portrait can be viewed as a fusion of multiple interest frames through detail compensation. Based on this inspiration, we propose a model for generating HD interest portrait called interest frame for recommendation (IF4Rec). First, we present a fine-grained pixel-level user interest mining method, Pixel embedding (PE) uses positional coding techniques to mine atomic-level interest pixel matrices in multiple dimensions, such as time, space, and frequency. Then, using an atomic-level interest pixel matrix, we propose Item2Frame to generate several interest frames for a user. The similarity score of each item is calculated to fill the multi-interest pixel clusters, through an improved self-attention mechanism. Finally, stimulated by HD image generation techniques, we initially present an interest frame noise compensation method. By utilizing the multihead attention mechanism, pixel-level optimization and noise complementation are performed between multi-interest frames, and an HD interest portrait is achieved. Experiments show that our model mines users' interests well. On five publicly available datasets, our model outperforms the baselines.
Read full abstract