Abstract

In recommender systems, the scarcity of interaction data between users and items leads to data sparsity and cold-start problems. Recently, interest modeling frameworks that incorporate multi-modal features have been widely used in recommendation algorithms. These algorithms use image and text features to enrich the available information, which effectively alleviates data sparsity, but they also have limitations. On the one hand, the multi-modal features of user interaction sequences are not considered in the interest modeling process. On the other hand, multi-modal features are often aggregated with simple operators, such as summation and concatenation, which do not distinguish the importance of different feature interactions. To address these issues, we propose the FVTF (Fusing Visual and Textual Features) algorithm. First, we design a user history visual preference extraction module based on Query-Key-Value attention to model users' historical interests from visual features. Second, we design a feature fusion and interaction module based on multi-head bit-wise attention to adaptively mine important feature combinations and update the higher-order attention fusion representation of the features. We conduct experiments on the MovieLens-1M dataset, and the results show that FVTF achieves the best performance compared with benchmark recommendation algorithms.
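
The sketch below is an illustrative reading of the two modules named in the abstract, not the authors' implementation: a scaled dot-product Query-Key-Value attention in which the target item's visual feature attends over the visual features of the user's interaction history, followed by a multi-head bit-wise (element-wise) attention that reweights each dimension of the concatenated visual/textual representation. All dimension names and the averaging across heads are assumptions made for illustration.

```python
# Hypothetical sketch of the two attention modules described in the abstract.
# Dimensions (visual_dim, attn_dim, num_heads) and the head-averaging step are
# assumptions for illustration, not details taken from the paper.
import torch
import torch.nn as nn
import torch.nn.functional as F


class HistoryVisualPreference(nn.Module):
    """QKV attention: the target item's visual feature queries the history sequence."""

    def __init__(self, visual_dim: int, attn_dim: int):
        super().__init__()
        self.q = nn.Linear(visual_dim, attn_dim)
        self.k = nn.Linear(visual_dim, attn_dim)
        self.v = nn.Linear(visual_dim, attn_dim)

    def forward(self, target_visual, history_visual):
        # target_visual: (B, visual_dim); history_visual: (B, L, visual_dim)
        q = self.q(target_visual).unsqueeze(1)                 # (B, 1, attn_dim)
        k = self.k(history_visual)                             # (B, L, attn_dim)
        v = self.v(history_visual)                             # (B, L, attn_dim)
        scores = (q @ k.transpose(1, 2)) / k.size(-1) ** 0.5   # (B, 1, L)
        weights = F.softmax(scores, dim=-1)
        return (weights @ v).squeeze(1)                        # (B, attn_dim)


class BitWiseAttentionFusion(nn.Module):
    """Multi-head bit-wise attention: each head learns per-dimension weights."""

    def __init__(self, feature_dim: int, num_heads: int = 4):
        super().__init__()
        self.heads = nn.ModuleList(
            [nn.Sequential(nn.Linear(feature_dim, feature_dim), nn.Sigmoid())
             for _ in range(num_heads)]
        )

    def forward(self, fused):
        # fused: (B, feature_dim) concatenation of visual and textual features
        reweighted = [gate(fused) * fused for gate in self.heads]   # element-wise gating
        return torch.stack(reweighted, dim=1).mean(dim=1)           # (B, feature_dim)


if __name__ == "__main__":
    B, L, vd = 2, 10, 64
    visual_attn = HistoryVisualPreference(visual_dim=vd, attn_dim=32)
    fusion = BitWiseAttentionFusion(feature_dim=32 + 48)
    user_pref = visual_attn(torch.randn(B, vd), torch.randn(B, L, vd))  # (B, 32)
    text_feat = torch.randn(B, 48)                                      # textual features
    out = fusion(torch.cat([user_pref, text_feat], dim=-1))             # (B, 80)
    print(out.shape)
```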
