Formalizing Multimedia Recommendation through Multimodal Deep Learning

Daniele Malitesta, Eugenio Di Sciascio, Giandomenico Cornacchia, Tommaso Di Noia, Claudio Pomo, Felice Antonio Merra

doi:10.1145/3662738

Open Access

https://doi.org/10.1145/3662738

Abstract

Recommender systems (RSs) provide customers with a personalized navigation experience within the vast catalogs of products and services offered on popular online platforms. Despite the substantial success of traditional RSs, recommendation remains a highly challenging task, especially in specific scenarios and domains. For example, human affinity for items described through multimedia content (e.g., images, audio, and text), such as fashion products, movies, and music, is multi-faceted and primarily driven by their diverse characteristics. Therefore, by leveraging all available signals in such scenarios, multimodality enables us to tap into richer information sources and construct more refined user/item profiles for recommendations. Despite the growing number of multimodal techniques proposed for multimedia recommendation, the existing literature lacks a shared and universal schema for modeling and solving the recommendation problem through the lens of multimodality. Given the recent advances in multimodal deep learning for other tasks and scenarios where precise theoretical and applicative procedures exist, we also consider it imperative to formalize a general multimodal schema for multimedia recommendation. In this work, we first provide a comprehensive literature review of multimodal approaches for multimedia recommendation from the last eight years. Second, we outline the theoretical foundations of a multimodal pipeline for multimedia recommendation by identifying and formally organizing recurring solutions/patterns; at the same time, we demonstrate its rationale by conceptually applying it to selected state-of-the-art approaches in multimedia recommendation. Third, we conduct a benchmarking analysis of recent algorithms for multimedia recommendation within Elliot, a rigorous framework for evaluating recommender systems, where we re-implement such multimedia recommendation approaches. 
Finally, we highlight the significant unresolved challenges in multimodal deep learning for multimedia recommendation and suggest possible avenues for addressing them. The primary aim of this work is to provide guidelines for designing and implementing the next generation of multimodal approaches in multimedia recommendation.

Journal: ACM Transactions on Recommender Systems
Publication Date: Apr 29, 2024
Citations: 2
License type: mit

Similar Papers

A survey of multimodal hybrid deep learning for computer vision: Architectures, applications, trends, and challenges
Khaled Bayoudh
Information Fusion | VOL. 105
30 Dec 2023

Multimedia Recommendation System for Video Game Based on High-Level Visual Semantic Features
Fasiha Ikram ... Humera Farooq
Scientific Programming | VOL. 2022
03 Feb 2022

A review of multimodal deep learning methods for genomic-enabled prediction in plant breeding
Osval A Montesinos-López ... José Crossa
Genetics | VOL. -
05 Nov 2024

The multimedia recommendation algorithm based on probability graphical model
Chen Li ... Qian Zhao
Multimedia Tools and Applications | VOL. 81
29 Oct 2020
