Content-based recommendation for podcast audio-items using natural language processing techniques

Zhou Xing, Marzieh Parandehgheibi, Fei Xiao, Nilesh Kulkarni, Chris Pouliot

doi:10.1109/bigdata.2016.7840872

Abstract

A podcast combines the liveliness of an FM radio channel with the economy of internet blog posting. Podcasts are especially convenient in scenarios with limited internet connectivity, for example in the car or at the gym. Because both the volume and the heterogeneity of content are huge, it becomes operationally difficult to manually categorize or tag these audio items and thus to manage them in a system where users can discover them. Furthermore, because the metadata associated with the audio is incomplete, there are not enough features for a typical recommender system to learn item similarities and thus make recommendations. In this paper we propose and examine a novel approach to generating latent embeddings for podcast items, utilizing the aggregated information from all the text-based features associated with the audio items. These embeddings, which are generated using well-established Natural Language Processing (NLP) techniques, can be used to measure or indicate the content similarity among podcast items. Both GPU (CUDA) and CPU computing architectures are tested and benchmarked for model training and for cross-validation of the content predictions on large-scale datasets.

Full Text