Pre-trained Transformer-based Models Research Articles

Deep learning-based recommender systems have gained much attention due to the advantage of encoding content-based information, such as user textual reviews and item descriptions, images, or videos, without the trouble of manually crafting feature vectors. However, those systems are trained from scratch with randomly initialized parameters, where the training process can take a long time to converge. With the most recent breakthroughs in Natural Language Processing using transfer learning, pre-trained transformer-based models now provide a better foundation for textual information encoding. This inspires us to propose a transformer-based recommender system using transfer learning. As the first core contribution in this work, we apply transfer learning to the system, by fine-tuning the pre-trained transformer models for information encoding. The experiment result shows that the proposed system outperforms several other deep learning-based recommender systems on multiple datasets. As the second core contribution, we propose a novel user vector encoding algorithm that assists all the models to achieve a better performance, when the user content information is not available.

The increasing popularity of social media has made the creation and spread of rumors much easier. Widespread rumors on social media could cause devastating damages to society and individuals. Automatically detecting rumors in a timely manner is greatly needed but also very challenging technically. In this paper, we propose a new deep feature fusion method that employs the linguistic characteristics of the source tweet text and the underlying patterns of the propagation tree of the source tweet for Twitter rumor detection. Specifically, the pre-trained Transformer-based model is applied to extract context-sensitive linguistic features from the short source tweet text. A novel sequential encoding method is proposed to embed the propagation tree of a source tweet into the vector space. A convolutional neural network (CNN) architecture is then developed to extract temporal-structural features from the encoded propagation tree. The performance of the proposed deep feature fusion method is evaluated with two public Twitter rumor datasets. The results demonstrate that the proposed method achieves significantly better detection performance than other state-of-the-art baseline methods.

Pre-trained Transformer-based Models Research Articles

Articles published on Pre-trained Transformer-based Models

Performing forced alignment with Wav2vec 2.0

Making recommendations using transfer learning

Deep Feature Fusion for Rumor Detection on Twitter

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Pre-trained Transformer-based Models Research Articles

Articles published on Pre-trained Transformer-based Models

Performing forced alignment with Wav2vec 2.0

Making recommendations using transfer learning

Deep Feature Fusion for Rumor Detection on Twitter