Stacking-Based Ensemble Learning of Self-Media Data for Marketing Intention Detection

Yufeng Wang,Shuangrong Liu,Jidong Duan,Kun Ma,Jia Yu,Songqian Li,Zhihao Hou

doi:10.3390/fi11070155

Abstract

Social network services for self-media, such as Weibo, Blog, and WeChat Public, constitute a powerful medium that allows users to publish posts every day. Due to insufficient information transparency, malicious marketing of the Internet from self-media posts imposes potential harm on society. Therefore, it is necessary to identify news with marketing intentions for life. We follow the idea of text classification to identify marketing intentions. Although there are some current methods to address intention detection, the challenge is how the feature extraction of text reflects semantic information and how to improve the time complexity and space complexity of the recognition model. To this end, this paper proposes a machine learning method to identify marketing intentions from large-scale We-Media data. First, the proposed Latent Semantic Analysis (LSI)-Word2vec model can reflect the semantic features. Second, the decision tree model is simplified by decision tree pruning to save computing resources and reduce the time complexity. Finally, this paper examines the effects of classifier associations and uses the optimal configuration to help people efficiently identify marketing intention. Finally, the detailed experimental evaluation on several metrics shows that our approaches are effective and efficient. The F1 value can be increased by about 5%, and the running time is increased by 20%, which prove that the newly-proposed method can effectively improve the accuracy of marketing news recognition.

Highlights

BackgroundNew media are forms of media that are native to computers and mobile phones for redistribution
This paper has proposed an efficient ensemble learning method to identify marketing intention, which is used to find the marketing news on the Internet and provide a reference value for identifying marketing news
Latent Semantic Analysis (LSI) stands for the text set as an m × n-dimensional matrix |X|, where m is the size of the dictionary, n is the number of texts, and the element (i, j) of the matrix is the rate of the ith word in the jth text

Summary

Background

New media are forms of media that are native to computers and mobile phones for redistribution. Some examples of new media are virtual worlds, social networking, Internet-oriented websites, and self-media platforms [1]. The algorithm needs to train the model according to the news content that has been marked with the correct category and use the model to classify the news of unknown categories automatically. Against this background, this paper has proposed an efficient ensemble learning method to identify marketing intention, which is used to find the marketing news on the Internet and provide a reference value for identifying marketing news

Challenges

Contributions

Organization

Feature Extraction

Text Classification

Approach

Preprocessing

LSI-Word2vec

Extraction Processing

Stacking

Experimental Setup

Experimental Parameters

Metrics

Findings

Conclusions

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Future Internet	Publication Date: Jul 10, 2019
Citations: 21	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Stacking-Based Ensemble Learning of Self-Media Data for Marketing Intention Detection

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Future Internet

Lead the way for us

Similar Papers

Performance Analysis of Air Pollution Classification Prediction Map with Decision Tree and ANN
Rizky Fauzi Ramadhani ... Yuliant Sibaroni
Journal of Computer System and Informatics (JoSYC) | VOL. 3
Rizky Fauzi Ramadhani, et. al.Rizky Fauzi Ramadhani ... Yuliant Sibaroni
05 Sep 2022
Journal of Computer System and Informatics (JoSYC) | VOL. 3

Modeling the organic matter of water using the decision tree coupled with bootstrap aggregated and least-squares boosting
Hichem Tahraoui ... Abdeltif Amrane
Environmental Technology & Innovation | VOL. 27
Hichem Tahraoui, et. al.Hichem Tahraoui ... Abdeltif Amrane
01 Aug 2022
Environmental Technology & Innovation | VOL. 27

Early Recognition of the Preference for Exclusive Breastfeeding in Current China: A Prediction Model based on Decision Trees
Yiting Wang ... Yingying Zhang
Scientific Reports | VOL. 10
Yiting Wang, et. al.Yiting Wang ... Yingying Zhang
21 Apr 2020
Scientific Reports | VOL. 10

Decision tree models as a classifier of endothelial function based on strength, pulmonary and cardiac function in COPD: Preliminary analysis
Nathany Souza Schafauser ... Bruna Shara Vidal De Oliveira
-
Nathany Souza Schafauser, et. al.Nathany Souza Schafauser ... Bruna Shara Vidal De Oliveira
07 Sep 2020
07 Sep 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Stacking-Based Ensemble Learning of Self-Media Data for Marketing Intention Detection

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Future Internet