Sequential One-step Estimator by Sub-sampling for Customer Churn Analysis with Massive Data sets

Feifei Wang,Shuyuan Wu,Tianchen Gao,Hansheng Wang,Danyang Huang

doi:10.1111/rssc.12597

Abstract

AbstractCustomer churn is one of the most important concerns for large companies. Currently, massive data are often encountered in customer churn analysis, which bring new challenges for model computation. To cope with these concerns, sub-sampling methods are often used to accomplish data analysis tasks of large scale. To cover more informative samples in one sampling round, classic sub-sampling methods need to compute sampling probabilities for all data points. However, this method creates a huge computational burden for data sets of large scale and therefore, is not applicable in practice. In this study, we propose a sequential one-step (SOS) estimation method based on repeated sub-sampling data sets. In the SOS method, data points need to be sampled only with probabilities, and the sampling step is conducted repeatedly. In each sampling step, a new estimate is computed via one-step updating based on the newly sampled data points. This leads to a sequence of estimates, of which the final SOS estimate is their average. We theoretically show that both the bias and the standard error of the SOS estimator can decrease with increasing sub-sampling sizes or sub-sampling times. The finite sample SOS performances are assessed through simulations. Finally, we apply this SOS method to analyse a real large-scale customer churn data set in a securities company. The results show that the SOS method has good interpretability and prediction power in this real application.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Sequential One-step Estimator by Sub-sampling for Customer Churn Analysis with Massive Data sets

Abstract

Talk to us

Similar Papers

More From: Journal of the Royal Statistical Society: Series C (Applied Statistics)

Lead the way for us

Journal: Journal of the Royal Statistical Society: Series C (Applied Statistics)	Publication Date: Sep 19, 2022
Citations: 1

Similar Papers

Analysis of the business customer churn based on decision tree method
Hongxia Ma ... Jianxia Wang
-
Hongxia Ma, et. al.Hongxia Ma ... Jianxia Wang
01 Aug 2009
01 Aug 2009

Analysis of Customer Churn in Telecommunication Industry with Machine Learning Methods
Özge Nalan Bi̇li̇şi̇k ... Damla Tuğba Sarp
Düzce Üniversitesi bilim ve teknoloji dergisi | VOL. 11
Özge Nalan Bi̇li̇şi̇k, et. al.Özge Nalan Bi̇li̇şi̇k ... Damla Tuğba Sarp
24 Oct 2023
Düzce Üniversitesi bilim ve teknoloji dergisi | VOL. 11

Using Machine Learning Algorithms to Analyze Customer Churn in the Software as a Service (SaaS) Industry
Levent ÇALLI ... Sena KASIM
Academic Platform Journal of Engineering and Smart Systems | VOL. 10
Levent ÇALLI, et. al.Levent ÇALLI ... Sena KASIM
30 Sep 2022
Academic Platform Journal of Engineering and Smart Systems | VOL. 10

Extracting Knowledge from Massive Astronomical Data Sets
Massimo Brescia ... Maurizio Paolillo
-
Massimo Brescia, et. al.Massimo Brescia ... Maurizio Paolillo
01 Jan 2012
01 Jan 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Sequential One-step Estimator by Sub-sampling for Customer Churn Analysis with Massive Data sets

Abstract

Talk to us

Similar Papers

More From: Journal of the Royal Statistical Society: Series C (Applied Statistics)