Optimal Subsampling for Data Streams with Measurement Constrained Categorical Responses

Jun Yu, Zhiqiang Ye, Mingyao Ai, Ping Ma

doi:10.1080/10618600.2024.2421990

Abstract

High-velocity, large-scale data streams have become pervasive. Frequently, the associated labels for such data prove costly to measure and are not always available upfront. Consequently, the analysis of such data poses a significant challenge. In this article, we develop a method that addresses this challenge by employing an online subsampling procedure and a multinomial logistic model for efficient analysis of high-velocity, large-scale data streams. Our algorithm is designed to sequentially update parameter estimation based on the A-optimality criterion. Moreover, it significantly increases computational efficiency while imposing minimal storage requirements. Theoretical properties are rigorously established to quantify the asymptotic behavior of the estimator. The method’s efficacy is further demonstrated through comprehensive numerical studies on both simulated and real-world datasets.
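
As a rough illustration of the setting described in the abstract, the sketch below selects points from a stream using only their covariates and queries the costly label only for the selected points. It is a minimal sketch under stated assumptions, not the authors' algorithm: the function name `poisson_subsample_stream`, the `label_oracle` callable, the `rate` parameter, and the norm-based score are all hypothetical, and the score is a simple stand-in for the A-optimality-based probabilities that the paper derives.

```python
import math
import random


def poisson_subsample_stream(stream, label_oracle, rate, seed=0):
    """Select points from a stream, querying labels only for selected points.

    stream: iterable of covariate vectors (lists of floats).
    label_oracle: hypothetical callable returning the costly label for x.
    rate: target labeling fraction, 0 < rate <= 1 (an assumption of this sketch).
    Returns a list of (x, y, weight) with weight = 1 / inclusion probability.
    """
    rng = random.Random(seed)
    selected = []
    norm_sum = 0.0
    for n, x in enumerate(stream, start=1):
        norm = math.sqrt(sum(v * v for v in x))
        norm_sum += norm
        mean_norm = norm_sum / n  # running mean, so no past data is stored
        # Stand-in score: covariate norm relative to the running mean norm.
        # This is NOT the paper's A-optimality criterion.
        p = min(1.0, rate * norm / mean_norm) if mean_norm > 0 else rate
        # Poisson sampling: each point is kept independently with probability p.
        if rng.random() < p:
            y = label_oracle(x)  # the label is measured only after selection
            selected.append((x, y, 1.0 / p))
    return selected
```

The returned weights could feed an inverse-probability-weighted multinomial logistic fit; only the selected points and one running sum are kept in memory, which mirrors the low-storage goal stated in the abstract.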

Journal of Computational and Graphical Statistics | VOL. just-accepted
29 Oct 2024
