Efficient construction of approximate ad-hoc ML models through materialization and reuse

Sona Hasani, Gautam Das, Abolfazl Asudeh, Nick Koudas, Saravanan Thirumuruganathan

doi:10.14778/3236187.3269462

Abstract

Machine learning has become an essential toolkit for complex analytic processing. Data is typically stored in large data warehouses with multiple dimension hierarchies. Often, the data used for building an ML model is aligned on OLAP hierarchies such as location or time. In this paper, we investigate the feasibility of efficiently constructing approximate ML models for new queries from previously constructed ML models by leveraging the concepts of model materialization and reuse. For example, is it possible to construct an approximate ML model for data from the year 2017 if one already has ML models for each of its quarters? We propose algorithms that can support a wide variety of ML models, such as generalized linear models for classification, along with K-Means and Gaussian Mixture models for clustering. We propose a cost-based optimization framework that identifies appropriate ML models to combine at query time, and we conduct extensive experiments on real-world and synthetic datasets. Our results indicate that our framework can support analytic queries on ML models with superior performance, achieving speedups of several orders of magnitude on very large datasets.
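To make the year-from-quarters question concrete, here is a minimal sketch of one way materialized clustering models could be reused. It assumes each quarterly K-Means model is stored as a list of (centroid, point count) pairs, and it builds an approximate yearly model by running weighted K-Means over those stored centroids without touching the raw data. The abstract does not spell out the paper's algorithms, so this illustrates the general idea and is not the authors' method; the function names, the farthest-first initialization, and the toy numbers are all hypothetical.

```python
from math import dist


def weighted_kmeans(points, weights, k, iters=20):
    """Lloyd's algorithm over weighted points; returns (centroids, cluster weights)."""
    # Farthest-first initialization (an illustrative, deterministic choice):
    # start from the heaviest point, then repeatedly add the point that is
    # farthest from the centroids chosen so far.
    first = max(range(len(points)), key=lambda i: weights[i])
    centroids = [points[first]]
    while len(centroids) < k:
        centroids.append(
            max(points, key=lambda p: min(dist(p, c) for c in centroids))
        )
    totals = [0.0] * k
    for _ in range(iters):
        sums = [[0.0] * len(points[0]) for _ in range(k)]
        totals = [0.0] * k
        for p, w in zip(points, weights):
            # Assign each weighted point to its nearest current centroid.
            j = min(range(k), key=lambda c: dist(p, centroids[c]))
            totals[j] += w
            for d, x in enumerate(p):
                sums[j][d] += w * x
        # Move each centroid to the weighted mean of its assigned points.
        centroids = [
            tuple(s / totals[j] for s in sums[j]) if totals[j] > 0 else centroids[j]
            for j in range(k)
        ]
    return centroids, totals


def merge_kmeans_models(models, k):
    """Combine materialized models, each a list of (centroid, count) pairs."""
    points = [c for model in models for c, _ in model]
    weights = [n for model in models for _, n in model]
    return weighted_kmeans(points, weights, k)


# Hypothetical materialized models, one per quarter of a year.
q1 = [((0.0, 0.0), 100), ((10.0, 10.0), 50)]
q2 = [((1.0, 0.0), 100), ((10.0, 11.0), 50)]
q3 = [((0.0, 1.0), 200), ((11.0, 10.0), 50)]
q4 = [((1.0, 1.0), 100), ((9.0, 9.0), 50)]

year_centroids, year_sizes = merge_kmeans_models([q1, q2, q3, q4], k=2)
print(year_centroids, year_sizes)
```

The merge step touches only eight stored centroids rather than the 700 underlying points, which is where the savings from reuse would come from in this toy setting. The result is an approximation, because each stored centroid stands in for all the points it summarizes.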

Journal: Proceedings of the VLDB Endowment, Vol. 11
Publication Date: Jul 1, 2018
Citations: 8

