Distributed numerical and machine learning computations via two-phase execution of aggregated join trees

Dimitrije Jankov, Chris Jermaine, Shangyu Luo, Binhang Yuan

doi:10.14778/3450980.3450991

Journal: Proceedings of the VLDB Endowment
Publication Date: Mar 1, 2021
Citations: 6

Affiliation: Rice University

Keywords: Lineage Information, Numerical Computations

Abstract

When numerical and machine learning (ML) computations are expressed relationally, classical query execution strategies (hash-based joins and aggregations) can do a poor job distributing the computation. In this paper, we propose a two-phase execution strategy for numerical computations that are expressed relationally, as aggregated join trees (that is, expressed as a series of relational joins followed by an aggregation). In a pilot run, lineage information is collected; this lineage is used to optimally plan the computation at the level of individual records. Then, the computation is actually executed. We show experimentally that a relational system making use of this two-phase strategy can be an excellent platform for distributed ML computations.
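To make the idea of an aggregated join tree and the two-phase strategy concrete, the following is a minimal Python sketch, not the authors' system. It expresses a matrix multiply as a join of `A(i, k)` with `B(k, j)` on `k`, followed by a SUM aggregation grouped by `(i, j)`. The function names `pilot_run`, `plan`, and `execute` are hypothetical, the scalar values stand in for matrix tiles, and the greedy load-balancing planner is an assumption chosen for brevity; the abstract does not describe the actual planning algorithm.

```python
from collections import defaultdict

# Relations keyed by block index. Values stand in for matrix tiles
# (plain scalars here, purely for illustration).
A = {(0, 0): 1.0, (0, 1): 2.0, (1, 0): 3.0, (1, 1): 4.0}  # A(i, k)
B = {(0, 0): 5.0, (0, 1): 6.0, (1, 0): 7.0, (1, 1): 8.0}  # B(k, j)


def pilot_run(a_keys, b_keys):
    """Phase 1: join on keys only and record, for each output group (i, j),
    which pairs of input records feed it (the lineage)."""
    b_by_k = defaultdict(list)
    for (k, j) in b_keys:
        b_by_k[k].append((k, j))
    lineage = defaultdict(list)
    for (i, k) in a_keys:
        for (k2, j) in b_by_k[k]:
            lineage[(i, j)].append(((i, k), (k2, j)))
    return dict(lineage)


def plan(lineage, num_workers):
    """Assign each aggregation group to the least-loaded worker.
    This is a simple greedy heuristic (an assumption), not an optimal planner."""
    load = [0] * num_workers
    assignment = {}
    for group in sorted(lineage, key=lambda g: -len(lineage[g])):
        worker = min(range(num_workers), key=lambda w: load[w])
        assignment[group] = worker
        load[worker] += len(lineage[group])
    return assignment


def execute(lineage, assignment, a, b, num_workers):
    """Phase 2: each (simulated) worker evaluates only the join pairs
    assigned to it and aggregates them locally."""
    per_worker = [dict() for _ in range(num_workers)]
    for group, pairs in lineage.items():
        worker = assignment[group]
        per_worker[worker][group] = sum(a[ak] * b[bk] for ak, bk in pairs)
    merged = {}
    for partial in per_worker:
        merged.update(partial)
    return merged


lineage = pilot_run(A.keys(), B.keys())
assignment = plan(lineage, num_workers=2)
C = execute(lineage, assignment, A, B, num_workers=2)
```

The pilot run touches only keys, so it is cheap relative to the numerical work, and the plan is made per record rather than per hash partition. In a real distributed setting the plan would also have to account for the cost of moving tiles between machines, which this sketch ignores.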
