Abstract
Large-scale machine learning (ML) algorithms are often iterative, using repeated read-only data access and I/O-bound matrix-vector multiplications to converge to an optimal model. It is crucial for performance to fit the data into single-node or distributed main memory. General-purpose, heavy- and lightweight compression techniques struggle to achieve both good compression ratios and fast decompression speed to enable block-wise uncompressed operations. Hence, we initiate work on compressed linear algebra (CLA), in which lightweight database compression techniques are applied to matrices, and linear algebra operations such as matrix-vector multiplication are executed directly on the compressed representations. We contribute effective column compression schemes, cache-conscious operations, and an efficient sampling-based compression algorithm. Our experiments show that CLA achieves in-memory operations performance close to the uncompressed case and good compression ratios that allow us to fit larger datasets into available memory. We thereby obtain significant end-to-end performance improvements of up to 26x, or reduced memory requirements.
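To make the core idea concrete, here is a minimal Python sketch of operating directly on compressed data. It is not the paper's implementation (CLA uses column group encodings such as offset-list and run-length schemes with cache-conscious operators); the names `compress_column` and `matvec_compressed` are illustrative. Each column is encoded as distinct-value/row-offset pairs, and the matrix-vector product scales each distinct value once before scattering it to its rows, so no decompression is needed:

```python
import numpy as np

def compress_column(col):
    # Group row indices by distinct value: a simplified
    # offset-list encoding of one column.
    groups = {}
    for i, v in enumerate(col):
        groups.setdefault(v, []).append(i)
    return [(v, np.array(rows)) for v, rows in groups.items()]

def compress_matrix(X):
    # Compress every column independently.
    return [compress_column(X[:, j]) for j in range(X.shape[1])]

def matvec_compressed(compressed, v, n_rows):
    # Compute y = X @ v on the compressed columns: each distinct
    # value is multiplied once, then added to all of its rows.
    y = np.zeros(n_rows)
    for j, col in enumerate(compressed):
        for value, rows in col:
            y[rows] += value * v[j]
    return y

# Usage: columns with few distinct values compress well and
# multiply quickly.
X = np.array([[1., 0.], [1., 7.], [0., 7.], [1., 0.]])
v = np.array([2., 3.])
C = compress_matrix(X)
print(matvec_compressed(C, v, X.shape[0]))  # matches X @ v
print(X @ v)
```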