On the performance of learned data structures

Paolo Ferragina,Fabrizio Lillo,Giorgio Vinciguerra

doi:10.1016/j.tcs.2021.04.015

Paolo Ferragina, Fabrizio Lillo + Show 1 more

Open Access

https://doi.org/10.1016/j.tcs.2021.04.015

Copy DOI

Abstract

A recent trend in algorithm design consists of augmenting classic data structures with machine learning models, which are better suited to reveal and exploit patterns and trends in the input data so to achieve outstanding practical improvements in space occupancy and time efficiency. This is especially known in the context of indexing data structures for big data where, despite few attempts in evaluating their asymptotic efficiency, theoretical results are yet missing in showing that learned indexes are provably better than classic indexes, such as B-tree s and their variants. In this paper, we present the first mathematically-grounded answer to this problem by exploiting a link with a mean exit time problem over a proper stochastic process which, we show, is related to the space and time complexity of these learned indexes. As a corollary of this general analysis, we show that plugging this result in the (learned) PGM-index, we get a learned data structure which is provably better than B-tree s.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Theoretical Computer Science	Publication Date: Apr 28, 2021
Citations: 12	License type: cc-by-nc-nd

R Discovery Prime

R Discovery Prime

On the performance of learned data structures

Abstract

Talk to us

Similar Papers

More From: Theoretical Computer Science

Lead the way for us

Similar Papers

A study of learning likely data structure properties using machine learning models
Muhammad Usman ... Sarfraz Khurshid
International Journal on Software Tools for Technology Transfer | VOL. 22
Muhammad Usman, et. al.Muhammad Usman ... Sarfraz Khurshid
07 Jun 2020
International Journal on Software Tools for Technology Transfer | VOL. 22

Opportunistic data structures with applications
P Ferragina ... G Manzini
-
P Ferragina, et. al.P Ferragina ... G Manzini
12 Nov 2000
12 Nov 2000

Prognosing post-treatment outcomes of head and neck cancer using structured data and machine learning: A systematic review.
Mohammad Moharrami ... Michael Glogauer
PloS one | VOL. 19
Mohammad Moharrami, et. al.Mohammad Moharrami ... Michael Glogauer
24 Jul 2024
PloS one | VOL. 19

EESD special issue: AI and data‐driven methods in earthquake engineering – (Part 1)
Xinzheng Lu ... Henry Burton
Earthquake Engineering & Structural Dynamics | VOL. 52
Xinzheng Lu, et. al.Xinzheng Lu ... Henry Burton
04 May 2023
Earthquake Engineering & Structural Dynamics | VOL. 52

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

On the performance of learned data structures

Abstract

Talk to us

Similar Papers

More From: Theoretical Computer Science