Lossy Conservative Update (LCU) Sketch: Succinct Approximate Count Storage

Amit Goyal, Hal Daume

doi:10.1609/aaai.v25i1.7976

Abstract

In this paper, we propose a variant of the conservative-update Count-Min sketch that further reduces its over-estimation error. Inspired by ideas from lossy counting, we divide a stream of items into multiple windows and decrement certain counts in the sketch at window boundaries. We refer to this approach as lossy conservative update (LCU). The reduction in over-estimation error comes at the cost of introducing under-estimation error in counts. However, our intrinsic evaluations show that the reduction in over-estimation is much greater than the under-estimation error that LCU introduces. We apply the LCU framework to scale distributional similarity computations to web-scale corpora, and show that it is more efficient in memory and time, and more robust, than the conservative-update Count-Min (CU) sketch on this task.
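
The idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the class name LossyCUSketch, the parameters depth, width and window_size, the BLAKE2-based hashing, and in particular the decrement rule (subtract 1 from every non-zero cell whose value does not exceed the current window index) are all assumptions chosen for illustration. The abstract only says that "certain counts" are decremented at window boundaries.

```python
import hashlib


class LossyCUSketch:
    """Illustrative Count-Min sketch with conservative update and a
    lossy decrement at window boundaries (assumed rule, see _end_window)."""

    def __init__(self, depth=3, width=1024, window_size=1000):
        self.depth = depth              # number of hash rows
        self.width = width              # counters per row
        self.window_size = window_size  # stream items per window
        self.table = [[0] * width for _ in range(depth)]
        self.items_seen = 0
        self.window_id = 1              # index of the current window

    def _cells(self, item):
        # One (row, column) cell per row, using a row-salted hash.
        for row in range(self.depth):
            digest = hashlib.blake2b(
                f"{row}:{item}".encode("utf-8"), digest_size=8
            ).digest()
            yield row, int.from_bytes(digest, "big") % self.width

    def query(self, item):
        # Count-Min estimate: the minimum over the item's cells.
        return min(self.table[r][c] for r, c in self._cells(item))

    def update(self, item, count=1):
        # Conservative update: raise only the cells that are below
        # (current estimate + count), instead of incrementing all cells.
        cells = list(self._cells(item))
        target = min(self.table[r][c] for r, c in cells) + count
        for r, c in cells:
            if self.table[r][c] < target:
                self.table[r][c] = target
        self.items_seen += 1
        if self.items_seen % self.window_size == 0:
            self._end_window()

    def _end_window(self):
        # ASSUMED lossy rule: decrement small non-zero counters, i.e.
        # those whose value is at most the current window index.
        for row in self.table:
            for col, value in enumerate(row):
                if 0 < value <= self.window_id:
                    row[col] = value - 1
        self.window_id += 1
```

Under this assumed rule, an item seen once in a window is forgotten at the boundary, which is the source of the under-estimation the abstract mentions, while an item seen more often than the window index keeps its count. Setting window_size larger than the stream length disables the lossy step and leaves a plain conservative-update sketch, which never under-estimates.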


Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence
Publication Date: Aug 4, 2011
Citations: 10

Similar Papers

Implementation of Hashing Algorithms in Stream Mining
Edi Muskardin ... Maja Matetic
01 Oct 2018

FCM-sketch
Cha Hwan Song ... Pravein Govindan Kannan
23 Nov 2020

Soft Error Tolerant Count Min Sketches
Pedro Reviriego ... Marco Ottavi
IEEE Transactions on Computers | VOL. 70
17 Apr 2020

Development of an image processing-based ball tracking program for table tennis
Je-Heon Moon ... Ju-Sung Lee
Korean Journal of Sport Science | VOL. 31
30 Jun 2020
