Data-intensive spatial filtering in large numerical simulation datasets

Kalin Kanov ,Randal Burns ,Greg Eyink ,Charles Meneveau ,Alexander S Szalay

doi:10.5555/2388996.2389078

Abstract

We present a query processing framework for the efficient evaluation of spatial filters on large numerical simulation datasets stored in a data-intensive cluster. Previously, filtering of large numerical simulations stored in scientific databases has been impractical owing to the immense data requirements. Rather, filtering is done during simulation or by loading snapshots into the aggregate memory of an HPC cluster. Our system performs filtering within the database and supports large filter widths. We present two complementary methods of execution: I/O streaming computes a batch filter query in a single sequential pass using incremental evaluation of decomposable kernels, summed volumes generates an intermediate data set and evaluates each filtered value by accessing only eight points in this dataset. We dynamically choose between these methods depending upon workload characteristics. The system allows us to perform filters against large data sets with little overhead: query performance scales with the cluster's aggregate I/O throughput.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Data-intensive spatial filtering in large numerical simulation datasets

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Data-intensive spatial filtering in large numerical simulation datasets
Kalin Kanov ... Charles Meneveau
-
Kalin Kanov, et. al.Kalin Kanov ... Charles Meneveau
01 Nov 2012
01 Nov 2012

Fast data-oriented microaggregation algorithm for large numerical datasets
Reza Mortazavi ... Saeed Jalili
Knowledge-Based Systems | VOL. 67
Reza Mortazavi, et. al.Reza Mortazavi ... Saeed Jalili
21 May 2014
Knowledge-Based Systems | VOL. 67

Parallel spectral element method for guided wave based structural health monitoring
P Kudela ... P Fiborek
Smart Materials and Structures | VOL. 29
P Kudela, et. al.P Kudela ... P Fiborek
10 Aug 2020
Smart Materials and Structures | VOL. 29

ReVisE: Remote visualization environment for large numerical simulation datasets.
Stepan Orlov ... Vyacheslav Reshetnikov
PloS one | VOL. 16
Stepan Orlov, et. al.Stepan Orlov ... Vyacheslav Reshetnikov
27 Jul 2021
PloS one | VOL. 16

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Data-intensive spatial filtering in large numerical simulation datasets

Abstract

Talk to us

Similar Papers