Abstract

Leading-edge supercomputers, such as the K computer, have generated vast amounts of simulation results, and most of these datasets are stored on the file system for post-hoc analysis such as visualization. In this work, we first investigated the data generation trends of the K computer by analyzing operational log files. We verified a tendency to generate large numbers of distributed files as simulation outputs; in most cases, the number of files was proportional to the number of utilized computational nodes, that is, each computational node produced one or more files. Considering that the computational cost of visualization tasks is usually much smaller than that of large-scale numerical simulations, a flexible data input/output (I/O) management mechanism becomes highly useful for post-hoc visualization and analysis. In this work, we focused on the xDMlib data management library and its flexible data I/O mechanism to enable flexible loading of big climate simulation results. In the proposed approach, a pre-processing step is executed on the target distributed files to generate the lightweight metadata needed to build the data assignment mapping used in the subsequent data loading process. We evaluated the proposed approach by using a 32-node visualization cluster and the K computer. Although using a smaller number of processes incurs an inevitable performance penalty in the form of longer data loading times, the approach avoids any data replication via copy, conversion, or extraction. In addition, users can freely select any number of nodes for post-hoc visualization and analysis, regardless of the number of distributed files.
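To illustrate the idea behind the pre-processing and the data assignment mapping, the following minimal Python sketch gathers lightweight per-file metadata and then maps an arbitrary number of distributed files onto a user-chosen process count. All names here (build_metadata, assign_files, the JSON metadata layout) are illustrative assumptions, not the actual xDMlib API.

    # Hypothetical sketch of metadata-driven file-to-process assignment;
    # not the xDMlib implementation.
    import json
    import os

    def build_metadata(file_paths, metadata_path="metadata.json"):
        """Pre-processing: record name and size of each distributed output file."""
        records = [{"path": p, "bytes": os.path.getsize(p)} for p in file_paths]
        with open(metadata_path, "w") as f:
            json.dump(records, f)
        return records

    def assign_files(records, num_processes):
        """Data assignment mapping: round-robin the files over the chosen
        processes, so num_processes need not match the number of files."""
        mapping = {rank: [] for rank in range(num_processes)}
        for i, rec in enumerate(records):
            mapping[i % num_processes].append(rec["path"])
        return mapping

    # Example: 1024 per-node simulation outputs loaded by a 32-node cluster.
    # records = build_metadata([f"out/rank{r:04d}.dat" for r in range(1024)])
    # mapping = assign_files(records, num_processes=32)

Because only the metadata is consulted at load time, each visualization process reads its assigned files directly, which is what makes it possible to avoid replicating the data via copy, conversion, or extraction.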
