Abstract

Reverse time migration (RTM) is an algorithm widely used in the oil and gas industry to process seismic data. It is a computationally intensive task that suits well in parallel computers. Methods such as RTM can be parallelized in shared memory systems through scheduling iterations of parallel loops to threads. However, several aspects, such as memory size and hierarchy, number of cores, and input size, make optimal scheduling very challenging. In this paper, we introduce a run-time strategy to automatically tune the dynamic scheduling of parallel loops iterations in iterative applications, such as the RTM, in multicore systems. The proposed method aims to reduce the execution time of such applications. To find the optimal granularity, we propose a coupled simulated annealing (CSA) based auto-tuning strategy that adjusts the chunk size of work that OpenMP parallel loops assign dynamically to worker threads during the initialization of a 3D RTM application. Experiments performed with different computational systems and input sizes show that the proposed method is consistently better than the default OpenMP schedulers, static, auto, and guided, causing the application to be up to 33% faster. We show that the possible reason for this performance is the reduction of cache misses, mainly level L3, and low overhead, inferior to 2%. Having shown to be robust and scalable for the 3D RTM, the proposed method could also improve the performance of similar wave-based algorithms, such as full-waveform inversion (FWI) and other iterative applications.

Highlights

  • Seismic reflection surveying is the best known and used geophysical method for subsurface imaging

  • We first present the basics of our target application: the reverse time migration (RTM) (Section II), the parallelization strategies made available by OpenMP (Section III) and the optimization method that comprises the proposed auto-tuning, the coupled simulated annealing (CSA) (Section IV)

  • The proposed work is applied in the second degree of parallelization, where different loops of the RTM operation of each CS gather are parallelized among the cores of a multicore system, with OpenMP

Read more

Summary

INTRODUCTION

Seismic reflection surveying is the best known and used geophysical method for subsurface imaging. The computational cost is the main factor limiting the application of RTM, as well as for several other geophysical algorithms [4], [5] For this reason, parallel computing techniques have been widely applied to these methods (e.g., [6]). This paper presents an execution time auto-tuning strategy to automatically find an optimal chunk size for OpenMP [9] dynamic scheduling of parallel loops. This method aims to reduce the run time of iterative applications. We first present the basics of our target application: the RTM (Section II), the parallelization strategies made available by OpenMP (Section III) and the optimization method that comprises the proposed auto-tuning, the CSA (Section IV).

REVERSE TIME MIGRATION FORMULATION
COUPLED SIMULATED ANNEALING
IMPLEMENTATION ASPECTS OF RTM
CSA-BASED AUTO-TUNING
17: CSA generates a new solution for each optimizer from the time measures
CSA AUTO-TUNING PARAMETERIZATION
Findings
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call