Abstract

In this paper we consider the possibility of using the gem5 computer system simulator to determine such metrics of parallel program code as its acceleration and the average workload of cpu cores on computer systems with shared memory. The working process in gem5 consists of three stages: building a model of a computer system, loading a software environment into the built model and executing a user program within this environment. The considered approach is used to evaluate the efficiency of a parallel program for modeling the transport of pollutants in the atmosphere. The program assumes both intensive work with memory and strong interaction between computational threads within the framework of parallel computing. For this purpose, two models of computer systems were created in the gem5 simulator: the simplest functional model of the “cores–RAM ” type and the detailed clock model of the “cores–three-level cache–RAM ” type. By running the program on both computer models and comparing obtained results with those observed on a real computer, it is shown that the computer system models created in gem5 are operable, correctly reproduce the functionality of real computer systems, and can be used to obtain fairly accurate estimates of such metrics as the acceleration of the program and the average workload of cpu cores. It should be noted that the accuracy of the estimates slightly depends on the number of simulated cores in computer models. For example, the acceleration accuracy obtained on the detailed clock model is 0.7 % on five cores and might increase up to 10 % on 32 cores.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call