Abstract

IntroductionThe Secure Unified Research Environment (SURE) is a high-powered computing environment located within Sax Institute (Sydney, Australia). SURE was established through the financial support of the Australian Government National Collaborative Research Infrastructure Strategy (NCRIS) as part of the Population Health Research Network (PHRN). SURE is approved by the Australian Government as the only secure platform for analysing unit record level sensitive health and other Australian Government data, providing computational resources and secure infrastructure in a form of virtual machines (VMs) accessible by approved researchers in Australia and overseas.
 Objectives and ApproachWe aim to compare computational performance of SURE VMs of different configurations with the performance of physical computers by running a series of standardised computational tasks involving different numbers of central processingunit (CPU) cores available on each computer. The approach utilised the benchmark test maintained by the H2O.ai group (https://h2oai.github.io/db-benchmark/). The results were measured over the datasets of different sizes, ranging from 500MB to 50GB in Random Access Memory (RAM).
 ResultsOur benchmarking outcomes have revealed that computational efficiency of physical computers uniformly outperform the efficiency of the current standard SURE VM configuration offerings, sometimes demonstrating a nearly double performance. For the range of typical analytical tasks assessed, computational performance greatly benefits from extending the number of computational cores available on a machine.
 Conclusion / ImplicationsSURE is a highly valuable tool enabling research and collaborations involving confidential population-based data. The shortage of RAM and CPU cores can be a major bottleneck even for moderately large datasets. VMs currently offered by SURE yet fall short of reaching computational performance of physical desktop computers. The results are to guide the funders and providers of secure remote access data laboratories responsible for providing Research Infrastructure as a Service (IaaS) tailored to meet the needs of participating research groups.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call