Investigating Memory Optimization of Hash-index for Next Generation Sequencing on Multi-core Architecture

Wendi Wang, Guangming Tan, Ninghui Sun, Wen Tang, Linchuan Li, Peiheng Zhang

doi:10.1109/ipdpsw.2012.83

Abstract

Next Generation Sequencing (NGS) is gaining interest due to increased demand and decreased sequencing cost. An important prerequisite step of most NGS applications is mapping short sequences, called reads, to template reference sequences. Both the explosion of NGS data, with billions of reads generated each day, and the data-intensive computations pose great challenges to the capability of existing computing systems. In this paper, we take a hash-index-based algorithm (PerM) as an example to investigate optimization approaches for accelerating NGS read mapping on multi-core architectures. First, we propose a new parallel algorithm that reorders bucket accesses in the hash index among multiple threads so that data locality in the shared cache is improved. Second, to reduce the number of empty hash buckets, we propose a serialized hash index compression algorithm, which matches the sequential access pattern of our new parallel algorithm. The reduced hash index size also makes it possible to use longer hash keys, which alleviates hash conflicts and improves query performance. Our experiment on an 8-socket, 8-core Intel Xeon X7550 SMP with 128 GB of memory shows that the new parallel algorithm reduces the LLC miss ratio to 8% to 15% of that of the original algorithm and improves overall performance by 4 to 11 times (6 times on average).
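
The two ideas in the abstract can be illustrated with a minimal, single-threaded sketch. It is not PerM's implementation or the authors' algorithm. It assumes exact k-mer seeds over the alphabet A, C, G, T, and every name in it (`kmer_key`, `build_compact_index`, `lookup`, `map_reads_in_bucket_order`) is hypothetical. The index stores only non-empty buckets in sorted key order, and reads are queried in bucket order so that index accesses sweep forward through memory instead of jumping at random.

```python
from bisect import bisect_left
from collections import defaultdict

# Assumption: inputs contain only A, C, G, T (2 bits per base).
BASE_CODE = {"A": 0, "C": 1, "G": 2, "T": 3}


def kmer_key(seq, k):
    """Encode the first k bases of seq as an integer hash key."""
    key = 0
    for base in seq[:k]:
        key = (key << 2) | BASE_CODE[base]
    return key


def build_compact_index(reference, k):
    """Return (keys, offsets, positions), storing non-empty buckets only.

    keys[i] is the i-th non-empty bucket key in ascending order, and its
    reference positions are positions[offsets[i]:offsets[i + 1]].
    """
    buckets = defaultdict(list)
    for pos in range(len(reference) - k + 1):
        buckets[kmer_key(reference[pos:pos + k], k)].append(pos)
    keys = sorted(buckets)
    offsets, positions = [0], []
    for key in keys:
        positions.extend(buckets[key])
        offsets.append(len(positions))
    return keys, offsets, positions


def lookup(index, key):
    """Return the reference positions stored under key, or an empty list."""
    keys, offsets, positions = index
    i = bisect_left(keys, key)
    if i == len(keys) or keys[i] != key:
        return []
    return positions[offsets[i]:offsets[i + 1]]


def map_reads_in_bucket_order(index, reads, k):
    """Query reads sorted by hash key; results keep the input read order."""
    order = sorted(range(len(reads)), key=lambda i: kmer_key(reads[i], k))
    hits = [None] * len(reads)
    for i in order:
        hits[i] = lookup(index, kmer_key(reads[i], k))
    return hits


index = build_compact_index("ACGTACGTTT", 3)
hits = map_reads_in_bucket_order(index, ["CGTA", "ACGT", "GGG"], 3)
```

In a multi-threaded setting, one possible arrangement under the same assumptions is to give each thread a contiguous slice of the sorted read order, so that threads running at the same time touch neighbouring regions of the index. The paper itself should be consulted for the actual scheduling and compression scheme.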


Similar Papers


Abstract 4892: Methods for accurate reporting of confidence intervals in clinical applications of next generation sequencing (NGS)
Erin L Crawford ... James C Willey
Cancer Research | VOL. 75
01 Aug 2015

Application of next generation sequencing in HIV drug resistance studies in Africa, 2005–2019: A systematic review
Phindulo Mathobo ... Pascal O Bessong
Scientific African | VOL. 12
01 Jul 2021

Application of Next Generation Sequencing (NGS) technology in forensic science: A review
Yakubu Magaji Yuguda
GSC Biological and Pharmaceutical Sciences | VOL. 23
30 May 2023

Abstract B017: Clinical utility of next generation sequencing in advanced colorectal cancer: The earlier the better
Ho Jung An ... Hyunho Kim
Molecular Cancer Therapeutics | VOL. 22
01 Dec 2023
