SCBI_MapReduce, a New Ruby Task-Farm Skeleton for Automated Parallelisation and Distribution in Chunks of Sequences: The Implementation of a Boosted Blast+

Darío Guerrero-Fernández, M Gonzalo Claros, Juan Falgueras

doi:10.1155/2013/707540


Open Access

https://doi.org/10.1155/2013/707540


Journal: Computational Biology Journal
Publication Date: Oct 27, 2013
Citations: 30
License type: CC BY 3.0

Affiliation: Universidad de Málaga

Abstract

Current genomic analyses often require managing and comparing big data with desktop bioinformatic software that was not designed for multicore distribution. The task-farm SCBI_MAPREDUCE is intended to simplify the trivial parallelisation and distribution of new and legacy software and scripts for biologists who are interested in using computers but are not skilled programmers. For legacy applications, there is no need to modify or rewrite the source code. It can be used on anything from multicore workstations to heterogeneous grids. Tests have demonstrated that speed-up scales almost linearly and that distribution in small chunks increases it. It is also shown that SCBI_MAPREDUCE takes advantage of shared storage when necessary, is fault-tolerant, allows aborted jobs to be resumed, does not need special hardware or virtual machine support, and provides the same results as parallelised legacy software. The same is true for interrupted and relaunched jobs. As a proof of concept, the distribution of a compiled version of BLAST+ in the SCBI_DISTRIBUTED_BLAST gem is presented, indicating that other BLAST binaries can be used while keeping the same SCBI_DISTRIBUTED_BLAST code. Therefore, SCBI_MAPREDUCE suits most parallelisation and distribution needs in, for example, gene and genome studies.
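
To make the task-farm idea in the abstract concrete, the following is a minimal sketch in plain Ruby. It is not the SCBI_MAPREDUCE API and does not reproduce the authors' implementation. The names `task_farm`, `process_chunk`, `CHUNK_SIZE`, and `WORKERS` are hypothetical, and a GC-content calculation stands in for the real per-chunk work, such as running a BLAST+ binary. It only shows the general pattern: a manager splits the sequences into chunks, workers pull chunks from a shared queue, and the results are reassembled in input order.

```ruby
# Illustrative task farm in plain Ruby; this is NOT the SCBI_MapReduce API.
# All names below are hypothetical and chosen only for this sketch.

CHUNK_SIZE = 2 # assumed number of sequences per chunk
WORKERS    = 3 # assumed number of workers

# Stand-in for the real per-chunk work (e.g. calling an external binary).
# Returns [sequence, GC fraction] pairs for every sequence in the chunk.
def process_chunk(chunk)
  chunk.map { |seq| [seq, seq.count('GC').fdiv(seq.length)] }
end

def task_farm(sequences, chunk_size: CHUNK_SIZE, workers: WORKERS)
  # Manager: split the input into indexed chunks and enqueue them.
  jobs = Queue.new
  sequences.each_slice(chunk_size).with_index { |chunk, i| jobs << [i, chunk] }
  jobs.close # popping from a closed, empty queue returns nil

  # Workers: each one keeps pulling chunks until the queue is drained.
  results = Queue.new
  pool = Array.new(workers) do
    Thread.new do
      while (job = jobs.pop)
        index, chunk = job
        results << [index, process_chunk(chunk)]
      end
    end
  end
  pool.each(&:join)

  # Manager: collect the results and restore the original chunk order.
  collected = []
  collected << results.pop until results.empty?
  collected.sort_by(&:first).flat_map(&:last)
end

seqs = %w[ATGC GGCC ATAT CGCG TTAA]
gc_content = task_farm(seqs)
gc_content.each { |seq, gc| puts "#{seq}\t#{gc}" }
```

Because workers take a new chunk as soon as they finish the previous one, smaller chunks tend to balance the load across workers of different speeds, which is one plausible reading of the abstract's remark that distribution in small chunks increases speed-up. Threads are used here only to keep the sketch short; in standard Ruby they do not run CPU-bound code on several cores at once, so a real deployment would rely on separate processes or machines.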
