Abstract

Background: Modern omics research involves the application of high-throughput technologies that generate vast volumes of data. These data need to be pre-processed, analyzed and integrated with existing knowledge through the use of diverse sets of software tools, models and databases. The analyses are often interdependent and chained together to form complex workflows or pipelines. Given the volume of the data and the multitude of computational resources available, specialized pipeline software is required to make high-throughput analysis of large-scale omics datasets feasible.

Results: We have developed a generic pipeline system called Cyrille2. The system is modular in design and consists of three functionally distinct parts: 1) a web-based graphical user interface (GUI) that enables a pipeline operator to manage the system; 2) the Scheduler, which forms the functional core of the system, tracks what data enters the system and determines which jobs must be scheduled for execution; and 3) the Executor, which searches for scheduled jobs and executes them on a compute cluster.

Conclusion: The Cyrille2 system is an extensible, modular system that implements the stated requirements. Cyrille2 enables easy creation and execution of high-throughput, flexible bioinformatics pipelines.
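
The abstract does not spell out how the Scheduler and Executor interact, but the division of labor it describes might look roughly like the following minimal sketch. All names here (Job, Scheduler, Executor) and the print stand-in for cluster submission are illustrative assumptions, not the actual Cyrille2 implementation or API.

```python
import queue

class Job:
    """One analysis step: a tool plus the files it consumes and produces."""
    def __init__(self, tool, inputs, outputs):
        self.tool = tool
        self.inputs = inputs
        self.outputs = outputs
        self.state = "waiting"

class Scheduler:
    """Tracks what data has entered the system and decides which jobs
    must be scheduled for execution."""
    def __init__(self, jobs):
        self.jobs = jobs
        self.ready = queue.Queue()

    def schedule(self, available_files):
        for job in self.jobs:
            if job.state == "waiting" and all(f in available_files for f in job.inputs):
                job.state = "scheduled"
                self.ready.put(job)

class Executor:
    """Searches for scheduled jobs and runs them; a real executor would
    submit each job to a compute cluster instead."""
    def __init__(self, scheduler):
        self.scheduler = scheduler

    def run_pending(self):
        while not self.scheduler.ready.empty():
            job = self.scheduler.ready.get()
            print("submitting:", job.tool, *job.inputs)  # stand-in for cluster submission
            job.state = "finished"

# Example: a job becomes runnable once its input data has entered the system.
jobs = [Job("blast", ["genome.fa"], ["hits.txt"])]
scheduler = Scheduler(jobs)
scheduler.schedule(available_files={"genome.fa"})
Executor(scheduler).run_pending()
```

Keeping scheduling and execution in separate components, as the paper describes, lets the system decide what should run independently of where and how it runs.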

Highlights

  • Modern omics research involves the application of high-throughput technologies that generate vast volumes of data

  • The software tools, models and databases used in this process need to be arranged in precise computational chains, where the output of one analysis serves as the input to a subsequent analysis

  • Our local implementation of the Cyrille2 system runs on a dedicated server



Introduction

Modern omics research involves the application of high-throughput technologies that generate vast volumes of data. These data need to be pre-processed, analyzed and integrated with existing knowledge through the use of diverse sets of software tools, models and databases. These tools need to be arranged in precise computational chains, where the output of one analysis serves as the input to a subsequent analysis. Such chains are often referred to as pipelines or workflows.
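
To make the chaining idea concrete, here is a minimal sketch in which each step's output becomes the next step's input. The step functions are hypothetical placeholders, not actual Cyrille2 components.

```python
# Minimal sketch of a linear pipeline: the output of each analysis step
# is the input of the next. All step functions are illustrative placeholders.
def preprocess(raw_reads):
    """Clean up raw sequence reads."""
    return [r.strip().upper() for r in raw_reads]

def analyze(clean_reads):
    """Compute a toy per-read statistic."""
    return {r: len(r) for r in clean_reads}

def integrate(results, knowledge_base):
    """Combine analysis results with existing annotations."""
    return {r: (n, knowledge_base.get(r, "unannotated")) for r, n in results.items()}

pipeline = [preprocess, analyze]
data = ["acgt\n", "ttgacc\n"]
for step in pipeline:
    data = step(data)  # output of one step serves as input to the next

print(integrate(data, {"ACGT": "known motif"}))
# {'ACGT': (4, 'known motif'), 'TTGACC': (6, 'unannotated')}
```

A pipeline system such as Cyrille2 generalizes this pattern: the chain is described as data rather than hard-coded, so steps can be added, removed or rerun without rewriting the surrounding logic.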

