SegMine workflows for semantic microarray data analysis in Orange4WS

Vid Podpečan,Nada Lavrač,Igor Mozetič,Helena Motaln,Kimmo Kulovesi,Hannu Toivonen,Petra Kralj Novak,Laura Langohr,Kristina Gruden,Igor Trajkovski,Marko Petek

doi:10.1186/1471-2105-12-416

Abstract

BackgroundIn experimental data analysis, bioinformatics researchers increasingly rely on tools that enable the composition and reuse of scientific workflows. The utility of current bioinformatics workflow environments can be significantly increased by offering advanced data mining services as workflow components. Such services can support, for instance, knowledge discovery from diverse distributed data and knowledge sources (such as GO, KEGG, PubMed, and experimental databases). Specifically, cutting-edge data analysis approaches, such as semantic data mining, link discovery, and visualization, have not yet been made available to researchers investigating complex biological datasets.ResultsWe present a new methodology, SegMine, for semantic analysis of microarray data by exploiting general biological knowledge, and a new workflow environment, Orange4WS, with integrated support for web services in which the SegMine methodology is implemented. The SegMine methodology consists of two main steps. First, the semantic subgroup discovery algorithm is used to construct elaborate rules that identify enriched gene sets. Then, a link discovery service is used for the creation and visualization of new biological hypotheses. The utility of SegMine, implemented as a set of workflows in Orange4WS, is demonstrated in two microarray data analysis applications. In the analysis of senescence in human stem cells, the use of SegMine resulted in three novel research hypotheses that could improve understanding of the underlying mechanisms of senescence and identification of candidate marker genes.ConclusionsCompared to the available data analysis systems, SegMine offers improved hypothesis generation and data interpretation for bioinformatics in an easy-to-use integrated workflow environment.

Highlights

IntroductionBioinformatics researchers increasingly rely on tools that enable the composition and reuse of scientific workflows
In experimental data analysis, bioinformatics researchers increasingly rely on tools that enable the composition and reuse of scientific workflows
Because DAVID does not provide probabilistic search in large graphs that is provided in SegMine through Biomine services, only the results of the _rst step of the SegMine methodology, namely the sets of differentially expressed genes found by the SEGS algorithm, were used in the comparison

Summary

Introduction

Bioinformatics researchers increasingly rely on tools that enable the composition and reuse of scientific workflows. The utility of current bioinformatics workflow environments can be significantly increased by offering advanced data mining services as workflow components. Such services can support, for instance, knowledge discovery from diverse distributed data and knowledge sources (such as GO, KEGG, PubMed, and experimental databases). Cutting-edge data analysis approaches, such as semantic data mining, link discovery, and visualization, have not yet been made available to researchers investigating complex biological datasets. The field of microarray data analysis has shifted emphasis from methods for identifying individual differentially expressed genes to methods for identifying. A comparison of several software and web tools (OntoExpress, CLASSIFI, GoMiner, EASEonline, GeneMerge, FuncAssociate, GOTree Machine, etc.) has been performed by Khatri and Draghici [12]

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC bioinformatics	Publication Date: Oct 26, 2011
Citations: 63	License type: CC BY 2.0

R Discovery Prime

R Discovery Prime

SegMine workflows for semantic microarray data analysis in Orange4WS

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC bioinformatics

Lead the way for us

Similar Papers

SIRT7 antagonizes human stem cell aging as a heterochromatin stabilizer
Shijia Bi ... Yan Yao
Protein & cell | VOL. 11
Shijia Bi, et. al.Shijia Bi ... Yan Yao
06 Jun 2020
Protein & cell | VOL. 11

Human stem cell aging: do mitochondrial DNA mutations have a causal role?
Holly L Baines ... Douglass M Turnbull
Aging Cell | VOL. 13
Holly L Baines, et. al.Holly L Baines ... Douglass M Turnbull
28 Jan 2014
Aging Cell | VOL. 13

Semantic Analysis Processes in Advanced Pattern Understanding Systems
Lidia Ogiela ... Marek R Ogiela
-
Lidia Ogiela, et. al.Lidia Ogiela ... Marek R Ogiela
01 Jan 2010
01 Jan 2010

Changes in the frequencies of human hematopoietic stem and progenitor cells with age and site
Tracy L Farrell ... John Graham Sharp
Experimental hematology | VOL. 42
Tracy L Farrell, et. al.Tracy L Farrell ... John Graham Sharp
15 Nov 2013
Experimental hematology | VOL. 42

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SegMine workflows for semantic microarray data analysis in Orange4WS

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC bioinformatics