BiCluE - Exact and heuristic algorithms for weighted bi-cluster editing of biomedical data

Peng Sun,Jan Baumbach,Jiong Guo

doi:10.1186/1753-6561-7-s7-s9

Abstract

BackgroundThe explosion of biological data has dramatically reformed today's biology research. The biggest challenge to biologists and bioinformaticians is the integration and analysis of large quantity of data to provide meaningful insights. One major problem is the combined analysis of data from different types. Bi-cluster editing, as a special case of clustering, which partitions two different types of data simultaneously, might be used for several biomedical scenarios. However, the underlying algorithmic problem is NP-hard.ResultsHere we contribute with BiCluE, a software package designed to solve the weighted bi-cluster editing problem. It implements (1) an exact algorithm based on fixed-parameter tractability and (2) a polynomial-time greedy heuristics based on solving the hardest part, edge deletions, first. We evaluated its performance on artificial graphs. Afterwards we exemplarily applied our implementation on real world biomedical data, GWAS data in this case. BiCluE generally works on any kind of data types that can be modeled as (weighted or unweighted) bipartite graphs.ConclusionsTo our knowledge, this is the first software package solving the weighted bi-cluster editing problem. BiCluE as well as the supplementary results are available online at http://biclue.mpi-inf.mpg.de.

Highlights

The enormous amount of available data from laboratories around the world has greatly shifted the focus of biologically motivated studies
UniProtKb/Swiss-Prot provides a database containing more than 53,000 annotated sequences, extracted and integrated from 205,244 published references and Protein Data Bank (PDB) has incorporated over 78,400 molecule structures
We focus on the exact and heuristic algorithms that cluster data from different types simultaneously, i.e. so called “bi- cluster editing”

Summary

Introduction

Background The enormous amount of available (sequential) data from laboratories around the world has greatly shifted the focus of biologically motivated studies. UniProtKb/Swiss-Prot provides a database containing more than 53,000 annotated sequences, extracted and integrated from 205,244 published references and Protein Data Bank (PDB) has incorporated over 78,400 molecule structures. Integrating, processing and analyzing large quantities of data from various sources have become the main challenge in modern bioinformatics. The explosion of biological data has dramatically reformed today’s biology research. The biggest challenge to biologists and bioinformaticians is the integration and analysis of large quantity of data to provide meaningful insights. One major problem is the combined analysis of data from different types. Bi-cluster editing, as a special case of clustering, which partitions two different types of data simultaneously, might be used for several biomedical scenarios.

Objectives

Methods

Results

Conclusion

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Proceedings	Publication Date: Dec 1, 2013
Citations: 18	License type: CC BY 2.0

R Discovery Prime

R Discovery Prime

BiCluE - Exact and heuristic algorithms for weighted bi-cluster editing of biomedical data

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: BMC Proceedings

Lead the way for us

Similar Papers

Integrated simultaneous analysis of different biomedical data types with exact weighted bi-cluster editing
Peng Sun ... Jiong Guo
Journal of Integrative Bioinformatics | VOL. 9
Peng Sun, et. al.Peng Sun ... Jiong Guo
01 Jun 2012
Journal of Integrative Bioinformatics | VOL. 9

Bi-Force: large-scale bicluster editing and its application to gene expression data biclustering
Peng Sun ... Jan Baumbach
Nucleic Acids Research | VOL. 42
Peng Sun, et. al.Peng Sun ... Jan Baumbach
20 Mar 2014
Nucleic Acids Research | VOL. 42

Path-Contractions, Edge Deletions and Connectivity Preservation
...
-
, et. al. ...
31 Aug 2017
31 Aug 2017

Fixed-Parameter Algorithms in Analysis of Heuristics for Extracting Networks in Linear Programs
Gregory Gutin ... Daniel Karapetyan
-
Gregory Gutin, et. al.Gregory Gutin ... Daniel Karapetyan
01 Jan 2009
01 Jan 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

BiCluE - Exact and heuristic algorithms for weighted bi-cluster editing of biomedical data

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: BMC Proceedings