Clustering of Biological Datasets in the Era of Big Data.

Richard Röttger

doi:10.2390/biecoll-jib-2016-300

Abstract

Clustering is a long-standing problem in computer science and is applied in virtually every scientific field to explore the inherent structure of datasets. In biomedical research, clustering tools have been used in many areas, including expression analysis, disease subtyping, and protein research. A plethora of approaches has been developed, but there is little guidance on which approach is optimal in which situation. Furthermore, a typical cluster analysis is an entire process with several highly interconnected steps: preprocessing, proximity calculation, the actual clustering, evaluation, and optimization. An optimal result can be achieved only when all steps work together seamlessly. This makes a cluster analysis tiresome and error-prone, especially for non-experts. A mere trial-and-error approach becomes increasingly infeasible given the tremendous growth of available datasets; thus, a strategic and thoughtful course of action is crucial for a cluster analysis. This manuscript provides an overview of the crucial steps and the most common techniques involved in conducting a state-of-the-art cluster analysis of biomedical datasets.
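
The interconnected steps named in the abstract (preprocessing, proximity calculation, clustering, evaluation, optimization) can be illustrated with a minimal sketch. Everything concrete here is an assumption chosen for illustration and not taken from the manuscript: z-score standardization, Euclidean distance, average-linkage agglomerative clustering, the mean silhouette width as quality measure, and the toy data. All function names are hypothetical.

```python
import math

def standardize(rows):
    """Preprocessing: scale every feature (column) to zero mean, unit variance."""
    cols = list(zip(*rows))
    means = [sum(c) / len(c) for c in cols]
    # A constant column has standard deviation 0; fall back to 1.0 to avoid dividing by zero.
    stds = [math.sqrt(sum((v - m) ** 2 for v in c) / len(c)) or 1.0
            for c, m in zip(cols, means)]
    return [[(v - m) / s for v, m, s in zip(r, means, stds)] for r in rows]

def distance_matrix(rows):
    """Proximity calculation: pairwise Euclidean distances."""
    return [[math.dist(a, b) for b in rows] for a in rows]

def average_linkage(dist, k):
    """Clustering: repeatedly merge the two closest clusters until k remain."""
    clusters = [[i] for i in range(len(dist))]

    def gap(a, b):
        # Average distance between all cross-cluster pairs.
        return sum(dist[i][j] for i in a for j in b) / (len(a) * len(b))

    while len(clusters) > k:
        _, x, y = min((gap(clusters[x], clusters[y]), x, y)
                      for x in range(len(clusters))
                      for y in range(x + 1, len(clusters)))
        clusters[x] += clusters.pop(y)  # y > x, so index x stays valid
    labels = [0] * len(dist)
    for label, members in enumerate(clusters):
        for i in members:
            labels[i] = label
    return labels

def silhouette(dist, labels):
    """Evaluation: mean silhouette width (needs at least two clusters)."""
    groups = {}
    for i, lab in enumerate(labels):
        groups.setdefault(lab, []).append(i)
    total = 0.0
    for i, lab in enumerate(labels):
        own = groups[lab]
        if len(own) == 1:
            continue  # singleton clusters contribute 0 by convention
        a = sum(dist[i][j] for j in own) / (len(own) - 1)
        b = min(sum(dist[i][j] for j in g) / len(g)
                for other, g in groups.items() if other != lab)
        if max(a, b) > 0:
            total += (b - a) / max(a, b)
    return total / len(labels)

def cluster_analysis(rows, k_values):
    """Optimization: try each candidate k (>= 2) and keep the best-scoring result."""
    dist = distance_matrix(standardize(rows))
    best_score, best_k = max((silhouette(dist, average_linkage(dist, k)), k)
                             for k in k_values)
    return best_k, average_linkage(dist, best_k), best_score

# Hypothetical toy data: six samples, two features, two visibly separate groups.
samples = [[1.0, 1.2], [0.9, 1.0], [1.1, 0.8],
           [5.0, 5.2], [5.1, 4.9], [4.8, 5.0]]
best_k, labels, score = cluster_analysis(samples, [2, 3])
print(best_k, labels, round(score, 3))
```

Each stage consumes the output of the previous one, so a different choice at any step (for example, another distance measure) changes every later result. This is one way to read the abstract's point that the steps must work together.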

Journal: Journal of Integrative Bioinformatics
Publication Date: Dec 22, 2016
Citations: 4

Similar Papers

Clustering of Biological Datasets in the Era of Big Data
Richard Röttger
Journal of Integrative Bioinformatics | VOL. 13
01 Mar 2016

A Primer on Machine Learning.
Audrene S Edwards ... Bruce Kaplan
Transplantation | VOL. 105
18 Aug 2020

Development characteristics and research frontier report of Chinese medical education research with an international view
01 May 2019

A Privacy Preserving Approach to Feasibility Analyses on Distributed Data Sources in Biomedical Research.
Heiko Scheel ... Thorsten Rottmann
Studies in health technology and informatics | VOL. 267
03 Sep 2019