Partitional Clustering Algorithms Research Articles

Many medical image processing and analysis operations can benefit a great deal from prior information encoded in the form of models/atlases to capture variations over a population in form, shape, anatomic layout, and image appearance of objects. However, two fundamental questions have not been addressed in the literature: "How many models/atlases are needed for optimally encoding prior information to address the differing body habitus factor in that population?" and "Images of how many subjects in the given population are needed to optimally harness prior information?" We propose a method to seek answers to these questions. We assume that there is a well-defined body region of interest and a subject population under consideration, and that we are given a set of representative images of the body region for the population. After images are trimmed to the exact body region, a hierarchical agglomerative clustering algorithm partitions the set of images into a specified number of groups by using pairwise image (dis)similarity as a cost function. Optionally the images may be pre-registered among themselves prior to this partitioning operation. We define a measure called Residual Dissimilarity (RD) to determine the goodness of each partition. We then ascertain how RD varies as a function of the number of elements in the partition for finding the optimum number(s) of groups. Breakpoints in this function are taken as the recommended number of groups/models/atlases. Our results from analysis of sizeable CT data sets of adult patients from two body regions - thorax (346) and head and neck (298) - can be summarized as follows. (1) A minimum of 5 to 8 groups (or models/atlases) seems essential to properly capture information about differing anatomic forms and body habitus. (2) A minimum of 150 images from different subjects in a population seems essential to cover the anatomical variations for a given body region. (3) In grouping, body habitus variations seem to override differences due to other factors such as gender, with/without contrast enhancement in image acquisition, and presence of moderate pathology. This method may be helpful for constructing high quality models/atlases from a sufficiently large population of images and in optimally selecting the training image sets needed in deep learning strategies.

Read full abstract

Алгоритм PAM (Partitioning Around Medoids) представляет собой разделительный алгоритм кластеризации, в котором в качестве центров кластеров выбираются только кластеризуемые объекты (медоиды). Кластеризация на основе техники медоидов применяется в широком спектре приложений: сегментирование медицинских и спутниковых изображений, анализ ДНК-микрочипов и текстов и др. На сегодня имеются параллельные реализации PAM для систем GPU и FPGA, но отсутствуют таковые для многоядерных ускорителей архитектуры Intel Many Integrated Core (MIC). В настоящей статье предлагается новый параллельный алгоритм кластеризации PhiPAM для ускорителей Intel MIC. Вычисления распараллеливаются с помощью технологии OpenMP. Алгоритм предполагает использование специализированной компоновки данных в памяти и техники тайлинга, позволяющих эффективно векторизовать вычисления на системах Intel MIC. Эксперименты, проведенные на реальных наборах данных, показали хорошую масштабируемость алгоритма. The PAM (Partitioning Around Medoids) is a partitioning clustering algorithm where each cluster is represented by an object from the input dataset (called a medoid). The medoid-based clustering is used in a wide range of applications: the segmentation of medical and satellite images, the analysis of DNA microarrays and texts, etc. Currently, there are parallel implementations of PAM for GPU and FPGA systems, but not for Intel Many Integrated Core (MIC) accelerators. In this paper, we propose a novel parallel PhiPAM clustering algorithm for Intel MIC systems. Computations are parallelized by the OpenMP technology. The algorithm exploits a sophisticated memory data layout and loop tiling technique, which allows one to efficiently vectorize computations with Intel MIC. Experiments performed on real data sets show a good scalability of the algorithm.

Read full abstract

Partitional Clustering Algorithms Research Articles

Related Topics

Articles published on Partitional Clustering Algorithms

Improved fast partitional clustering algorithm for text clustering

Unsupervised KPIs-Based Clustering of Jobs in HPC Data Centers.

An Improved Mean Imputation Clustering Algorithm for Incomplete Data

Hybrid of Hierarchical and Partitional Clustering Algorithm for Gene Expression Data

K-PbC: an improved cluster center initialization for categorical data clustering

A constrained agglomerative clustering approach for unipartite and bipartite networks with application to credit networks

A Framework to Perform Asset Allocation Based on Partitional Clustering

Stochastic Numerical P Systems With Application in Data Clustering Problems

Clustering Mixed Numeric and Categorical Data With Cuckoo Search

Sequential dimension reduction and clustering of mixed-type data

Repartitioned Optimized K-Mean Centroid Based Partitioned Clustering using MapReduce in Analyzing High Dimensional Big Data

Structure and kinematics of the Taurus star-forming region from Gaia-DR2 and VLBI astrometry

How many models/atlases are needed as priors for capturing anatomic population variations?

Two-Stage Load Profiling of HV Feeders of a Distribution System

Fuzzy clustering and fuzzy c-means partition cluster analysis and validation studies on a subset of citescore dataset

Параллельный алгоритм кластеризации данных для многоядерных ускорителей Intel MIC

An improved approach to fuzzy clustering based on FCM algorithm and extended VIKOR method

A Fast Multiobjective Fuzzy Clustering with Multimeasures Combination

Selection of 'K' in K-means clustering using GA and VMA

Using bagging to enhance clustering procedures for planar shapes

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Partitional Clustering Algorithms Research Articles

Related Topics

Articles published on Partitional Clustering Algorithms

Improved fast partitional clustering algorithm for text clustering

Unsupervised KPIs-Based Clustering of Jobs in HPC Data Centers.

An Improved Mean Imputation Clustering Algorithm for Incomplete Data

Hybrid of Hierarchical and Partitional Clustering Algorithm for Gene Expression Data

K-PbC: an improved cluster center initialization for categorical data clustering

A constrained agglomerative clustering approach for unipartite and bipartite networks with application to credit networks

A Framework to Perform Asset Allocation Based on Partitional Clustering

Stochastic Numerical P Systems With Application in Data Clustering Problems

Clustering Mixed Numeric and Categorical Data With Cuckoo Search

Sequential dimension reduction and clustering of mixed-type data

Repartitioned Optimized K-Mean Centroid Based Partitioned Clustering using MapReduce in Analyzing High Dimensional Big Data

Structure and kinematics of the Taurus star-forming region from Gaia-DR2 and VLBI astrometry

How many models/atlases are needed as priors for capturing anatomic population variations?

Two-Stage Load Profiling of HV Feeders of a Distribution System

Fuzzy clustering and fuzzy c-means partition cluster analysis and validation studies on a subset of citescore dataset

Параллельный алгоритм кластеризации данных для многоядерных ускорителей Intel MIC

An improved approach to fuzzy clustering based on FCM algorithm and extended VIKOR method

A Fast Multiobjective Fuzzy Clustering with Multimeasures Combination

Selection of 'K' in K-means clustering using GA and VMA

Using bagging to enhance clustering procedures for planar shapes