Chameleon Clustering Research Articles

Medicine is a fast-moving field, and the number of medical publications has increased rapidly over recent years. How to find relevant information from this vast pool of research effectively and efficiently has therefore become highly challenges. Previous studies have demonstrated that data fusion can improve search performance if properly utilized. However, in most cases effectiveness is the only concern and efficiency is not considered. A fusion-based system is by nature more complicated and expensive computationally than other retrieval models such as BM25, because many component retrieval systems and an extra layer of fusion are required. The number of component retrieval systems involved is an important indicator of complexity of the fusion-based system. We aim to select the optimal k-subset of component retrieval systems for any given number k, to optimize both fusion performance and reduce the cost of data fusion. A clustering-based approach is proposed. First all the candidates are divided into clusters by the Chameleon clustering algorithm, then representatives from every cluster are chosen by Sequential Forward Selection for fusion. Evaluated with two datasets from TREC, the proposed method performs more effectively than the other baseline methods including the state-of-the-art subset selection method significantly. When either of the two typical fusion methods is used, an improvement rate of over 10% is observed for both measures Mean Average Precision and Recall-level Precision, and an improvement rate of over 5% is observed for both measures Precision at 10 document level and Mean Reciprocal Rank.

ResearchThe body composition model is closely related to the physiological characteristics of the human body. At the same time there can be a large number of physiological characteristics, many of which may be redundant or irrelevant. In existing human physiological feature selection algorithms, it is difficult to overcome the impact that redundancy and irrelevancy may have on human body composition modeling. This suggests a role for selection algorithms, where human physiological characteristics are identified using a combination of filtering and improved clustering. To do this, a feature filtering method based on Hilbert-Schmidt dependency criteria is first of all used to eliminate irrelevant features. After this, it is possible to use improved Chameleon clustering to increase the combination of sub-clusters amongst the characteristics, thereby removing any redundant features to obtain a candidate feature set for human body composition modeling. MethodWe report here on the use of an algorithm to filter the characteristic parameters in INBODY770 (this paper used INBODY 770 as body composition analyzer.) measurement data, which has three commonly-used impedance bands (1 kHZ, 250 kHZ, 500 kHZ). This algorithm is able to filter out parameters that have a low correlation with body composition BFM. The algorithm is also able to draw upon improved clustering techniques to reduce the initial feature set from 29 parameters to 10 parameters for any parameters of the 250 kHZ band that remain after filtering. In addition, we also examined the impact of different sample sizes on feature selection.ResultThe proposed algorithm is able to remove irrelevant and redundant features and the resulting correlation between the model and the body composition (BFM which is a whole body fat evaluation can better assess the body's overall fat and muscle composition.) is 0.978, thereby providing an improved model for prediction with a relative error of less than 0.12.

Chameleon Clustering Research Articles

Related Topics

Articles published on Chameleon Clustering

Clustering-based fusion for medical information retrieval

A human body physiological feature selection algorithm based on filtering and improved clustering.

Topology and Topic-Aware Service Clustering

基于耦合关系的医生用药异常分析

Chameleon Clustering Algorithm with Semantic Analysis Algorithm for Efficient Web Usage Mining

Parallel Chameleon Clustering Based on MapReduce

Parallel Algorithm for the Chameleon Clustering Algorithm using Dynamic Modeling

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Chameleon Clustering Research Articles

Related Topics

Articles published on Chameleon Clustering

Clustering-based fusion for medical information retrieval

A human body physiological feature selection algorithm based on filtering and improved clustering.

Topology and Topic-Aware Service Clustering

基于耦合关系的医生用药异常分析

Chameleon Clustering Algorithm with Semantic Analysis Algorithm for Efficient Web Usage Mining

Parallel Chameleon Clustering Based on MapReduce

Parallel Algorithm for the Chameleon Clustering Algorithm using Dynamic Modeling