Abstract

The paper presents the researches to determine the effectiveness of different criteria to estimate the complex biology objects clustering quality. The gene expression sequences of cancer patients were used as experimental data. The degree of the studied objects similarity was estimated by the comparison of the gene expression sequences profile using different metrics to estimate the objects proximity. The studies have shown that the best separating ability is obtained by using the correlation metric proximity of objects. Herewith the use of the CH criterion (Calinski-Harabasz) allows to get the most objective objects clustering by using simulated data. The presented research is focused mainly on the inductive model of the objective clustering, where the objects clustering is carried out concurrently on the two equal power subsets. In this case, the final decision about the objects grouping is accepted using the two subsets basing both on the internal clustering quality criteria estimating and the minimum value of the external criterion of clustering similarity.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.