Abstract

In this paper, we present a constrained Dirichlet process mixture model with labels as side information. Specifically, the labeled information is incorporated through the use of a product partition prior to give clusters of instances with similar labels a higher prior preference. The proposed formulation is further extended to handle multiple side information. The empirical results on several benchmark datasets show that our method can consistently improve its clustering performance as more labeled data become available. Even in the presence of noisy labels, the proposed method rarely performs worse than its unsupervised counterpart. The effectiveness of the proposed method is also demonstrated through an application of magnetic resonance imaging for identifying major brain tissues.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.