Abstract

AbstractCorrelation clustering possibly represents the most intuitive form of clustering construction. It gives solutions that can be approximated while automatically selecting the number of clusters. This approach handles scenarios where the focus is on relationships between the objects instead of on actual representations of the objects. The suitability of this method extends to the structured objects, for which feature vectors are not easy to obtain. Given the increasing scale of data these days, correlation clustering has become a powerful addition to the fields of data mining and agnostic learning. Correlation clustering considers a weighted graph G=(V,E), where the edge weight indicates whether two nodes are similar (positive edge weight) or different (negative edge weight). The task is to find a clustering that either maximizes agreements or minimizes disagreements. Unlike other clustering algorithms, this does not require choosing the number of clusters (k) in advance. The objective to minimize the sum of weights of the cut edges is independent of the number of clusters. Methodologies, such as approximations and linear programming formulations, have been used to approach this problem. This paper focuses on the problem of correlation clustering and lists the solutions proposed by various researchers. These solutions approach the problem using different computational techniques. Correlation clustering‐based applications such as entity de‐duplication, signed social networks, and problem of aggregating multiples have also been discussed.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.