Text Clustering Based on a Divide and Merge Strategy

Man Yuan,Yong Shi

doi:10.1016/j.procs.2015.07.153

Text Clustering Based on a Divide and Merge Strategy

Man Yuan, Yong Shi

Open Access

https://doi.org/10.1016/j.procs.2015.07.153

Copy DOI

Journal: Procedia Computer Science	Publication Date: Jan 1, 2015
Citations: 3	License type: cc-by-nc-nd

Affiliation: China Huarong Energy (China), University of Chinese Academy of Sciences, Beijing Institute of Big Data Research, Chinese Academy of Sciences

#Merge Strategy #Text Clustering Algorithm + Show 8 more

Abstract
Full-Text
Similar Papers

Abstract

Abstract A text clustering algorithm is proposed to overcome the drawback of division based clustering method on sensitivity of estimated class number. Complex features including synonym and co-occurring words are extracted to make a feature space containing more semantic information. Then the divide and merge strategy helps the iteration converge to a reasonable cluster number. Experimental results showed that the dynamically updated center number prevent the deterioration of clustering result when k deviates from the real class numbers. When k is too small or large, the difference of clustering results between FC-DM and k-means is more obvious and FC-DM also outperformed other benchmark algorithms.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: Procedia Computer Science

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.