Abstract

Deep clustering algorithms perform learning feature representations and clustering tasks jointly by using neural networks with significantly improved performance over the traditional k-means or spectral clustering. Some groundbreaking proposals extract data spaces directly in “bags of words” approach without considering the semantic information of each document as inputs to deep auto-encoder networks. But these algorithms suffer from inaccurate feature space from the encoder output when dealing with incomprehensible and high-dimensional data. For solving this problem in this paper, an Attention-based Deep Embedded Clustering (ADEC) algorithm is proposed to improve representation of data space. ADEC extracts high quality embedded features and performs clustering jointly with learning embedded features which are suitable for document clustering. The experimental result shows that the performance and accuracy of document clustering is improved significantly using the ADEC clustering framework on two datasets REUTERS-10K and REUTERS.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.