Abstract
Keyphrase Extraction (KE) aims to identify a concise set of words or phrases that effectively summarizes the core ideas of a document. Recent embedding-based models have achieved state-of-the-art performance by jointly modeling local and global contexts in Unsupervised Keyphrase Extraction (UKE). However, these models often ignore either sentence- or document-level contexts, leading directly to weak or incorrect global significance. Furthermore, they rely heavily on local significance, making them vulnerable to noisy data, particularly in long documents, resulting in unstable and suboptimal performance. Intuitively, hierarchical contexts enable a more accurate understanding of the candidates, thereby enhancing their global relevance. Inspired by this, we propose a novel Hierarchical Context-aware Unsupervised Keyphrase Extraction method called HCUKE. Specifically, HCUKE comprises three core modules: (i) a hierarchical context-based global significance measure module that incrementally learns global semantic information from a three-level hierarchical structure; (ii) a phrase-level local significance measure module that captures local semantic information by modeling the context interaction among candidates; and (iii) a candidate ranking module that integrates the measure scores with positional weights to compute a final ranking score. Extensive experiments on three benchmark datasets demonstrate that the proposed method significantly outperforms state-of-the-art baselines.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.