Abstract

Due to the limited length and freely constructed sentence structures, short text is different from normal text, which makes traditional algorithm of text representation does not work well on it. This paper proposes a model called Conceptual and Semantic Enrichment with Topic Model (CSET) by combining Biterm Topic Model (BTM), a widely used probabilistic topic model which is designed for short text with Probase, a large-scale probabilistic knowledge base. CSET is able to capture semantic relations between words to enrich short text. Our model enables large amount of applications that rely on semantic understanding of short text, including short text classification and word similarity measurement in context.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.