Abstract
At present, the main method of keyword extraction is usually based on topical discovery model, and the most popular model is LDA (Latent Dirichlet Allocation) topical model. However, LDA model has poor performance in dealing with short texts, which is the imperfection of the current topical discovery model. This paper propose a keywords extraction model SDMM (Statistical Dirichlet Multinomial model) to extract the keywords of short financial review. This keywords extraction model use DMM (Dirichlet Multinomial Model) for topic discovery, and calculate the similarity between the words and keywords, thus improve the performance of keyword extraction for short texts. Experimental results on several financial review datasets show that the proposed model is better than the existing models in evaluation indicator Precision, Recall and F-measure.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.