Abstract

There have been many domain-specific keyword extraction researches, but micro-blog- oriented keyword extraction is just beginning. This paper researches into the keyword extraction from Chinese micro-blog. Taking the characteristics of micro-blog into account, such as short, topic divergence, etc., we propose a Chinese micro-blog keyword extraction method based on the combination of multi features. Firstly create the graph model based on the co-occurrence between words, get a kind of weight based on the created graph model. The weight based on the graph model is sometimes same. In order to solve this problem, this method secondly proposes to create the semantic space based on the topic detection method, and get the statistical weight based on the semantic space. Finally, we take the location of words into account during the extraction, which is proved to be a very effective feature. Experimental results show that the proposed keyword extraction method is very successful.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call