Abstract

The authors defined an information measurement associated with a topic or semantics for a keyword.Firstly,the topic-based corpus was obtained.Then the latent semantic vector space model of the corpus was established.After that,the information measurement of the keyword was defined through the model.Accordingly,the amount of the topic information any document contained could be calculated.Lastly,the membership measurement which measured the membership degree of the document belonging to the topic was introduced.A measurement threshold was set,thereby it determined whether the documents belonging to the topic or not.The experimental results show that the definition of the information measurement can get over the difficulty of the word-match search and really reach the goal of the semantic-match search.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call