Abstract

To fit for the needs researching the security of Tibetan network public sentiment, the approach is to discover Tibetan public sentiment of information is proposed. Firstly Tibetan web pages are collected. Secondly preprocessing is conducted to extract the useful information from Web pages. Then the word segmentation and text representation are introduced. Finally the text similarity calculation is proposed to classify the text according to the public opinion words table.timent of information. It is meaningful for timely retrieving Tibetan public sentiment information of network and providing technical support for the text alignment technology of Tibetan and Chinese.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call