Abstract

The government work report of the State Council is a kind of comprehensive policy text. This paper uses text mining technology to carry out a comprehensive multi-granularity, multi-level quantitative analysis of the government work reports, which has a great practical and instructive significance for relevant personnels to understand the evolution of domain knowledge in a short time. Firstly, a series of text preprocessing is done by using the Chinese word segmentation tool combined with three kind of dictionary built by authors, i.e., the domain word dictionary, the domain synonym dictio­nary and the domain stopword dictionary. Then, according to the co-occurrence information of words in the government work reports, we attempt to conduct topic modeling on the corpus consisted of all the government work reports and single government work report respectively, Finally, we find 12 latent topics for the corpus, such as the Economic reform, Agriculture, Government construction, Defense military and so on. Based on the 12 topics, we conduct the topic modeling on every single government work report, with which topic evolution analysis is carried out over the whole period of all government work reports.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.