Abstract
Government document analysis can help organizations to understand key policy points and make right response. To realize intelligent analysis of a volume of policies, we propose a discourse parsing technique for in-depth understanding of Chinese government documents (CGDs). Based on Superstructure Schema and Rhetorical Structure Theory (RST), we characterize the stylistic features and macrostructure patterns of CGDs and develop a discourse analysis framework to specify its functional structure and semantic system. A tree schema is adopted to portray the hierarchy of textual fragments in full text and formalize their representations in machine-based parsing, then a multi-head partition self-attention model is constructed to evaluate the discourse analysis framework. Experiment results show that our parsing model embedding with inherent CGD discourse features performs better than baselines. The highest f1-score in span and label prediction tasks researches 0.9209. The parsing model can be migrated to other discourse parsing tasks by considering specific genre features of different documents.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.