Abstract

Language resource database is a language protection project gradually carried out in various regions in China in recent years, which aims to permanently retain the language characteristics of various regions through information technology. Through professional database architecture, the language characteristics of various regions in China are recorded and archived, so as to provide reference data resources for future generations' learning and research, help Chinese people fully understand the language resources and national conditions in various regions of China, scientifically plan the development of language resources, drive the promotion of Putonghua and realize the scientific development of language resources. In order to further extract the language resources needed to build the knowledge map, the author analyzes the structured data in the Internet through Python crawler technology. After that, the crawler software automatically grabs the topic information of network language resources and extracts the corresponding language resources through regular expressions.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call