Abstract

Background: Since the outbreak of COVID-19 in Wuhan, China, in early 2020, the Chinese government has adopted a practice of public information disclosure. More than 400 cities have announced specific location information for newly diagnosed cases of novel coronavirus pneumonia, including residential areas and places of stay. We established a conditional random field model and a rule-dependent model based on Chinese geographical name elements. Taking Guangdong Province as an example, we perform named entity recognition and automatically extract epidemic-related sites. This method helps locate the spread of the epidemic, supports its prevention and control, and gains more time for vaccine clinical trials.

Methods: A conditional random field model is built on the way the habitual residence or place of stay of diagnosed cases is presented in web page text. A rule-dependent model is built on the combination rules of place name elements together with a place name dictionary composed of provinces, cities, and administrative regions.

Findings: The analysis based on the conditional random field model and the rule-dependent model shows that the locations of confirmed cases of novel coronavirus pneumonia in Guangdong Province in mid-February were mainly concentrated in Guangzhou, Shenzhen, Zhuhai, and Shantou. In Guangzhou, Futian District has more epidemic sites, while Huangpu and Conghua Districts have fewer. Government officials in Guangzhou should pay attention to Futian District.

Interpretation: Governments at all levels in Guangdong Province intervened to control the epidemic through various means in mid-February. Based on the model analysis, we believe that administrative regions with more diagnosed locations should be the focus of attention and should take measures such as blockades and control of personnel flow, so that the disease is contained there and does not affect adjacent administrative regions.

Highlights

  • In early 2020, COVID-19 began to spread from Wuhan, Hubei, China, and became a global epidemic within just a few months

  • The analysis based on the conditional random field model and the rule-dependent model shows that the locations of confirmed cases of novel coronavirus pneumonia in Guangdong Province in mid-February were mainly concentrated in Guangzhou, Shenzhen, Zhuhai, and Shantou

  • Place name entity recognition results (Section 3.1): this article uses the People's Daily corpus annotated in January 1998, of which 80% is selected as the training set and the remaining 20% as the closed test set; COVID-19 outbreak news releases crawled from the Internet are used as the open test set
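
The 80%/20% division of the annotated corpus mentioned above can be sketched as follows. This is a minimal illustration only: the source does not say whether the split was random or sequential, so the shuffling, the `split_corpus` helper, the `seed` parameter, and the placeholder sentences are all assumptions.

```python
import random


def split_corpus(sentences, train_fraction=0.8, seed=0):
    """Split annotated sentences into a training set and a closed test set.

    Assumption: a seeded random shuffle is used; the paper may have split
    the corpus differently.
    """
    shuffled = list(sentences)
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


# Placeholder data standing in for annotated corpus sentences (hypothetical).
corpus = [f"sentence_{i}" for i in range(10)]
train_set, closed_test_set = split_corpus(corpus)
print(len(train_set), len(closed_test_set))  # 8 2
```

The open test set (the crawled outbreak news releases) is kept entirely separate and would never pass through this function.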


Summary

Introduction

In early 2020, COVID-19 began to spread from Wuhan, Hubei, China, and became a global epidemic within just a few months. From an epidemiological point of view, publishing specific location information, such as the residential areas or places of stay of confirmed cases of novel coronavirus pneumonia, helps individuals in their daily lives and helps the government identify transmission channels in order to prevent and control the spread of the epidemic. Taking Guangdong Province as an example, named entity recognition and the automatic extraction of epidemic-related sites are carried out. This method helps locate the spread of the epidemic, supports its prevention and control, and gains more time for vaccine clinical trials.
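
As a rough illustration of how a rule-dependent extractor of this kind might work, the sketch below combines a small place name dictionary with a suffix rule. It is not the authors' implementation: the `GAZETTEER` entries, the `SITE_SUFFIXES` list, the `extract_sites` function, the eight-character limit on the free-text part of a name, and the sample sentence are all hypothetical choices made for the example.

```python
import re

# Hypothetical mini-dictionary of administrative names (province, city, district).
GAZETTEER = ["广东省", "广州市", "深圳市", "黄埔区", "从化区"]

# Hypothetical elements that typically end a site name:
# residential community, garden estate, street office, hotel, village, road.
SITE_SUFFIXES = ["小区", "花园", "街道", "酒店", "村", "路"]


def _alternation(words):
    """Build a regex alternation, longest entries first."""
    ordered = sorted(words, key=len, reverse=True)
    return "|".join(re.escape(w) for w in ordered)


# Rule: one or more dictionary names, then a short run of Chinese characters
# (matched lazily), then a site suffix element.
SITE_PATTERN = re.compile(
    rf"(?:{_alternation(GAZETTEER)})+"
    rf"[\u4e00-\u9fff]{{1,8}}?"
    rf"(?:{_alternation(SITE_SUFFIXES)})"
)


def extract_sites(text):
    """Return every substring of text that satisfies the combination rule."""
    return SITE_PATTERN.findall(text)


# Invented sentence: "The patient lives in Xingfu Community, Huangpu District,
# Guangzhou, and visited Yangguang Hotel in Shenzhen."
sample = "患者居住在广州市黄埔区幸福小区，曾前往深圳市阳光酒店。"
print(extract_sites(sample))
# ['广州市黄埔区幸福小区', '深圳市阳光酒店']
```

A rule of this shape only finds sites that begin with a dictionary entry, which is one reason a statistical model such as a conditional random field is a useful complement.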
