Abstract
Background: Since the outbreak of the COVID-19 virus in Wuhan, China, in early 2020, the Chinese government has formed a mode of information disclosure. More than 400 cities have announced specific location information for newly diagnosed cases of novel coronavirus pneumonia, including residential areas or places of stay. We have established a conditional random field model and a rule-dependent model based on Chinese geographical name elements. Taking Guangdong Province as an example, the identification of named entities and the automatic extraction of epidemic-related sites are carried out. This method will help locate the spread of the epidemic, prevent and control the spread of the epidemic, and gain more time for vaccine clinical trials. Methods: Based on the presentation form of the habitual place or place of stay of the diagnosed cases in the text of the web page, a conditional random field model is established, and a rule-dependent model is established according to the combination rule of the elements of the place words and the place name dictionary composed of provinces, cities and administrative regions. Findings: The results of the analysis based on the conditional random field model and the rule-dependent model show that the location of confirmed cases of new coronavirus pneumonia in Guangdong Province in mid-February is mainly concentrated in Guangzhou,Shenzhen,Zhuhai and Shantou Cities. In Guangzhou, Futian District has more epidemicsites and Huangpu and Conghua District have fewer epidemic sites. Government officials in Guangzhou City should pay attention to Futian District. Interpretation: Governments at all levels in Guangzhou Province have intervened to control the epidemic through various means in mid-February. According to the results of the model analysis, we believe that the administrative regions with more diagnosed locations should focus on and take measures such as blockades and control of personnel flow to control the disease in those administrative regions to avoid affecting other adjacent administrative regions.
Highlights
In early 2020, the COVID-19 virus began to erupt from Wuhan, Hubei, China, and it has only become a global epidemic in just a few months
The results of the analysis based on the conditional random field model and the rule-dependent model show that the location of confirmed cases of new coronavirus pneumonia in Guangdong Province in mid-February is mainly concentrated in Guangzhou,Shenzhen,Zhuhai and Shantou Cities
3.1 Place name entity recognition result This article uses the corpus marked by People's Daily in January 1998, of which 80% is selected as the training set, the remaining 20% is used as the closed test set, and the COVID-19 outbreak news release crawled through the Internet will be used as the open test set
Summary
In early 2020, the COVID-19 virus began to erupt from Wuhan, Hubei, China, and it has only become a global epidemic in just a few months. From the epidemiological point of view, the specific location information such as the residential area or place of stay where the confirmed cases of new coronary pneumonia are published, is conducive to the individual in the life, and to the government to establish an epidemic transmission channel to prevent and control the spread of the epidemic. Taking Guangdong Province as an example, the identification of named entities and the automatic extraction of epidemic-related sites are carried out This method will help locate the spread of the epidemic, prevent and control the spread of the epidemic, and gain more time for vaccine clinical trials
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.