Abstract

For an emergent pandemic, such as COVID-19, the statistics of symptoms based on hospital data may be biased or delayed due to the high proportion of asymptomatic or mild-symptom infections who were not recorded in hospitals. Meanwhile, the difficulty of accessing large-scale clinical data also limits many researchers to conduct timely research. Given the wide coverage and promptness of social media, this study aimed to present an efficient workflow to track and visualize the dynamic characteristics and co-occurrences of symptoms for the pandemic from large-scale and long-term social media data. This retrospective study included 471,553,966 COVID-19-related tweets from February 1, 2020, to April 30, 2022. We curated a hierarchical symptom lexicon for social media containing 10 affected organs/systems, 257 symptoms, and 1808 synonyms. The dynamic characteristic of COVID-19 symptoms over time were analyzed from the perspectives of weekly new cases, overall distribution, and temporal prevalence of reported symptoms. The symptom evolutions between virus strains (Delta and Omicron) were investigated by comparing the symptom prevalence during their dominant periods. A co-occurrence symptom network was developed and visualized to investigate inner relationships among symptoms and affected body systems. This study identified 201 COVID-19 symptoms and grouped them into 10 affected body systems. The weekly quantity of self-reported symptoms has a high consistency (0.8528, P<.001) and one-week leading trend (0.8802, P<.001) with new COVID-19 infections. The frequency of symptoms showed dynamic changes as the pandemic progressed, from typical respiratory symptoms in the early stage to more musculoskeletal and nervous symptoms at later stages. We found the prevalence difference of symptoms between Delta period and Omicron period: less severe symptoms (coma and dyspnea), more flu-like symptoms (throat pain and nasal congestion), and less typical COVID symptoms (anosmia and taste altered) (All P<.001). Network analysis reveals the co-occurrences among symptoms and systems corresponding to specific disease progressions, including palpitation (cardiovascular) and dyspnea (respiratory), alopecia (musculoskeletal) and impotence (reproductive). This study identified more and milder COVID-19 symptoms than clinical research and characterize the dynamic symptom evolution based on 400 million tweets over 27 months. The symptom network reveals potential comorbidity risk and prognostic disease progressions. These demonstrate that the cooperation of social media and a well-designed workflow can depict a holistic picture of pandemic symptoms to complement clinical studies.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.