Abstract

BackgroundA new illness can come to public attention through social media before it is medically defined, formally documented, or systematically studied. One example is a condition known as breast implant illness (BII), which has been extensively discussed on social media, although it is vaguely defined in the medical literature.ObjectiveThe objective of this study is to construct a data analysis pipeline to understand emerging illnesses using social media data and to apply the pipeline to understand the key attributes of BII.MethodsWe constructed a pipeline of social media data analysis using natural language processing and topic modeling. Mentions related to signs, symptoms, diseases, disorders, and medical procedures were extracted from social media data using the clinical Text Analysis and Knowledge Extraction System. We mapped the mentions to standard medical concepts and then summarized these mapped concepts as topics using latent Dirichlet allocation. Finally, we applied this pipeline to understand BII from several BII-dedicated social media sites.ResultsOur pipeline identified topics related to toxicity, cancer, and mental health issues that were highly associated with BII. Our pipeline also showed that cancers, autoimmune disorders, and mental health problems were emerging concerns associated with breast implants, based on social media discussions. Furthermore, the pipeline identified mentions such as rupture, infection, pain, and fatigue as common self-reported issues among the public, as well as concerns about toxicity from silicone implants.ConclusionsOur study could inspire future studies on the suggested symptoms and factors of BII. Our study provides the first analysis and derived knowledge of BII from social media using natural language processing techniques and demonstrates the potential of using social media information to better understand similar emerging illnesses.

Highlights

  • The ubiquity of social media has resulted in early descriptions of new and evolving diseases on social media platforms before they can be systematically studied [1,2,3,4,5,6,7], during the era of the medical internet [8,9,10,11,12,13,14]

  • Note that we used concept unique identifiers (CUIs) in latent Dirichlet allocation (LDA) to derive the topic and word distributions, but we have presented the most frequent mentions that were mapped to the respective CUIs in these tables

  • Our methods and informatics strategies applied in this study would provide working examples for analyzing other emerging but not well-defined illnesses from social media data

Read more

Summary

Introduction

Background The ubiquity of social media has resulted in early descriptions of new and evolving diseases on social media platforms before they can be systematically studied [1,2,3,4,5,6,7], during the era of the medical internet [8,9,10,11,12,13,14]. 1 (page number not for citation purposes) studies have demonstrated social media as an effective tool to disseminate information regarding symptoms, personal well-being, and public health resources during multiple influenza outbreaks [25,26,27,28]. From the analysis of Weibo (Weibo Corporation) posts, Huang et al [30] concluded that most of the affected patients were older persons, with fever as the most common symptom. These studies demonstrate that public social media data can be leveraged to better understand emerging illnesses and to accommodate prompt responses. One example is a condition known as breast implant illness (BII), which has been extensively discussed on social media, it is vaguely defined in the medical literature

Objectives
Methods
Results
Discussion
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.