GitHub Repository Research Articles

Health agencies have been widely adopting social media to disseminate important information, educate the public on emerging health issues, and understand public opinions. The Centers for Disease Control and Prevention (CDC) widely used social media platforms during the COVID-19 pandemic to communicate with the public and mitigate the disease in the United States. It is crucial to understand the relationships between the CDC's social media communications and the actual epidemic metrics to improve public health agencies' communication strategies during health emergencies. This study aimed to identify key topics in tweets posted by the CDC during the pandemic, investigate the temporal dynamics between these key topics and the actual COVID-19 epidemic measures, and make recommendations for the CDC's digital health communication strategies for future health emergencies. Two types of data were collected: (1) a total of 17,524 COVID-19-related English tweets posted by the CDC between December 7, 2019, and January 15, 2022, and (2) COVID-19 epidemic measures in the United States from the public GitHub repository of Johns Hopkins University from January 2020 to July 2022. Latent Dirichlet allocation topic modeling was applied to identify key topics from all COVID-19-related tweets posted by the CDC, and the final topics were determined by domain experts. Various multivariate time series analysis techniques were applied between each of the identified key topics and actual COVID-19 epidemic measures to quantify the dynamic associations between these 2 types of time series data. Four major topics from the CDC's COVID-19 tweets were identified: (1) information on the prevention of health outcomes of COVID-19; (2) pediatric intervention and family safety; (3) updates of the epidemic situation of COVID-19; and (4) research and community engagement to curb COVID-19. Multivariate analyses showed that there were significant variabilities of progression between the CDC's topics and the actual COVID-19 epidemic measures. Some CDC topics showed substantial associations with the COVID-19 measures over different time spans throughout the pandemic, expressing similar temporal dynamics between these 2 types of time series data. Our study is the first to comprehensively investigate the dynamic associations between topics discussed by the CDC on Twitter and the COVID-19 epidemic measures in the United States. We identified 4 major topic themes via topic modeling and explored how each of these topics was associated with each major epidemic measure by performing various multivariate time series analyses. We recommend that it is critical for public health agencies, such as the CDC, to update and disseminate timely and accurate information to the public and align major topics with key epidemic measures over time. We suggest that social media can help public health agencies to inform the public on health emergencies and to mitigate them effectively.

Read full abstract

Detecting ink mismatch is a significant challenge in verifying the authenticity of documents, especially when dealing with uneven ink distribution. Conventional imaging methods frequently fail to distinguish visually similar inks. Our study presents a novel hyperspectral unmixing approach to detect ink mismatches in unbalanced clusters. The proposed method identifies unique spectral characteristics of different inks employing k-means clustering and Gaussian mixture models (GMMs) to perform color segmentation on different ink types and utilizes elbow estimation and silhouette coefficient to evaluate the number of inks estimation precisely. For a more accurate estimation of quantity, which is generally not an attribute of clustering methods, we employed entropy calculations in the red, green, and blue depth channels for precise abundance estimation of ink. This unique combination of basic techniques in conjunction exhibits better efficacy in performing ink unmixing and provides a real-world document forensic solution compared to current methods that rely on assumptions like prior knowledge of the inks used in a document and deep learning-based methods that rely heavily on abundant training datasets. We evaluate our approach on the iVision handwritten hyperspectral images dataset (iVision HHID), which is a comprehensive and rich dataset that surpasses the commonly-used UWA writing inks hyperspectral images (WIHSI) database in size and diversity. This study has accomplished the unmixing task with three main challenges: unmixing of diverse ink spectral signatures (149 spectral bands instead of 33 bands in the previous dataset), without using prior knowledge and assumptions about the number of inks used in the questioned document, and not requiring large training data for performing unmixing. Furthermore, the security of the proposed document authentication methodology to address the likelihood of forgeries or manipulations in questioned documents is enhanced as compared to previous works relying on known inks and known spectrum. Randomization techniques and anomaly detection mechanisms are used in our methodology which increases the difficulty for adversaries to predict and manipulate specific aspects of the input data in questioned documents, thereby enhancing the robustness of our method. The code for conducting this research can be accessed at GitHub repository.

Read full abstract

GitHub Repository Research Articles

Related Topics

Articles published on GitHub Repository

Interpreting protein abundance in Saccharomyces cerevisiae through relational learning.

Ds-Seq: An Integrated Pipeline for In Silico Small RNA Se-quence Analysis for Host-pathogen Interaction Studies

RI−Calc: A user friendly software and web server for refractive index calculation

Dynamic Associations Between Centers for Disease Control and Prevention Social Media Contents and Epidemic Measures During COVID-19: Infoveillance Study.

A hyperspectral unmixing approach for ink mismatch detection in unbalanced clusters

IDSL_MINT: a deep learning framework to predict molecular fingerprints from mass spectra

A gentle introduction to computer vision-based specimen classification in ecological datasets.

CAPTVRED: an automated pipeline for viral tracking and discovery from capture-based metagenomics samples

MvlearnR and Shiny App for multiview learning.

Optimal linear ensemble of binary classifiers.

Synphage: a pipeline for phage genome synteny graphics focused on gene conservation.

JobEdKG: An uncertain knowledge graph-based approach for recommending online courses and predicting in-demand skills based on career choices

M-Ionic: prediction of metal-ion-binding sites from sequence using residue embeddings.

Open source and reproducible and inexpensive infrastructure for data challenges and education

Computational reproducibility of Jupyter notebooks from biomedical publications.

Rworkflows: automating reproducible practices for the R community

Meta-Scaler: A Meta-Learning Framework for the Selection of Scaling Techniques.

Augmenting a training dataset of the generative diffusion model for molecular docking with artificial binding pockets.

A MACHINE LEARNING BASED CLASSIFICATION AND PREDICTION TECHNIQUE FOR DDOS ATTACKS

DUNE Computing Tutorials

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

GitHub Repository Research Articles

Related Topics

Articles published on GitHub Repository

Interpreting protein abundance in Saccharomyces cerevisiae through relational learning.

Ds-Seq: An Integrated Pipeline for In Silico Small RNA Se-quence Analysis for Host-pathogen Interaction Studies

RI−Calc: A user friendly software and web server for refractive index calculation

Dynamic Associations Between Centers for Disease Control and Prevention Social Media Contents and Epidemic Measures During COVID-19: Infoveillance Study.

A hyperspectral unmixing approach for ink mismatch detection in unbalanced clusters

IDSL_MINT: a deep learning framework to predict molecular fingerprints from mass spectra

A gentle introduction to computer vision-based specimen classification in ecological datasets.

CAPTVRED: an automated pipeline for viral tracking and discovery from capture-based metagenomics samples

MvlearnR and Shiny App for multiview learning.

Optimal linear ensemble of binary classifiers.

Synphage: a pipeline for phage genome synteny graphics focused on gene conservation.

JobEdKG: An uncertain knowledge graph-based approach for recommending online courses and predicting in-demand skills based on career choices

M-Ionic: prediction of metal-ion-binding sites from sequence using residue embeddings.

Open source and reproducible and inexpensive infrastructure for data challenges and education

Computational reproducibility of Jupyter notebooks from biomedical publications.

Rworkflows: automating reproducible practices for the R community

Meta-Scaler: A Meta-Learning Framework for the Selection of Scaling Techniques.

Augmenting a training dataset of the generative diffusion model for molecular docking with artificial binding pockets.

A MACHINE LEARNING BASED CLASSIFICATION AND PREDICTION TECHNIQUE FOR DDOS ATTACKS

DUNE Computing Tutorials