Anomaly Detection Research Articles

Abstract Introduction Digital disease surveillance (DDS) detects public health events from internet-based data e.g., online news. Event features depicting epidemiological and social characteristics of health events can be extracted from news using the natural language process techniques. However, few studies have leveraged the event features to support anomaly detection in DDS. We aimed to understand the distribution of the event features and explore anomaly detection using the frequency of these features. Methods We collected event data from COVID-19-related news collected from October 1 to December 31, 2021, sourced from BioCaster, an infectious-disease-focused DDS system. The predefined event features in BioCaster include disease, pathogen, location and 14 binary features, such as if an event was caused by an unclassified virus. We described the distribution of the features and detected changes in the frequency of event features using a Bayesian online change point detection. We compared the change points with the number of new cases and of genomic samples collected. Results We included 170,168 news articles reporting COVID-19 in 155 countries. The event feature indicating that an event was caused by an unclassified virus was identified as positive among 3831 (2.25%) news and 12.91% of news had positive value for the feature indicating cases who had travelled across international borders. The change points detected from these two features were temporally correlated to the emergence of the Omicron variant in corresponding countries, which was more significant in countries with at least 300 news articles. Conversely, event features irrelevant to this case study, e.g., if the cases were military workers, were identified as negative in all news and no change points were detected. Conclusions Our study highlights the potential of monitoring the frequency of event features extracted from online news for anomaly detection in DDS, which relies on sufficient news coverage. Key messages • Monitoring the event features extracted from online news provide is useful approach for automatic anomaly detection in digital disease surveillance. • Increasing media coverage is fundamental for improving the early detection in a digital disease surveillance system.

Read full abstract

Abstract Background Heart rate (HR) tracking by wrist-worn devices using photoplethysmography (PPG) could assist in continuously following up physical activity. However, the accuracy can be impacted by (motion) artefacts. Machine learning models could help to recognise artefacts in PPG-based HR data. The choice of classifier in these machine learning models is a determing factor for task performance of the model. Purpose This study evaluates and determines the optimal classifier for a new machine learning-based approach to enhance the reliability of artefact detection in PPG-based HR data. Methods A total of 62 participants (27 cardiac rehabilitation patients, 35 healthy athletes) wore both a test device and a reference device measuring HR continuously for 24 hours. A training dataset was prepared, assigning two independent labels (i.e. anomaly and activity) to each HR episode based on the reference device data. Fitbit data were processed using our in-house designed artefact removal procedure, which involves the application of two classification models: one for anomaly detection and another for activity detection. Four distinct classifiers were employed for both models: Balanced Bagging, Balanced Bagging with Random Forest, Balanced Random Forest, and Logistic Regression. Each classifier was evaluated using area under the receiver operating characteristic curve (ROC-AUC), accuracy, sensitivity and specificity. Results Of the 1,647,328 HR data points collected, 103,095 (6.26%) were identified as artefacts. Figure 1 and Figure 2 summarise the performance of the distinct classifiers for the anomaly model and the activity model, respectively. Balanced Bagging and Balanced Bagging with Random Forest consistently demonstrate the highest AUC values and accuracies across both anomaly and activity detection models (anomaly detection: AUC = 0.95, accuracy = 89-85%; activity detection: AUC = 0.98, accuracy = 95%). Comparing these two, Balanced Bagging with Random Forest emerges as the preferred option, given the highest sensitivity in both anomaly detection (93%&gt;86%) and activity detection models (99%&gt;96%). In contrast, Balanced Random Forest and Logistic Regression exhibit inferior performance. In the anomaly detection model, Balanced Random Forest exhibits a lower sensitivity of 75%, while Logistic Regression performs even worse with a sensitivity of 25%. Similarly, in the activity detection model, both Balanced Random Forest and Logistic Regression demonstrate diminished performance. Conclusions Balanced Bagging with Random Forest emerges as the optimal classifier to detect anomalies and activities in continuous PPG-based HR data, thus contributing to the optimisation of our in-house designed procedure for removing artefacts. This processing aims to provide a reliable and automatic way for continuous HR monitoring, which can help monitor and guide physical activities.

Read full abstract

Anomaly Detection Research Articles

Related Topics

Articles published on Anomaly Detection

Libby-Novick Beta-Liouville Distribution for Enhanced Anomaly Detection in Proportional Data

IoT Leak Detection System for Onshore Oil Pipeline Based on Thermography.

Edge Artificial Intelligence for Electrical Anomaly Detection Based on Process-In-Memory Chip

DA-Net: A classification-guided network for dental anomaly detection from dental and maxillofacial images

Comprehensive Review of One-Class Classification Approaches for Anomaly Detection

AIoT-Based Visual Anomaly Detection in Photovoltaic Sequence Data via Sequence Learning

XAI-Based Accurate Anomaly Detector That Is Robust Against Black-Box Evasion Attacks for the Smart Grid

Advancing athlete safety through real-time ECG monitoring for enhanced cardiovascular health in sports performance

Credit Anomaly Detection Method based on Bayesian Networks

Monitoring event data extracted from online news for outbreak detection

Optimisation of artefact detection in photoplethysmography heart rate data: influence of different classifiers in machine learning models

Cross-validatory Z-Residual for Diagnosing Shared Frailty Models

Advancing Algorithmic Adaptability in Hyperspectral Anomaly Detection with Stacking-Based Ensemble Learning

Chemometric approaches for discriminating manufacturers of Korean handmade paper using infrared spectroscopy

Privacy-preserving MTS anomaly detection for network devices through federated learning

D2-SPDM: Faster R-CNN-Based Defect Detection and Surface Pixel Defect Mapping with Label Enhancement in Steel Manufacturing Processes

Enhancing IoT Security Using GA-HDLAD: A Hybrid Deep Learning Approach for Anomaly Detection

Nonlinear Reinforcement Learning-Based Dynamic Test Case Prioritization with Anomaly Detection for Continuous Integration

Outlier detection in classification based on feature-selection-based regression

Taxonomy and Survey of Collaborative Intrusion Detection System using Federated Learning

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Anomaly Detection Research Articles

Related Topics

Articles published on Anomaly Detection

Libby-Novick Beta-Liouville Distribution for Enhanced Anomaly Detection in Proportional Data

IoT Leak Detection System for Onshore Oil Pipeline Based on Thermography.

Edge Artificial Intelligence for Electrical Anomaly Detection Based on Process-In-Memory Chip

DA-Net: A classification-guided network for dental anomaly detection from dental and maxillofacial images

Comprehensive Review of One-Class Classification Approaches for Anomaly Detection

AIoT-Based Visual Anomaly Detection in Photovoltaic Sequence Data via Sequence Learning

XAI-Based Accurate Anomaly Detector That Is Robust Against Black-Box Evasion Attacks for the Smart Grid

Advancing athlete safety through real-time ECG monitoring for enhanced cardiovascular health in sports performance

Credit Anomaly Detection Method based on Bayesian Networks

Monitoring event data extracted from online news for outbreak detection

Optimisation of artefact detection in photoplethysmography heart rate data: influence of different classifiers in machine learning models

Cross-validatory Z-Residual for Diagnosing Shared Frailty Models

Advancing Algorithmic Adaptability in Hyperspectral Anomaly Detection with Stacking-Based Ensemble Learning

Chemometric approaches for discriminating manufacturers of Korean handmade paper using infrared spectroscopy

Privacy-preserving MTS anomaly detection for network devices through federated learning

D2-SPDM: Faster R-CNN-Based Defect Detection and Surface Pixel Defect Mapping with Label Enhancement in Steel Manufacturing Processes

Enhancing IoT Security Using GA-HDLAD: A Hybrid Deep Learning Approach for Anomaly Detection

Nonlinear Reinforcement Learning-Based Dynamic Test Case Prioritization with Anomaly Detection for Continuous Integration

Outlier detection in classification based on feature-selection-based regression

Taxonomy and Survey of Collaborative Intrusion Detection System using Federated Learning