Post-marketing surveillance refers to the process of monitoring the safety of drugs once they reach the market, after the successful completion of clinical trials. In this work, we investigate a computational approach using data mining tools to detect safety signals from post-market safety databases, or in other words, to identify adverse events (AEs) with disproportionately high reporting rates compared to other AEs, associated with a particular drug or a drug class. Essentially, we view this as a problem of cluster analysis-based anomaly detection on post-market safety data, where the goal is to 'unsupervisedly' detect the anomalous or the signal AEs. Our findings demonstrate the potential of using a clustering ensemble method to detect drug safety signals. It employs multiple clustering or anomaly detection algorithms, followed by a performance comparison of the candidate algorithms based on a collection of appropriate measures of goodness of clustering results. The method is general enough to include any number of clustering or anomaly detection algorithms and goodness measures, and performs better than any of the candidate algorithms in identifying the signal AEs. Extensive simulation studies illustrate that the ensemble method detects the AE signals from synthetic post-market safety datasets pretty accurately across the different scenarios explored. Based on the cases reported to the FDA Adverse Event Reporting System (FAERS) between 2013 and 2022, we further demonstrate that the ensemble method successfully identifies and confirms most of the adverse events that are known to occur most frequently in reaction to antiepileptic drugs and -lactam antibiotics.
Read full abstract