Abstract

Non-parametric and distribution-free two-sample tests have been the foundation of many change point detection algorithms. However, randomness in the test statistic as a function of time makes them susceptible to false positives and localization ambiguity. We address these issues by deriving and applying filters matched to the expected temporal signatures of a change for various sliding window, two-sample tests under IID assumptions on the data. These filters are derived asymptotically with respect to the window size for the Wasserstein quantile test, the Wasserstein-1 distance test, Maximum Mean Discrepancy squared (MMD <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">2</sup> ), and the Kolmogorov-Smirnov (KS) test. The matched filters are shown to have two important properties. First, they are distribution-free, and thus can be applied without prior knowledge of the underlying data distributions. Second, they are peak-preserving, which allows the filtered signal produced by our methods to maintain expected statistical significance. Through experiments on synthetic data as well as activity recognition benchmarks, we demonstrate the utility of this approach for mitigating false positives and improving the test precision. Our method allows for the localization of change points without the use of ad-hoc post-processing to remove redundant detections common to current methods. We further highlight the performance of statistical tests based on the Quantile-Quantile (Q-Q) function and show how the invariance property of the Q-Q function to order-preserving transformations allows these tests to detect change points of different scales with a single threshold within the same dataset.

Highlights

  • Given a time-varying signal, the problem of change point detection (CPD) is to identify specific points in time where the signal exhibits a significant change either in its deterministic content or underlying stochastic distribution

  • For various non-parametric tests that have been used as the foundation of multiple CPD algorithms, we derive these filters under the simple observation that sliding windows over a change point will cause samples from one window to be drawn from a mixture distribution

  • We are able to derive the expected response of the test statistic in the region of a change point which is used to compute the matched filter in the operational case

Read more

Summary

INTRODUCTION

Given a time-varying signal, the problem of change point detection (CPD) is to identify specific points in time where the signal exhibits a significant change either in its deterministic content or underlying stochastic distribution. It is clear that the sliding window methods produce a correlated test statistic where the effects of the change at a given point in time are spread over an interval which contains the change point Motivated by this fact, we draw on the classical signal processing idea of a matched filter [12] as a tool to better. In the context of CPD, these properties allow Q-Q tests to detect relatively small changes in a manner that is practically independent of the overall scale of the data Whether this characteristic is useful or a source of false alarms depends heavily on the underlying application, an issue that is examined in this work using both simulated and real-world data. We demonstrate the improvement in detection as shown through false alarm rates, and the related metrics of precision and recall evaluating on simulated data as well as real-world benchmarks based on human activity (HASC [14], MASTRE [15]) and honeybee activity (Beedance) [16]

RELATED WORK
Problem Statement
Notation and Background
Quantile-Quantile Tests
Statistical Tests for Change Point Detection
EVALUATION
Simulation Data
Simulated Results
Real-World Data
Real-World Results
CONCLUSION AND FUTURE WORK
Mathematical Background
Proof of Theorem 1
Proof of Theorem 2
Proof of Theorem 4
Proof of Theorem 5

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.