Abstract

When dealing with semi-supervised scenarios, the Positive and Unlabeled (PU) problem is a special case in which few labeled examples from a single class of interest are received to proceed with the classification of unseen instances, according to their similarities with the known class. In the scope of time series, most of the current studies propose to address this subject using a self-training approach based on the 1-Nearest Neighbor algorithm. In order to compute the most similar instance, they compare features along the time domain using the Euclidean Distance and the Dynamic Time Warping-Delta. Despite time-domain measurements permit the analysis of local series shapes, they disconsider temporal recurrences commonly found in natural phenomena (e.g. population growth, climate studies) and are more sensitive to local noise and fluctuations, leading to poor classification performances as confirmed in this paper. This drawback motivated us to propose the use of the Maximum Diagonal Line of the Cross-Recurrence Quantification Analysis (MDL-CRQA), applied on the time series phase space, as similarity measurement. The phase space is obtained after applying Takens embedding theorem on the series, unfolding temporal relationships and dependencies among data observations. As consequence, by comparing phase spaces rather than the series themselves, we can assess how their trajectories evolve along time, including their periodicities and temporal cycles, as well as decreasing noise influences. Experimental results confirm MDL-CRQA improves classification results for PU time series when compared against the mostly used time-domain similarity measurements.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.