Abstract

Outlier mining aims to find objects whose behavior deviates from the rest of the dataset or does not follow the common patterns. This paper introduces a density definition based on the minimum hypersphere and proposes an outlier mining algorithm based on neighbor density deviation. First, the local space-density of an object is defined using the minimum hypersphere. Second, the nearest neighbor sequence (NNS) of the object is built from the distances between the object and its neighbors. Given the space-density and the NNS of the object, the neighborhood density deviation (NDD) in the NNS is calculated based on the sum of the density differences between the object and its neighbors. Finally, the neighbor-density-deviation-based outlier factor (NDDOF) is obtained to indicate the degree to which the object is an outlier. To evaluate the effectiveness and performance of the new space-density definition and the NDDOF algorithm, we run experiments on a synthetic dataset and three real UCI datasets. The results verify that the space-density is meaningful and that the NDDOF algorithm has higher quality in outlier mining.

DOI: http://dx.doi.org/10.5755/j01.itc.45.3.13164
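The pipeline described above (space-density, NNS, NDD, NDDOF) can be illustrated with a minimal Python sketch. The abstract does not give the formulas, so everything concrete here is an assumption made for illustration, not the paper's definition: density is taken as k divided by r^d, where r is the radius of the smallest ball centered on the object that encloses its k nearest neighbors; NDD is the sum of absolute density differences over the NNS; and the final factor normalizes NDD by k times the object's own density. The function names and the parameter k are hypothetical.

```python
import math

def nearest_neighbor_sequence(points, i, k):
    """Return (indices of the k nearest neighbors of points[i], nearest
    first, and the distance to the k-th neighbor)."""
    dists = sorted(
        (math.dist(points[i], p), j) for j, p in enumerate(points) if j != i
    )
    return [j for _, j in dists[:k]], dists[k - 1][0]

def space_density(points, i, k):
    """ASSUMED density: k / r**d, with r the radius of the smallest ball
    centered on points[i] that encloses its k nearest neighbors."""
    _, r = nearest_neighbor_sequence(points, i, k)
    d = len(points[i])
    return k / max(r, 1e-12) ** d  # guard against duplicate points (r == 0)

def nddof_scores(points, k=3):
    """ASSUMED outlier factor: sum of |density(i) - density(j)| over the
    NNS of i, divided by k * density(i). Larger means more outlying."""
    dens = [space_density(points, i, k) for i in range(len(points))]
    scores = []
    for i in range(len(points)):
        nns, _ = nearest_neighbor_sequence(points, i, k)
        ndd = sum(abs(dens[i] - dens[j]) for j in nns)
        scores.append(ndd / (k * dens[i]))
    return scores

if __name__ == "__main__":
    # A tight 4x4 grid plus one distant point (illustrative data only).
    pts = [(0.1 * a, 0.1 * b) for a in range(4) for b in range(4)] + [(5.0, 5.0)]
    scores = nddof_scores(pts, k=3)
    top = max(range(len(pts)), key=scores.__getitem__)
    print("most outlying point:", pts[top])
```

Under these assumptions, an isolated object has a much lower density than the neighbors in its NNS, so its normalized deviation is large, while objects inside a uniform cluster score close to zero. The brute-force neighbor search is quadratic in the number of objects and is meant only for small examples.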

