Abstract

In many application areas, it is important to detect outliers. Traditional engineering approach to outlier detection is that we start with some values x/sub 1/,..., x/sub n/, compute the sample average E, the sample standard variation /spl sigma/, and then mark a value x as an outlier if x is outside the k/sub 0/-sigma interval [E-k/sub 0//spl middot//spl sigma/, E+k/sub 0//spl middot//spl sigma/] (for some pre-selected parameter k/sub 0/). In real life, we often have only interval ranges [x/sub i/, x~/sub i/] for the normal values x/sub 1/,...,x/sub n/. In this case, we only have intervals of possible values for the bounds E-k/sub 0//spl middot//spl sigma/ and E+k/sub 0//spl middot//spl sigma/. We can therefore identify outliers as values that are outside all k/sub 0/-sigma intervals. In this paper, we analyze the computational complexity of these outlier detection problems, and provide efficient algorithms that solve some of these problems (under reasonable conditions). We also provide algorithms that estimate the degree of outlier-ness of a given value x-measured as the largest value k/sub 0/ for which x is outside the corresponding k/sub 0/-sigma interval.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.