Method of outliers removal based on the weighted training samples of w-objects

Елена Владимировна Волченко

doi:10.15587/1729-4061.2014.24331

Abstract

The problem of preprocessing training samples to improve the efficiency of trainable recognition systems is considered in the paper. A new method for solving the problem of outliers removal based on constructing weighted reduced samples of w-objects is proposed. The wGridDC method for constructing the weighted sample of w-objects by superimposing the grid features on the space and constructing weighted objects of new sample by analyzing the contents of cells is used as a basis for the proposed method.Within the proposed method, two outliers removal algorithms are developed. The algorithm for constructing the weighted training sample of w-objects with simultaneous outliers removal at a given filtering threshold is focused on the use in the tasks that require not only filtering the original data, but also controlling the size of the sample. Herewith, filtering threshold is user-defined. The algorithm for constructing the weighted training sample of w-objects with simultaneous outliers removal at automatic filtering threshold detection is focused on the tasks that require constructing samples, providing the highest efficiency of the system.Analysis of the effectiveness of the proposed method has shown that the main advantage of the threshold filtering algorithm is the ability to control the size of the sample. The main advantage of the non-threshold filtering algorithm is the ability to automatically select the value of the filtering threshold that provides the greatest efficiency of the recognition system as a whole. Thus, the proposed method in general and both its constituent algorithms allow to obtain the samples, providing high efficiency of trainable recognition systems.

Highlights

ВведениеПроблема качества данных является на сегодняшний день одной из важнейших проблем, решаемых при построении интеллектуальных систем [1,2,3].
Особенно остро данная проблема проявляется при построении обучающихся систем распознавания как самостоятельных систем или подсистем сложных интеллектуальных систем [4].
Предобработка данных в системах распознавания является итеративным процессом и включает [1]:.

Summary

Введение

Проблема качества данных является на сегодняшний день одной из важнейших проблем, решаемых при построении интеллектуальных систем [1,2,3]. Особенно остро данная проблема проявляется при построении обучающихся систем распознавания как самостоятельных систем или подсистем сложных интеллектуальных систем [4]. Предобработка данных в системах распознавания является итеративным процессом и включает [1]:. − очистку данных, которая заключается в удалении шума, пропусков в данных и данных низкого качества;. Построение современных систем распознавания предполагает выполнение одного или нескольких этапов предобработки данных за одну или несколько итераций. В большинстве систем предобработка данных заключается в их очистке, при выполнении которой наибольшее внимание уделяется удалению шума (выбросов) и данных низкого качества [1, 4]

Постановка проблемы и анализ литературы

Постановка задачи

Рассчитывается шаг клетки по формуле s

Анализ результатов экспериментальных исследований

Выводы

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Method of outliers removal based on the weighted training samples of w-objects

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Eastern-European Journal of Enterprise Technologies

Lead the way for us

Journal: Eastern-European Journal of Enterprise Technologies	Publication Date: Jun 20, 2014
License type: cc-by

Similar Papers

Conceptual space based gross outlier removal for geometric model fitting
Xing Wang ... Yan Yan
-
Xing Wang, et. al.Xing Wang ... Yan Yan
01 Nov 2016
01 Nov 2016

STAR_outliers: a python package that separates univariate outliers from non-normal distributions
John T Gregg ... Jason H Moore
BioData Mining | VOL. 16
John T Gregg, et. al.John T Gregg ... Jason H Moore
04 Sep 2023
BioData Mining | VOL. 16

Outlier formation and removal in 3D laser scanned point clouds

-

01 Jan 2014
01 Jan 2014

Point Clouds Outlier Removal Method Based on Improved Mahalanobis and Completion
Chengzhi Qu ... Yang Yang
IEEE Robotics and Automation Letters | VOL. 8
Chengzhi Qu, et. al.Chengzhi Qu ... Yang Yang
01 Jan 2023
IEEE Robotics and Automation Letters | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Method of outliers removal based on the weighted training samples of w-objects

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Eastern-European Journal of Enterprise Technologies