Abstract

There are two semantic explanations including the lost and the unrelated for the missing values. To deal with the hybrid information systems which including the numerical attributes and the missing value attributes, a generalized incomplete rough set model is proposed based on neighborhood relations. The model approximates an arbitrary subset in the universe with neighborhood granules, and the generalized neighborhood relations are the generalization of the asymmetry similarity relations and the tolerance relations. It overcomes the shortcoming that the classical rough set can not deal with numerical attributes directly and can deal with the generalized incomplete system which has the missing values both include the lost and the unrelated. The discrimination methods of the missing value and a hybrid reduction algorithm are proposed also. The discrimination methods of the lost or the unrelated conditions are proposed based on the assumption of the consistency classification, and the influence of the noise samples and the neighborhood values to the classification accuracy is presented as well. The validity and feasibility of the reduction algorithm are demonstrated by the results of experiments on five UCI machine learning databases.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.