Abstract

Crowdsourcing services provide a way to obtain large amounts of labeled data, which is inexpensive and effective. In crowdsourcing scenarios, the integrated labels of instances can be deduced by implementing ground truth inference algorithms. However, those labels often contain substantial noise and, to mitigate the effects of noise, label noise handling techniques are needed. This paper proposes a novel multi-view-based noise correction algorithm (MVNC). MVNC introduces the idea of multi-view learning to make better use of the information from crowdsourced data. It adds a new view composed of multiple noise labels and then trains classifiers on two views respectively to correct noise instances. In this process, different information between the views is fully utilized to generate the disagreement between two classifiers, so that the classifiers can complement each other and make more reliable predictions for noise instances. Experimental results on 38 benchmark data sets and 6 real-world data sets show that the new view significantly enhances the effect of noise correction.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.