Abstract

This paper presents a system, AscentX, for labeled data creation and training with active supervision. The aim of system is that it makes collecting data and training models easier. We integrate semi-supervised learning and active learning as well as crowdsourcing learning paradigms. We design a web-based interface for labeling examples based on Python-Django and incorporate our proposed algorithms into it, including semi-supervised classification algorithms, semi-supervised clustering algorithms and with application to data filtering. Moreover, we provide the interfaces for future development including standard datasets attaching, algorithms and active learning models plunging. The system supports the visualization of labeling the examples and analyzing the results of algorithms, which is user-friendly for human annotator. We experimentally verify that AscentX achieves comparable performance to the state-of-the-art systems, using real-life (crawling web data) and benchmark datasets.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.