Abstract
Learning from label proportions (LLP) is a new kind of learning problem which has attracted wide interest in machine learning. Different from the well-known supervised learning, the training data of LLP is in the form of bags and only the proportion of each class in each bag is available. Actually, many modern applications can be successfully abstracted to this problem such as modeling voting behaviors and spam filtering. However, time-consuming training is still a challenge for LLP, which becomes a bottleneck especially when addressing large bags and bag sizes. In this paper, we propose a fast algorithm called multi-class learning from label proportions by extreme learning machine (LLP-ELM), which takes advantage of an extreme learning machine with fast learning speed to solve multi-class learning from label proportions. Firstly, we reshape the hidden layer output matrix and the training data target matrix of an extreme learning machine to adapt to the proportion information instead of the real labels. Secondly, a robust loss function with a regularization term is formulated and two efficient solutions are provided to different cases. Finally, various experiments demonstrate the significant speed-up of the proposed model with better accuracies on different datasets compared with several state-of-the-art methods.
Highlights
In the era of big data, many real-world applications involve a multi-class problem
Large scale data means more time-consuming for training especially for weakly supervised learning where the labels of training data are inaccessible. This necessitates the development of fast learning algorithms for multi-class learning from label proportions
The label proportions (LLP) problem is described by a set of training data, which is divided into several bags
Summary
Fan Zhang 1,† , Jiabin Liu 2,† , Bo Wang 3 , Zhiquan Qi 4,5,6, * and Yong Shi 4,5,6,7. Key Laboratory of Big Data Mining and Knowledge Management, Chinese Academy of Sciences, Beijing 100190, China. Research Center on Fictitious Economy and Data Science, Chinese Academy of Sciences, College of Information Science and Technology, University of Nebraska at Omaha, NE 68182, USA.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.