Abstract

In a lot of practical machine learning applications, such as web page classification, protein shape classification, unlabelled instances are easy to obtain, but labelled instances are rather too expensive to get. Thus, recently, semi-supervised learning (SSL) methods including graph-based algorithms have attracted many interests from researchers. However, most of these algorithms used the Gaussian function to calculate weights of the edge of the graph. In this paper, we proposed a novel weight for graph-based semi-supervised algorithms. In this new algorithm, the label information is added from problem into SSL algorithm, and the geodesic distance is utilized instead of Euclidean distance to calculate the distance between two instances. Furthermore, class prior knowledge is also added from problem into the target function. In this paper, we focus on learning with local and global consistency. We found that the effect of class prior knowledge maybe different between under low-label rate and high-label rate. Experiments on two University of California Irvine (UCI) data sets and United States Postal Service handwritten digit recognition show that our proposed algorithm is really effective.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call