Abstract

Co-clustering aims to cluster the rows and columns of data simultaneously and can be often formulated as a two-objective optimization problem (one objective for rows and the other for columns) and the solution is a Pareto-optimal solution set in principle. Existing methods usually convert the co-clustering problem into a single-objective optimization problem by setting a hyper-parameter λ between the two objectives. However, finding a good value of λ is not easy because of its large parameter space; also, there may exist many equally good λs. In this paper, a nondominated sorting genetic model (NSGC) is proposed to tackle the co-clustering problem, totally bypassing the trade-off parameter λ and returning to the original two-objective problem. The core of our model is to group a row objective function and a column objective function and integrate them into a genetic algorithm as the fitness functions. After this reformulation, we follow a standard genetic algorithm procedure to iteratively find the Pareto-optimal solutions. Finally, to fish out a single best solution we further design a sorting criterion according to which the Pareto-optimal solution set can be totally ordered. Extensive experiments with 16 public datasets are conducted, and the results demonstrate the superiority of our approach.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call