Co-clustering aims to cluster the rows and columns of data simultaneously and can be often formulated as a two-objective optimization problem (one objective for rows and the other for columns) and the solution is a Pareto-optimal solution set in principle. Existing methods usually convert the co-clustering problem into a single-objective optimization problem by setting a hyper-parameter λ between the two objectives. However, finding a good value of λ is not easy because of its large parameter space; also, there may exist many equally good λs. In this paper, a nondominated sorting genetic model (NSGC) is proposed to tackle the co-clustering problem, totally bypassing the trade-off parameter λ and returning to the original two-objective problem. The core of our model is to group a row objective function and a column objective function and integrate them into a genetic algorithm as the fitness functions. After this reformulation, we follow a standard genetic algorithm procedure to iteratively find the Pareto-optimal solutions. Finally, to fish out a single best solution we further design a sorting criterion according to which the Pareto-optimal solution set can be totally ordered. Extensive experiments with 16 public datasets are conducted, and the results demonstrate the superiority of our approach.
Read full abstract