Globally Optimal Dense and Sparse Spanning Trees, and their Applications

Mustafa Ozen,Goran Lesaja,Hua Wang

doi:10.19139/soic-2310-5070-855

Abstract

Finding spanning trees under various constraints is a classic problem with applications in many fields. Recently, a novel notion of dense ( sparse ) tree, and in particular spanning tree (DST and SST respectively), is introduced as the structure that have a large (small) number of subtrees, or small (large) sum of distances between vertices. We show that finding DST and SST reduces to solving the discrete optimization problems. New and efficient approaches to find such spanning trees is achieved by imposing certain conditions on the vertex degrees which are then used to define an objective function that is minimized over all spanning trees of the graph under consideration. Solving this minimization problem exactly may be prohibitively time consuming for large graphs. Hence, we propose to use genetic algorithm (GA) which is one of well known metaheuristics methods to solve DST and SST approximately. As far as we are aware this is the first time GA has been used in this context.We also demonstrate on a number of applications that GA approach is well suited for these types of problems both in computational efficiency and accuracy of the approximate solution. Furthermore, we improve the efficiency of the proposed method by using Kruskal s algorithm in combination with GA. The application of our methods to several practical large graphs and networks is presented. Computational results show that they perform faster than previously proposed heuristic methods and produce more accurate solutions. Furthermore, the new feature of the proposed approach is that it can be applied recursively to sub-trees or spanning trees with additional constraints in order to further investigate the graphical properties of the graph and/or network. The application of this methodology on the gene network of a cancer cell led to isolating key genes in a network that were not obvious from previous studies.

Highlights

Seeking the spanning tree of a given graph structure is a classic problem that has numerous applications and variations
In this subsection we provide a brief justification and an additional argument for the validity of the genetic algorithm (GA) approach described in the previous section
To distinguish Integer Linear Programming (ILP) model from the proposed models in the previous section, we use the following notation: Suppose that G represents the undirected graph in Fig 3 with vertex set V = {vj, j = 1..nN } and edge set E = {yi, i = 1..nE} where nN and nE are the number of nodes and edges respectively

Summary

Introduction

Seeking the spanning tree of a given graph structure is a classic problem that has numerous applications and variations. In a weighted graph, finding the spanning tree with minimum total weight is known as the minimum spanning tree problem and is probably one of the most extensively studied problems. In the case of unweighted graphs, it is of interest to define a formal criterion to distinguish spanning trees (or sub-structures in general) that are more “compact” or “spread out”. One such criteria can be introduced through the topological indices defined as graph invariants. The best known distance-based index is the Wiener index [10, 11], defined as the sum of distances between all pairs of vertices.

Results

Discussion

Conclusion