Abstract

Finding a maximum clique is important in research areas such as computational chemistry, social network analysis, and bioinformatics. It is possible to compare the maximum clique size between protein graphs to determine their similarity and function. In this paper, improvements based on machine learning (ML) are added to a dynamic algorithm for finding the maximum clique in a protein graph, Maximum Clique Dynamic (MaxCliqueDyn; short: MCQD). This algorithm was published in 2007 and has been widely used in bioinformatics since then. It uses an empirically determined parameter, Tlimit, that determines the algorithm’s flow. We have extended the MCQD algorithm with an initial phase of a machine learning-based prediction of the Tlimit parameter that is best suited for each input graph. Such adaptability to graph types based on state-of-the-art machine learning is a novel approach that has not been used in most graph-theoretic algorithms. We show empirically that the resulting new algorithm MCQD-ML improves search speed on certain types of graphs, in particular molecular docking graphs used in drug design where they determine energetically favorable conformations of small molecules in a protein binding site. In such cases, the speed-up is twofold.

Highlights

  • Finding the maximum clique in a graph is a well-studied NP-complete problem [1]

  • To determine if any speed-ups are possible by tuning the parameter Tlimit, we plot the time needed for Maximum Clique Dynamic (MCQD) to find the maximum clique at different values of the Tlimit parameter

  • To determine if any speed-ups are possible by tuning the parameter Tlimit, we p the time needed for MCQD to find the maximum clique at different values of the T parameter

Read more

Summary

Introduction

Finding the maximum clique in a graph is a well-studied NP-complete problem [1]. Recently developed algorithms significantly reduce the time required to search for a maximum clique, which is of great practical importance in many fields such as bioinformatics, social network analysis, and computational chemistry [2,3].There have been many advances in the search for faster algorithms for maximum cliques, many of which focus on specific domains of graphs [4,5,6,7]. To make the algorithm work fast on general graphs, some good heuristics have been proposed to speed up the branch-and-bound search [1,4,8,9,10,11,12,13,14,15] One such algorithm is MCQD, on which we have built [4]. In the MCQD algorithm, there is a single parameter that can be set before the algorithm is executed This parameter, called Tlimit, controls the fraction of a graph on which tighter upper bounds apply to the size of a maximal clique.

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call