A safety framework for flow decomposition problems via integer linear programming.

Fernando H C Dias,Manuel Cáceres,Lucia Williams,Brendan Mumey,Alexandru I Tomescu

doi:10.1093/bioinformatics/btad640

Fernando H C Dias, Manuel Cáceres + Show 3 more

Open Access

https://doi.org/10.1093/bioinformatics/btad640

Copy DOI

Abstract

Many important problems in Bioinformatics (e.g. assembly or multiassembly) admit multiple solutions, while the final objective is to report only one. A common approach to deal with this uncertainty is finding "safe" partial solutions (e.g. contigs) which are common to all solutions. Previous research on safety has focused on polynomially time solvable problems, whereas many successful and natural models are NP-hard to solve, leaving a lack of "safety tools" for such problems. We propose the first method for computing all safe solutions for an NP-hard problem, "minimum flow decomposition" (MFD). We obtain our results by developing a "safety test" for paths based on a general integer linear programming (ILP) formulation. Moreover, we provide implementations with practical optimizations aimed to reduce the total ILP time, the most efficient of these being based on a recursive group-testing procedure. Experimental results on transcriptome datasets show that all safe paths for MFDs correctly recover up to 90% of the full RNA transcripts, which is at least 25% more than previously known safe paths. Moreover, despite the NP-hardness of the problem, we can report all safe paths for 99.8% of the over 27 000 non-trivial graphs of this dataset in only 1.5 h. Our results suggest that, on perfect data, there is less ambiguity than thought in the notoriously hard RNA assembly problem. https://github.com/algbio/mfd-safety.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A safety framework for flow decomposition problems via integer linear programming.

Abstract

Talk to us

Similar Papers

More From: Bioinformatics (Oxford, England)

Lead the way for us

Journal: Bioinformatics (Oxford, England)	Publication Date: Oct 20, 2023
License type: CC BY 4.0

Similar Papers

A graph approach to placement of Service Functions Chains
Nicolas Tastevin ... Mathieu Bouet
-
Nicolas Tastevin, et. al.Nicolas Tastevin ... Mathieu Bouet
01 May 2017
01 May 2017

The Baggage Belt Assignment Problem
David Pisinger ... Rosario Scatamacchia
EURO Journal on Transportation and Logistics | VOL. 10
David Pisinger, et. al.David Pisinger ... Rosario Scatamacchia
01 Jan 2020
EURO Journal on Transportation and Logistics | VOL. 10

Integer Linear Programming Formulation for the Unified Duplication-Loss-Coalescence Model
Javad Ansarifar ... Alexey Markin
-
Javad Ansarifar, et. al.Javad Ansarifar ... Alexey Markin
01 Jan 2020
01 Jan 2020

Fast, Flexible, and Exact Minimum Flow Decompositions via ILP
Fernando H C Dias ... Brendan Mumey
-
Fernando H C Dias, et. al.Fernando H C Dias ... Brendan Mumey
01 Jan 2021
01 Jan 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A safety framework for flow decomposition problems via integer linear programming.

Abstract

Talk to us

Similar Papers

More From: Bioinformatics (Oxford, England)