Abstract

Existing techniques to reconstruct tree models of progression for accumulative processes, such as cancer, seek to estimate causation by combining correlation and a frequentist notion of temporal priority. In this paper, we define a novel theoretical framework called CAPRESE (CAncer PRogression Extraction with Single Edges) to reconstruct such models based on the notion of probabilistic causation defined by Suppes. We consider a general reconstruction setting complicated by the presence of noise in the data due to biological variation, as well as experimental or measurement errors. To improve tolerance to noise we define and use a shrinkage-like estimator. We prove the correctness of our algorithm by showing asymptotic convergence to the correct tree under mild constraints on the level of noise. Moreover, on synthetic data, we show that our approach outperforms the state-of-the-art, that it is efficient even with a relatively small number of samples and that its performance quickly converges to its asymptote as the number of samples increases. For real cancer datasets obtained with different technologies, we highlight biologically significant differences in the progressions inferred with respect to other competing techniques and we also show how to validate conjectured biological relations with progression models.

Highlights

  • Performance on cancer Next Generation Sequencing (NGS) datasets we show the application of reconstruction techniques to the validation of a specific relation among recurrent mutations involved in atypical Chronic Myeloid Leukemia (ACML)

  • In this work we have introduced a novel framework for the reconstruction of the causal topologies underlying cumulative progressive phenomena, based on the probability raising notion of causation

  • We prove correctness of CAPRESE by showing asymptotic convergence to the correct tree

Read more

Summary

Introduction

Its initiation and progression are caused by dynamic somatic alterations to the genome manifested as point mutations, structural alterations, DNA methylation and histone modification changes [1] These genomic alterations are generated by random processes, and since individual tumor cells compete for space and resources, the fittest variants are naturally selected for. If through some mutations a cell acquires the ability to ignore antigrowth signals from the body, this cell may thrive and divide, and its progeny may eventually dominate some part(s) of the tumor. This clonal expansion can be seen as a discrete state of the cancer’s progression, marked by the acquisition of a set of genetic events. Different progression sequences are possible, but some are more common than others, and not every order is viable [2]

Objectives
Methods
Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.