Abstract
The aim of the paper is two-fold. First, we show that structure finding with the PC algorithm can be inherently unstable and requires further operational constraints in order to consistently obtain models that are faithful to the data. We propose a methodology to stabilise the structure finding process, minimising both false positive and false negative error rates. This is demonstrated with synthetic data. Second, to apply the proposed structure finding methodology to a data set comprising single-voxel Magnetic Resonance Spectra of normal brain and three classes of brain tumours, to elucidate the associations between brain tumour types and a range of observed metabolites that are known to be relevant for their characterisation. The data set is bootstrapped in order to maximise the robustness of feature selection for nominated target variables. Specifically, Conditional Independence maps (CI-maps) built from the data and their derived Bayesian networks have been used. A Directed Acyclic Graph (DAG) is built from CI-maps, being a major challenge the minimization of errors in the graph structure. This work presents empirical evidence on how to reduce false positive errors via the False Discovery Rate, and how to identify appropriate parameter settings to improve the False Negative Reduction. In addition, several node ordering policies are investigated that transform the graph into a DAG. The obtained results show that ordering nodes by strength of mutual information can recover a representative DAG in a reasonable time, although a more accurate graph can be recovered using a random order of samples at the expense of increasing the computation time.
Highlights
Structure finding methods lie within the field of Probabilistic Graphical Models
Results improved when the False Negative Reduction (FNR) was activated, showing a decrease in False Positives (FP) but no significant changes were observed with respect to the False Discovery Rate (FDR)
The FNR policy is focused on decreasing the false negatives (FN) errors, but it is recommended not to activate it when the optimal values of the effect size are unknown, which is the usual case for new data
Summary
Structure finding methods lie within the field of Probabilistic Graphical Models. They have been studied extensively [1, 2], especially from a theoretical perspective, as they offer an efficient graphical approach to apply statistical estimates in a complex system. Robust CI-maps of single-voxel MRS to elucidate associations between brain tumours and metabolites
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.