The effect of non-linear signal in classification problems using gene expression.

Benjamin J Heil, Jake Crawford, Casey S Greene

doi:10.1371/journal.pcbi.1010984

Open Access

https://doi.org/10.1371/journal.pcbi.1010984

Abstract

Those building predictive models from transcriptomic data are faced with two conflicting perspectives. The first, based on the inherent high dimensionality of biological systems, supposes that complex non-linear models such as neural networks will better match complex biological systems. The second, imagining that complex systems will still be well predicted by simple dividing lines, prefers linear models that are easier to interpret. We compare multi-layer neural networks and logistic regression across multiple prediction tasks on GTEx and Recount3 datasets and find evidence in favor of both possibilities. We verified the presence of non-linear signal when predicting tissue and metadata sex labels from expression data by removing the predictive linear signal with Limma, and showed that the removal ablated the performance of linear methods but not non-linear ones. However, we also found that the presence of non-linear signal was not necessarily sufficient for neural networks to outperform logistic regression. Our results demonstrate that, while multi-layer neural networks may be useful for making predictions from gene expression data, including a linear baseline model is critical: biological systems are high-dimensional, but effective dividing lines for predictive models may not be.
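
The signal-removal idea in the abstract can be illustrated with a small sketch. The abstract says only that Limma was used; the code below is not Limma and not the authors' pipeline. It is a stand-in that assumes "removing linear signal" means fitting a per-gene linear model on the label and subtracting the fitted label effect, which for a categorical label amounts to aligning the class means. The function name `remove_linear_label_signal` and the two simulated genes are hypothetical. The sketch shows that a difference in class means disappears after removal, while a difference in spread (one simple kind of non-linear signal) is left untouched.

```python
import random
from statistics import fmean, pvariance


def remove_linear_label_signal(values, labels):
    """Align class means for one gene (hypothetical stand-in for a per-gene
    linear-model correction): subtract each sample's class mean, then add
    back the grand mean so the overall expression level is preserved."""
    grand_mean = fmean(values)
    class_means = {
        c: fmean([v for v, l in zip(values, labels) if l == c])
        for c in set(labels)
    }
    return [v - class_means[l] + grand_mean for v, l in zip(values, labels)]


def class_stats(values, labels, c):
    """Return (mean, population variance) of the samples with label c."""
    xs = [v for v, l in zip(values, labels) if l == c]
    return fmean(xs), pvariance(xs)


# Simulated data (an assumption for illustration, not real expression data).
rng = random.Random(0)
n = 500
labels = [0] * n + [1] * n

# Gene A: the classes differ in mean, which a linear model can exploit.
gene_a = [rng.gauss(0.0, 1.0) for _ in range(n)] + [rng.gauss(2.0, 1.0) for _ in range(n)]
# Gene B: the classes share a mean but differ in spread.
gene_b = [rng.gauss(0.0, 1.0) for _ in range(n)] + [rng.gauss(0.0, 3.0) for _ in range(n)]

gene_a_adj = remove_linear_label_signal(gene_a, labels)
gene_b_adj = remove_linear_label_signal(gene_b, labels)

for name, before, after in [("A", gene_a, gene_a_adj), ("B", gene_b, gene_b_adj)]:
    for c in (0, 1):
        m_before, v_before = class_stats(before, labels, c)
        m_after, v_after = class_stats(after, labels, c)
        print(f"gene {name} class {c}: mean {m_before:.2f} -> {m_after:.2f}, "
              f"variance {v_before:.2f} -> {v_after:.2f}")
```

After the correction the two classes have the same mean for every gene, so a model that relies on mean shifts loses its signal, whereas the variance gap in gene B survives and remains available to a model that can use non-linear features. This toy setup does not reproduce the paper's experiments or its reported results.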

Journal: PLOS Computational Biology
Publication Date: Mar 27, 2023
Citations: 2
License type: CC BY 4.0