Solution to Statistical Challenges in Proteomics Is More Statistics, Not Less.

Oliver Serang,Lukas Käll

doi:10.1021/acs.jproteome.5b00568

Abstract

In any high-throughput scientific study, it is often essential to estimate the percent of findings that are actually incorrect. This percentage is called the false discovery rate (abbreviated "FDR"), and it is an invariant (albeit, often unknown) quantity for any well-formed study. In proteomics, it has become common practice to incorrectly conflate the protein FDR (the percent of identified proteins that are actually absent) with protein-level target-decoy, a particular method for estimating the protein-level FDR. In this manner, the challenges of one approach have been used as the basis for an argument that the field should abstain from protein-level FDR analysis altogether or even the suggestion that the very notion of a protein FDR is flawed. As we demonstrate in simple but accurate simulations, not only is the protein-level FDR an invariant concept, when analyzing large data sets, the failure to properly acknowledge it or to correct for multiple testing can result in large, unrecognized errors, whereby thousands of absent proteins (and, potentially every protein in the FASTA database being considered) can be incorrectly identified.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Proteome Research	Publication Date: Aug 28, 2015
Citations: 45	License type: publisher-specific-oa

R Discovery Prime

R Discovery Prime

Solution to Statistical Challenges in Proteomics Is More Statistics, Not Less.

Abstract

Talk to us

Similar Papers

More From: Journal of Proteome Research

Lead the way for us

Similar Papers

On the Evaluation of Discrepant Scientific Data with Unrecognized Errors
Dan G Cacuci ... Mihaela Ionescu-Bujor
Nuclear Science and Engineering | VOL. 165
Dan G Cacuci, et. al.Dan G Cacuci ... Mihaela Ionescu-Bujor
01 May 2010
Nuclear Science and Engineering | VOL. 165

Simplified numerical modelling of rectangular concrete-filled steel columns, beams and beam-columns
Zhong Tao ... Utsab Katwal
-
Zhong Tao, et. al.Zhong Tao ... Utsab Katwal
27 Jun 2018
27 Jun 2018

Profiling of N-Acetylated Protein Termini Provides In-depth Insights into the N-terminal Nature of the Proteome
Andreas O Helbig ... Albert J.R Heck
Molecular & Cellular Proteomics | VOL. 9
Andreas O Helbig, et. al.Andreas O Helbig ... Albert J.R Heck
01 May 2010
Molecular & Cellular Proteomics | VOL. 9

Data-driven and physics-based modeling

-

04 Feb 2020
04 Feb 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Solution to Statistical Challenges in Proteomics Is More Statistics, Not Less.

Abstract

Talk to us

Similar Papers

More From: Journal of Proteome Research