PhenStat: A Tool Kit for Standardized Analysis of High Throughput Phenotypic Data.

Natalja Kurbatova,Jeremy C Mason,Hugh Morgan,Natasha A Karp,Terrence F Meehan

doi:10.1371/journal.pone.0131274

Abstract

The lack of reproducibility with animal phenotyping experiments is a growing concern among the biomedical community. One contributing factor is the inadequate description of statistical analysis methods that prevents researchers from replicating results even when the original data are provided. Here we present PhenStat – a freely available R package that provides a variety of statistical methods for the identification of phenotypic associations. The methods have been developed for high throughput phenotyping pipelines implemented across various experimental designs with an emphasis on managing temporal variation. PhenStat is targeted to two user groups: small-scale users who wish to interact and test data from large resources and large-scale users who require an automated statistical analysis pipeline. The software provides guidance to the user for selecting appropriate analysis methods based on the dataset and is designed to allow for additions and modifications as needed. The package was tested on mouse and rat data and is used by the International Mouse Phenotyping Consortium (IMPC). By providing raw data and the version of PhenStat used, resources like the IMPC give users the ability to replicate and explore results within their own computing environment.

Highlights

Irreproducibility of animal research is slowing advancement in understanding disease mechanisms, squandering resources on unproductive avenues of research and contributing to the cost of development of new drugs [1]
Applying the appropriate statistical analysis is a challenge in assessing biological data [28,29,30] and is an area of active research for high throughput phenotyping [10,20]
There is a need for accessible, freely available statistical tools that support the community in choosing the best analysis, especially when complex statistical methods are involved

Summary

Introduction

Irreproducibility of animal research is slowing advancement in understanding disease mechanisms, squandering resources on unproductive avenues of research and contributing to the cost of development of new drugs [1]. In large-scale model organism screens, a suite of statistical tests is required to accurately associate the interaction between genotype and phenotype. High-throughput methods ensure large volumes of phenotype data continue to be collected, an automated statistical method selection process and analysis platform is required. We have developed PhenStat, an R package of tools for the identification of phenotypic associations with an emphasis on statistical tools for high-throughput experiments that is made freely available from the Bioconductor repository. The PhenStat package has been tested and demonstrated with an application of 420 lines of mouse phenotyping data from the http://www.sanger.ac.uk/mouseportal/ Sanger Mouse Genetics Project [15] and http://www.eumodic.org/ EUMODIC project [16] and on rat phenotyping datasets from PhysGen resource (http://pga.mcw.edu) [17]. The usage of PhenStat enables these analyses to be automated and version controlled

Methods

Statistical methods available

Future work

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PLOS ONE	Publication Date: Jul 6, 2015
Citations: 56	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

PhenStat: A Tool Kit for Standardized Analysis of High Throughput Phenotypic Data.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLOS ONE

Lead the way for us

Similar Papers

Chapter 16 - High-throughput phenotyping: the latest research tool for sustainable crop production under global climate change scenarios
Xiuqing Fu ... Dong Jiang
Sustainable Crop Productivity and Quality Under Climate Change | VOL. -
Xiuqing Fu, et. al.Xiuqing Fu ... Dong Jiang
01 Jan 2021
Sustainable Crop Productivity and Quality Under Climate Change | VOL. -

OpenStats: A robust and scalable software package for reproducible analysis of high-throughput phenotypic data.
Hamed Haselimashhadi ... Giorgio F Gilestro
PloS one | VOL. 15
Hamed Haselimashhadi, et. al.Hamed Haselimashhadi ... Giorgio F Gilestro
30 Dec 2020
PloS one | VOL. 15

A bioimage informatics platform for high-throughput embryo phenotyping.
James M Brown ... Jennifer Vibert
Briefings in bioinformatics | VOL. 19
James M Brown, et. al.James M Brown ... Jennifer Vibert
14 Oct 2016
Briefings in bioinformatics | VOL. 19

The International Mouse Phenotyping Consortium Web Portal, a unified point of access for knockout mice and related phenotyping data.
Gautier Koscielny ...
Nucleic Acids Research | VOL. 42
Gautier Koscielny, et. al.Gautier Koscielny ...
04 Nov 2013
Nucleic Acids Research | VOL. 42

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

PhenStat: A Tool Kit for Standardized Analysis of High Throughput Phenotypic Data.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLOS ONE