Abstract

Beta-Poisson (BP) models employ Poisson distributions, where the corresponding rate parameter itself is a Beta-distributed random variable. They have been shown to appropriately mimic gene expression distributions in the context of single-cell ribonucleic acid sequencing (scRNA-seq), a breakthrough technology allowing to sequence information from individual biological cells and facilitating fundamental insights into numerous fields of biology. A prominent scRNA-seq data analysis task is to identify differences in gene expression distributions across two conditions. To validate new statistical approaches in this context, one typically has to rely on accurate simulations, as usually no ground truth for an assessment is available. We introduce several simulation procedures that allow to generate differential distributions (DDs) based on BP models. In particular, we describe how to create different types of DDs, mirroring various sources or origins of a difference, and different degrees of DDs, from a weak to a strong difference. The soundness of the simulation procedures is shown in a validation study in which theoretically expected model properties of the DD simulations are confirmed. The findings are in principle not restricted to the scRNA-seq context and may be generally applicable also to other application areas. The simulation approaches are implemented in the publicly available R package SimBPDD.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.