Abstract
BackgroundWith the recent advance of sequencing technology, the collection of RNA expression (RNA‐seq) data has been growing rapidly. RNA‐seq data are statistically count‐type measurements. Poisson distribution is a basic probability distribution for modeling count‐type data. With Poisson regression models, various experimental factors, GC content as well as alternative splicing isoforms can be flexibly considered in RNA‐seq data analysis. Due to the biochemical and technical limitations of sequencing technology, the biases among RNA‐seq data have been recognized.MethodsIn this study, an artificial censoring approach has been proposed to an isoform‐specific Poisson regression model for analyzing RNA‐seq data. Low expression values can be grouped (censored) into one probability category, and high expression values can also be grouped (censored) into another probability category. We have implemented the related Newton‐Raphson numeric computing procedure to achieve the maximum likelihood estimation for our censored‐Poisson regression model. The related mathematical simplifications have been derived for the consideration of stable and convenient numerical computing.ResultsThe advantages of our artificial censoring approach have been demonstrated in both simulation studies and application analysis of experimental data.ConclusionsOur proposed artificial censoring approach allows us to focus on the majority of data. As the extreme values (tails) of data are artificially censored, more efficient analysis results can be obtained, even from relatively simple Poisson regression models. Our proposed artificial censoring approach can certainly be considered for other well‐developed models or methods for RNA‐seq data analysis.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.