Abstract

With the recent advances in single-cell RNA-sequencing (scRNA-seq) technologies, the estimation of allele expression from single cells is becoming increasingly reliable. Allele expression is both quantitative and dynamic and is an essential component of the genomic interactome. Here, we systematically estimate the allele expression from heterozygous single nucleotide variant (SNV) loci using scRNA-seq data generated on the 10×Genomics Chromium platform. We analyzed 26,640 human adipose-derived mesenchymal stem cells (from three healthy donors), sequenced to an average of 150K sequencing reads per cell (more than 4 billion scRNA-seq reads in total). High-quality SNV calls assessed in our study contained approximately 15% exonic and >50% intronic loci. To analyze the allele expression, we estimated the expressed variant allele fraction (VAFRNA) from SNV-aware alignments and analyzed its variance and distribution (mono- and bi-allelic) at different minimum sequencing read thresholds. Our analysis shows that when assessing positions covered by a minimum of three unique sequencing reads, over 50% of the heterozygous SNVs show bi-allelic expression, while at a threshold of 10 reads, nearly 90% of the SNVs are bi-allelic. In addition, our analysis demonstrates the feasibility of scVAFRNA estimation from current scRNA-seq datasets and shows that the 3′-based library generation protocol of 10×Genomics scRNA-seq data can be informative in SNV-based studies, including analyses of transcriptional kinetics.

Highlights

  • In the last several years, single-cell RNA-sequencing has become an accessible platform for genomic studies [1,2,3]

  • Recent studies have demonstrated the usefulness of scRNA-seq single nucleotide variant (SNV) assessments for a variety of applications, including random monoallelic expression (RME), transcriptional burst kinetics [7,8,9,10,11], haplotype inference [12], chromosome X inactivation [13,14], genetic heterogeneity in cancer [15,16,17,18,19], aneuploidy [20], quantitative trait loci (QTL) assessments [21], and demultiplexing [22,23,24]

  • We demonstrate an approach for assessing RME, and compare the results from scRNA-seq data generated on the 10×Genomics Chromium with studies based on different platforms

Read more

Summary

Introduction

In the last several years, single-cell RNA-sequencing (scRNA-seq) has become an accessible platform for genomic studies [1,2,3]. With the emerging advances in scRNA-seq technologies, estimations of genetic variation from scRNA-seq data are becoming more reliable [4,5,6]. Recent studies have demonstrated the usefulness of scRNA-seq single nucleotide variant (SNV) assessments for a variety of applications, including random monoallelic expression (RME), transcriptional burst kinetics [7,8,9,10,11], haplotype inference [12], chromosome X inactivation [13,14], genetic heterogeneity in cancer [15,16,17,18,19], aneuploidy [20], quantitative trait loci (QTL) assessments [21], and demultiplexing [22,23,24]. Genetic variation can be assessed using RNA-seq data [24,25,26,27,28,29,30], by calculating the variant allele fraction

Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call