Reference genome and transcriptome informed by the sex chromosome complement of the sample increase ability to detect sex differences in gene expression from RNA-Seq data

Kimberly C Olney,Jocelyn P Andrews,Valeria A Valverde-Vesling,Melissa A Wilson,Sarah M Brotman

doi:10.1186/s13293-020-00312-9

Abstract

BackgroundHuman X and Y chromosomes share an evolutionary origin and, as a consequence, sequence similarity. We investigated whether the sequence homology between the X and Y chromosomes affects the alignment of RNA-Seq reads and estimates of differential expression. We tested the effects of using reference genomes and reference transcriptomes informed by the sex chromosome complement of the sample’s genome on the measurements of RNA-Seq abundance and sex differences in expression.ResultsThe default genome includes the entire human reference genome (GRCh38), including the entire sequence of the X and Y chromosomes. We created two sex chromosome complement informed reference genomes. One sex chromosome complement informed reference genome was used for samples that lacked a Y chromosome; for this reference genome version, we hard-masked the entire Y chromosome. For the other sex chromosome complement informed reference genome, to be used for samples with a Y chromosome, we hard-masked only the pseudoautosomal regions of the Y chromosome, because these regions are duplicated identically in the reference genome on the X chromosome. We analyzed the transcript abundance in the whole blood, brain cortex, breast, liver, and thyroid tissues from 20 genetic female (46, XX) and 20 genetic male (46, XY) samples. Each sample was aligned twice: once to the default reference genome and then independently aligned to a reference genome informed by the sex chromosome complement of the sample, repeated using two different read aligners, HISAT and STAR. We then quantified sex differences in gene expression using featureCounts to get the raw count estimates followed by Limma/Voom for normalization and differential expression. We additionally created sex chromosome complement informed transcriptome references for use in pseudo-alignment using Salmon. Transcript abundance was quantified twice for each sample: once to the default target transcripts and then independently to target transcripts informed by the sex chromosome complement of the sample.ConclusionsWe show that regardless of the choice of the read aligner, using an alignment protocol informed by the sex chromosome complement of the sample results in higher expression estimates on the pseudoautosomal regions of the X chromosome in both genetic male and genetic female samples, as well as an increased number of unique genes being called as differentially expressed between the sexes. We additionally show that using a pseudo-alignment approach informed on the sex chromosome complement of the sample eliminates Y-linked expression in female XX samples.

Highlights

Human X and Y chromosomes share an evolutionary origin and, as a consequence, sequence similarity
We show that regardless of the choice of the read aligner, using an alignment protocol informed by the sex chromosome complement of the sample results in higher expression estimates on the pseudoautosomal regions of the X chromosome in both genetic male and genetic female samples, as well as an increased number of unique genes being called as differentially expressed between the sexes
We show that using a pseudo-alignment approach informed on the sex chromosome complement of the sample eliminates Y-linked expression in female XX samples

Summary

Introduction

Human X and Y chromosomes share an evolutionary origin and, as a consequence, sequence similarity. Accounting for the sex chromosome complement of the sample in quantifying gene expression has been limited due to shared sequence homology between the sex chromosomes, X and Y, that can confound gene expression estimates. The X and Y chromosomes share an evolutionary origin: mammalian X and Y chromosomes originated from a pair of indistinguishable autosomes ~ 180–210 million years ago that acquired the sex-determining genes [9,10,11]. Other regions of high sequence similarity between X and Y include the X-transposed region (XTR) with 98.78% homology [15] (Fig. 1a). The evolution of the X and Y chromosomes has resulted in a pair of chromosomes that are diverged, but still share some regions of high sequence similarity

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Biology of Sex Differences	Publication Date: Jul 21, 2020
Citations: 32	License type: open-access

R Discovery Prime

R Discovery Prime

Reference genome and transcriptome informed by the sex chromosome complement of the sample increase ability to detect sex differences in gene expression from RNA-Seq data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Biology of Sex Differences

Lead the way for us

Similar Papers

Tissue-specific sex differences in human gene expression.
Irfahan Kassam ... Allan F Mcrae
Human Molecular Genetics | VOL. 28
Irfahan Kassam, et. al.Irfahan Kassam ... Allan F Mcrae
20 May 2019
Human Molecular Genetics | VOL. 28

Sex differences in human adipose tissue gene expression and genetic regulation involve adipogenesis.
Warren D Anderson ... Eric E Schadt
Genome Research | VOL. 30
Warren D Anderson, et. al.Warren D Anderson ... Eric E Schadt
23 Sep 2020
Genome Research | VOL. 30

Sex differences in gene expression in response to ischemia in the human left ventricular myocardium.
Gregory Stone ... Oliva Meritxell
Human molecular genetics | VOL. 28
Gregory Stone, et. al.Gregory Stone ... Oliva Meritxell
14 Jan 2019
Human molecular genetics | VOL. 28

Sex differences in early and term placenta are conserved in adult tissues
Kimberly C. Olney ... Seema B. Plaisier
Biology of Sex Differences | VOL. 13
Kimberly C. Olney, et. al.Kimberly C. Olney ... Seema B. Plaisier
22 Dec 2022
Biology of Sex Differences | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Reference genome and transcriptome informed by the sex chromosome complement of the sample increase ability to detect sex differences in gene expression from RNA-Seq data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Biology of Sex Differences