Effect of high variation in transcript expression on identifying differentially expressed genes in RNA-seq analysis.

Weitong Cui, Qinglu Wang, Yajun Liang, Xuewen Tian, Jing Zhang, Yifan Geng, Huaru Xue

doi:10.1111/ahg.12441

Abstract

Considerable effort has gone into algorithms for RNA-seq data that improve the accuracy and efficiency of differential expression (DE) analysis. However, no consensus has been reached on the proper fold change and adjusted p-value thresholds for filtering differentially expressed genes (DEGs). It is generally believed that the more stringent the filtering threshold, the more reliable the result of a DE analysis. Nevertheless, by analyzing the impact of both adjusted p-value and fold change thresholds on DE analyses of RNA-seq data for three cancer types from The Cancer Genome Atlas (TCGA) database, we found that, for a given sample size, the reproducibility of DE results became poorer when more stringent thresholds were applied. Whichever threshold level was applied, the overlap rates of DEGs were generally lower for small sample sizes than for large sample sizes. Analysis of the raw read counts showed that the expression of the same gene varied widely across samples, whether in tumor groups or in normal groups, which caused drastic fluctuations in fold change values and adjusted p-values when different sets of samples were used. Overall, more stringent thresholds did not yield more reliable DEGs because of the high variation in transcript expression, and the reliability of DEGs obtained with small sample sizes was more susceptible to this variation. Therefore, less stringent thresholds are recommended for screening DEGs. Moreover, large sample sizes should be considered in RNA-seq experimental designs to reduce the interfering effect of variation in transcript expression on DEG identification.
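
The threshold comparison described in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the gene names and the log2 fold change and adjusted p-value numbers are invented toy values (not TCGA data and not results from the paper), the helper names `filter_degs` and `overlap_rate` are hypothetical, and the overlap rate is computed here as intersection over union, which may differ from the definition the authors used.

```python
from typing import Dict, Set, Tuple

# gene -> (log2 fold change, adjusted p-value).
# Toy values invented for illustration only; they are not from the paper.
DEResult = Dict[str, Tuple[float, float]]


def filter_degs(result: DEResult, min_abs_log2fc: float, max_padj: float) -> Set[str]:
    """Return genes passing both the fold change and adjusted p-value thresholds."""
    return {
        gene
        for gene, (log2fc, padj) in result.items()
        if abs(log2fc) >= min_abs_log2fc and padj <= max_padj
    }


def overlap_rate(a: Set[str], b: Set[str]) -> float:
    """Intersection over union of two DEG sets (an assumed definition)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


# Two hypothetical DE runs on different sample subsets of the same cohort.
# Expression variation between subsets shifts each gene's estimates slightly.
run_a: DEResult = {
    "GENE1": (3.1, 1e-6),
    "GENE2": (2.2, 0.004),
    "GENE3": (1.4, 0.03),
    "GENE4": (1.1, 0.04),
    "GENE5": (0.3, 0.60),
}
run_b: DEResult = {
    "GENE1": (2.9, 1e-5),
    "GENE2": (1.7, 0.02),
    "GENE3": (1.2, 0.045),
    "GENE4": (1.3, 0.02),
    "GENE5": (0.2, 0.70),
}

# Lenient thresholds: |log2FC| >= 1 and adjusted p <= 0.05.
lenient = overlap_rate(filter_degs(run_a, 1.0, 0.05), filter_degs(run_b, 1.0, 0.05))
# Strict thresholds: |log2FC| >= 2 and adjusted p <= 0.01.
strict = overlap_rate(filter_degs(run_a, 2.0, 0.01), filter_degs(run_b, 2.0, 0.01))

print(f"lenient overlap: {lenient:.2f}")
print(f"strict overlap:  {strict:.2f}")
```

In this toy case GENE2 sits near the strict cutoffs, so a small shift between subsets moves it in and out of the DEG list and the strict overlap drops below the lenient one. This only mirrors the mechanism the abstract describes; it says nothing about how large the effect is in real data.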

Journal: Annals of Human Genetics
Publication Date: Aug 3, 2021
Citations: 3
