Genes In RNA-seq Data Research Articles

Forkhead box protein M1 (FOXM1) is a member of the forkhead superfamily of transcription factors. It plays numerous critical roles in cancer development and progression, such as the regulation of G2/M transitions of cell cycle, anti-apoptosis, DNA damage repair, invasion, and drug resistance. In a pan-cancer meta-analysis of mRNA expression signatures from ∼18,000 human tumors with overall survival outcomes across 39 malignancies, over expression of the FOXM1 was a major predictor of adverse outcomes and appeared anti-correlated with tumor immune cell populations (Nat Med. 2015;21(8):938-945). Thus, we examined the prognostic and biological role of FOXM1 in non-small cell lung cancer (NSCLC). We examined the prognostic impact of FOXM1 expression in lung cancer patients using a genomic database (UTSW – “Lung Cancer Explorer” and KM-Plotter). We also assessed the expression of FOXM1 in NSCLC lines, and human bronchial epithelial cells (HBECs), and assessed the association between FOXM1 and cell cycle related genes in RNA-seq data. We examined the effect of the downregulation of FOXM1 on tumor proliferation and colony formation of lung adenocarcinoma cell lines by short hairpin RNA (Tet-pLKO-FOXM1-shRNA), and the effect of the downregulation of FOXM1 on tumor 3D spheroid formation in spheroid-forming cell lines using Nunclon Sphera. High FOXM1 expression correlated with poor prognosis in NSCLC patients who underwent surgical resection, especially adenocarcinoma. FOXM1 expression in lung cancer cell lines was also much higher than in HBECs. The expression level of FOXM1 was significantly correlated with cell cycle regulator genes such as CCNB1, CCNA2, and PLK1 in both tissue samples (Lung cancer explorer) and cell lines (our data). shRNA mediated reduction of FOXM1 expression significantly inhibited tumor cell proliferation and colony formation of NSCLC cells. Additionally, in the 3D spheroid formation assay, FOXM1 knockdown altered spheroid morphology. FOXM1 was over expressed in NSCLC compared to normal lung epithelial cells, and high tumor FOXM1 expression was prognostic of poor survival in patients with NSCLC who underwent surgical resection, especially in patients with adenocarcinoma. FOXM1 expression correlated with the expression of regulators of the G2/M transition, and functional knockdown studies demonstrate an important role in tumor proliferation, colony formation, and 3D spheroid formation. Overall, our results suggest that FOXM1 has important roles in NSCLC growth, while data from other groups using “CIBERSORT” analyses suggest a possible role for FOXM1 in tumor immune cells infiltration, collectively indicating that FOXM1 is a potential target for the treatment of lung cancer adenocarcinoma.

Abstract Introduction RNA sequencing (RNA-seq) has rapidly become one of the main methods to study transcriptome. It has become an important and popular issue to identify biomarkers based on their differential expression patterns in NGS data. Currently, more than 10 different algorithms can be used to detect differentially expressed genes. However, challenge arises, when trying to choose the best algorithm under different experimental conditions. To address this issue, we aimed to perform a series of simulations under distinct scenarios, which can help researchers to select the best algorithm according to their data types and structures. Method RNA-Seq data were simulated by a published method named as flux-simulator. Three parameters including the proportion of differentially expressed genes, the relative fold changes of differentially expressed genes, and the replicate number of samples, were considered to define distinct simulation scenarios. To make the parameters more practical, we determined the parameters based on a real dataset. Three types of differentially expressed genes were simulated according to their fold changes between two groups. A total of 7 algorithms including DESeq, DESeq2, DEGseq, edgeR, limma, baySeq and Cuffdiff, were compared and evaluated. The raw read count table was analyzed in all algorithms, except Cuffdiff. In order to avoid errors from performing the alignment, the read count table is obtained directly from the simulated fastq file. Results Specificity and sensitivity were calculated and compared in different scenarios. As expected, the more replicate count is, the higher accuracy is obtained in all algorithms. Although previously studies showed that there were marginal effect between replicate numbers and number of reported differentially expressed genes, our results did not demonstrate such phenomenon. It might be attributed to that the replicate number is not large enough. In addition, the results showed limma, edgeR and DESeq were more conservative than the other algorithms, DEGseq has highest accuracy but follow with lower specificity when a large amount of genes are differentially expressed. Over all, edgeR shows the best trade off within sensitivity and specificity. Further research efforts are warranted to compare algorithms in different scenarios, especially the number of differentially expressed genes is low. Conclusion In conclusion, this study provided a direct comparison of different algorithms under different experimental scenarios. The results showed that edgeR algorithm worked better in the scenario of finding novel genes in new disease. With its high precision, it will be more efficiency to validate identified gene through experiment. Citation Format: Chin-Ting Wu, Mong-Hsun Tsai, Tzu-Pin Lu, Liang-Chuan Lai, Eric Y. Chuang. Performances evaluation of algorithms for identifying differentially expressed genes in RNA-seq data. [abstract]. In: Proceedings of the 106th Annual Meeting of the American Association for Cancer Research; 2015 Apr 18-22; Philadelphia, PA. Philadelphia (PA): AACR; Cancer Res 2015;75(15 Suppl):Abstract nr 4852. doi:10.1158/1538-7445.AM2015-4852

Genes In RNA-seq Data Research Articles

Related Topics

Articles published on Genes In RNA-seq Data

Integrating single-cell RNA-sequencing and bulk RNA-sequencing data to explore the role of mitophagy-related genes in prostate cancer

Phenylalanine Ammonia-Lyase (PAL) Genes Family in Wheat (Triticum aestivum L.): Genome-Wide Characterization and Expression Profiling

Impact of MAPT mutations on transcriptomic signatures of FTLD brains and patient-derived pluripotent cell models.

Investigation of the putative role of antisense transcripts as regulators of sense transcripts by correlation analysis of sense-antisense pairs in colorectal cancers.

Genome Wide Identification, Characterization, and Expression Analysis of YABBY-Gene Family in WHEAT (Triticum aestivum L.)

Visualization methods for differential expression analysis

Sample size calculations for the differential expression analysis of RNA-seq data using a negative binomial regression model.

MafA Expression Preserves Immune Homeostasis in Human and Mouse Islets.

DREAMSeq: An Improved Method for Analyzing Differentially Expressed Genes in RNA-seq Data.

P3.03-14 Downregulation of FOXM1 Inhibits Tumor Proliferation, Colony Formation and Spheroid Formation of Non-Small Cell Lung Cancer

Variance component testing for identifying differentially expressed genes in RNA-seq data.

PowsimR: power analysis for bulk and single cell RNA-seq experiments.

Marginal likelihood estimation of negative binomial parameters with applications to RNA-seq data.

A two-step integrated approach to detect differentially expressed genes in RNA-Seq data.

Identification of Differentially Expressed Genes in RNA-seq Data of Arabidopsis thaliana: A Compound Distribution Approach.

FuMa: reporting overlap in RNA-seq detected fusion genes.

Abstract 4852: Performances evaluation of algorithms for identifying differentially expressed genes in RNA-seq data

Effect of low-expression gene filtering on detection of differentially expressed genes in RNA-seq data.

A balanced method detecting differentially expressed genes for RNA-sequencing data

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Genes In RNA-seq Data Research Articles

Related Topics

Articles published on Genes In RNA-seq Data

Integrating single-cell RNA-sequencing and bulk RNA-sequencing data to explore the role of mitophagy-related genes in prostate cancer

Phenylalanine Ammonia-Lyase (PAL) Genes Family in Wheat (Triticum aestivum L.): Genome-Wide Characterization and Expression Profiling

Impact of MAPT mutations on transcriptomic signatures of FTLD brains and patient-derived pluripotent cell models.

Investigation of the putative role of antisense transcripts as regulators of sense transcripts by correlation analysis of sense-antisense pairs in colorectal cancers.

Genome Wide Identification, Characterization, and Expression Analysis of YABBY-Gene Family in WHEAT (Triticum aestivum L.)

Visualization methods for differential expression analysis

Sample size calculations for the differential expression analysis of RNA-seq data using a negative binomial regression model.

MafA Expression Preserves Immune Homeostasis in Human and Mouse Islets.

DREAMSeq: An Improved Method for Analyzing Differentially Expressed Genes in RNA-seq Data.

P3.03-14 Downregulation of FOXM1 Inhibits Tumor Proliferation, Colony Formation and Spheroid Formation of Non-Small Cell Lung Cancer

Variance component testing for identifying differentially expressed genes in RNA-seq data.

PowsimR: power analysis for bulk and single cell RNA-seq experiments.

Marginal likelihood estimation of negative binomial parameters with applications to RNA-seq data.

A two-step integrated approach to detect differentially expressed genes in RNA-Seq data.

Identification of Differentially Expressed Genes in RNA-seq Data of Arabidopsis thaliana: A Compound Distribution Approach.

FuMa: reporting overlap in RNA-seq detected fusion genes.

Abstract 4852: Performances evaluation of algorithms for identifying differentially expressed genes in RNA-seq data

Effect of low-expression gene filtering on detection of differentially expressed genes in RNA-seq data.

A balanced method detecting differentially expressed genes for RNA-sequencing data