DiCoExpress: a tool to process multifactorial RNAseq experiments from quality controls to co-expression analysis through differential analysis based on contrasts inside GLM models

Ilana Lambert,Stefano Colella,Marie-Laure Martin-Magniette,Christine Paysant-Le Roux

doi:10.1186/s13007-020-00611-7

Abstract

BackgroundRNAseq is nowadays the method of choice for transcriptome analysis. In the last decades, a high number of statistical methods, and associated bioinformatics tools, for RNAseq analysis were developed. More recently, statistical studies realised neutral comparison studies using benchmark datasets, shedding light on the most appropriate approaches for RNAseq data analysis.ResultsDiCoExpress is a script-based tool implemented in R that includes methods chosen based on their performance in neutral comparisons studies. DiCoExpress uses pre-existing R packages including FactoMineR, edgeR and coseq, to perform quality control, differential, and co-expression analysis of RNAseq data. Users can perform the full analysis, providing a mapped read expression data file and a file containing the information on the experimental design. Following the quality control step, the user can move on to the differential expression analysis performed using generalized linear models thanks to the automated contrast writing function. A co-expression analysis is implemented using the coseq package. Lists of differentially expressed genes and identified co-expression clusters are automatically analyzed for enrichment of annotations provided by the user. We used DiCoExpress to analyze a publicly available RNAseq dataset on the transcriptional response of Brassica napus L. to silicon treatment in plant roots and mature leaves. This dataset, including two biological factors and three replicates for each condition, allowed us to demonstrate in a tutorial all the features of DiCoExpress.ConclusionsDiCoExpress is an R script-based tool allowing users to perform a full RNAseq analysis from quality controls to co-expression analysis through differential analysis based on contrasts inside generalized linear models. DiCoExpress focuses on the statistical modelling of gene expression according to the experimental design and facilitates the data analysis leading the biological interpretation of the results.

Highlights

RNAseq is nowadays the method of choice for transcriptome analysis
We illustrate the use of DiCoExpress by analysing a dataset associated with the publication of Haddad et al [42]
Checking the quality control results in Brassica_napus_Data_Quality_Control. pdf output file, we observe a higher number of reads in the mature leaf samples compared to the root samples; Table 1 Target table of Brassica napus dataset in R

Summary

Introduction

RNAseq is nowadays the method of choice for transcriptome analysis. In the last decades, a high num‐ ber of statistical methods, and associated bioinformatics tools, for RNAseq analysis were developed. Multiple methods, based on different statistical modelling of data, are available to perform differential expression analysis. The linear models and their generalized extensions for negative binomial distributions (GLM) have been proposed to account for the versatility of multifactorial experiments. They are available in the R-package limma [12] for the linear models and in the R-packages edgeR [10] and DESeq2 [11] for the generalized linear models. Pearson’s or Spearman’s correlations, WGCNA (Weighted correlation network analysis) method [13], hierarchical clustering and K-means are the most conventional approaches found in the literature [14, 15]. A model selection criterion allows determining the most appropriate cluster number [16, 17]

Objectives

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Plant Methods	Publication Date: May 12, 2020
Citations: 34	License type: open-access

R Discovery Prime

R Discovery Prime

DiCoExpress: a tool to process multifactorial RNAseq experiments from quality controls to co-expression analysis through differential analysis based on contrasts inside GLM models

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Plant Methods

Lead the way for us

Similar Papers

Integrating differential expression, co-expression and gene network analysis for the identification of common genes associated with tumor angiogenesis deregulation
Beatriz Monterde ... Juan A.G Ranea
Journal of Biomedical Informatics | VOL. 144
Beatriz Monterde, et. al.Beatriz Monterde ... Juan A.G Ranea
12 Jun 2023
Journal of Biomedical Informatics | VOL. 144

Traces of SARS-CoV-2 RNA in Peripheral Blood Cells of Patients with COVID-19.
Ahmed Moustafa ... Ramy K Aziz
OMICS: A Journal of Integrative Biology | VOL. 25
Ahmed Moustafa, et. al.Ahmed Moustafa ... Ramy K Aziz
19 Jul 2021
OMICS: A Journal of Integrative Biology | VOL. 25

Abstract 1817: Differential expression of long non-coding RNA in colon adenocarcinoma RNA-sequence data set
Stephen J O'Brien ... Shesh Rai
Cancer Research | VOL. 79
Stephen J O'Brien, et. al.Stephen J O'Brien ... Shesh Rai
01 Jul 2019
Abstract 1817: Differential expression of long non-coding RNA in colon adenocarcinoma RNA-sequence data set
Stephen J O'Brien ... Shesh Rai

Differential expression analysis of RNA sequencing data by incorporating non-exonic mapped reads.
Hung-I Harry Chen ... Devanand Sarkar
BMC Genomics | VOL. Suppl 16 7
Hung-I Harry Chen, et. al.Hung-I Harry Chen ... Devanand Sarkar
11 Jun 2015
BMC Genomics | VOL. Suppl 16 7

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

DiCoExpress: a tool to process multifactorial RNAseq experiments from quality controls to co-expression analysis through differential analysis based on contrasts inside GLM models

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Plant Methods