Abstract

Sequencing the transcriptome can answer various questions such as determining the transcripts expressed in a given species for a specific tissue or condition, evaluating differential expression, discovering variants, and evaluating allele-specific expression. Differential expression evaluates the expression differences between different strains, tissues, and conditions. Allele-specific expression evaluates expression differences between parental alleles. Both differential expression and allele-specific expression have been studied for heterosis (hybrid vigor), where the hybrid has improved performance over the parents for one or more traits. The Allele Workbench software was developed for a heterosis study that evaluated allele-specific expression for a mouse F1 hybrid using libraries from multiple tissues with biological replicates. This software has been made into a distributable package, which includes a pipeline, a Java interface to build the database, and a Java interface for query and display of the results. The required input is a reference genome, annotation file, and one or more RNA-Seq libraries with optional replicates. It evaluates allelic imbalance at the SNP and transcript level and flags transcripts with significant opposite directional allele-specific expression. The Java interface allows the user to view data from libraries, replicates, genes, transcripts, exons, and variants, including queries on allele imbalance for selected libraries. To determine the impact of allele-specific SNPs on protein folding, variants are annotated with their effect (e.g., missense), and the parental protein sequences may be exported for protein folding analysis. The Allele Workbench processing results in transcript files and read counts that can be used as input to the previously published Transcriptome Computational Workbench, which has a new algorithm for determining a trimmed set of gene ontology terms. The software with demo files is available from https://code.google.com/p/allele-workbench. Additionally, all software is ready for immediate use from an Atmosphere Virtual Machine Image available from the iPlant Collaborative (www.iplantcollaborative.org).

Highlights

  • Sequencing the transcriptome can answer various questions such as determining the transcripts expressed in a given species for a specific tissue or under a certain condition, evaluating differential expression (DE), discovering variants, and evaluating allele-specific expression (ASE)

  • This manuscript is mainly focused on allele imbalance, the data can be used for DE analysis, and is briefly discussed

  • The naming of the raw files is very important, as the name will be retained through each step of the pipeline, and used in the AW program to build the database, which maps each file to its respective library conditions, e.g. strain and tissue

Read more

Summary

Introduction

Sequencing the transcriptome can answer various questions such as determining the transcripts expressed in a given species for a specific tissue or under a certain condition, evaluating differential expression (DE), discovering variants, and evaluating allele-specific expression (ASE). ASE studies analyze the difference in expression between the parental alleles, where if one allele has significantly greater expression, it is referred to as allele imbalance (AI). Both ASE and DE require that the reads be trimmed and aligned to the reference genome. DE further requires the number of reads aligned to each transcript and a test for significance. This manuscript is mainly focused on allele imbalance, the data can be used for DE analysis, and is briefly discussed

Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.