Abstract

BackgroundIn gene expression analysis, overlapping genes, splice variants, and fusion transcripts are potential sources of data analysis artefacts, depending on how the observed intensity is assigned to one, or more genes. We here exemplify this by an in-depth analysis of the INS-IGF2 fusion transcript, which has recently been reported to be among the highest expressed transcripts in human pancreatic beta cells and its protein indicated as a novel autoantigen in Type 1 Diabetes.ResultsThrough RNA sequencing and variant specific qPCR analyses we demonstrate that the true abundance of INS-IGF2 is >20,000 fold lower than INS in human beta cells, and we suggest an explanation to the nature of the artefacts which have previously led to overestimation of the gene expression level in selected studies. We reinvestigated the previous reported findings of detection of INS-IGF2 using antibodies both in Western blotting and immunohistochemistry. We found that the one available commercial antibody (BO1P) raised against recombinant INS-IGF2 show strong cross-reaction to native proinsulin, and we did not detect INS-IGF2 protein in the human beta cell line EndoC-βH1. Furthermore, using highly sensitive proteomics analysis we could not demonstrate INS-IGF2 protein in samples of human islets nor in EndoC-βH1.ConclusionsSequence features, such as fusion transcripts spanning multiple genes can lead to unexpected results in gene expression analysis, and care must be taken in generating and interpreting the results. For the specific case of INS-IGF2 we conclude that the abundance of the fusion transcript/protein is exceedingly lower than previously reported, and that current immuno-reagents available for detecting INS-IGF2 protein have a strong cross-reaction to native human proinsulin. Finally, we were unable to detect INS-IGF2 protein by proteomics analysis.Electronic supplementary materialThe online version of this article (doi:10.1186/s12867-015-0042-8) contains supplementary material, which is available to authorized users.

Highlights

  • In gene expression analysis, overlapping genes, splice variants, and fusion transcripts are potential sources of data analysis artefacts, depending on how the observed intensity is assigned to one, or more genes

  • Proteome analysis To finalise our conclusion that INS-IGF2 expression levels have been vastly overestimated, we investigated the abundance of the INS-IGF2 fusion protein using a strategy involving both immunochemical detection and mass spectrometry

  • The big question remains on how important this transcript is? And does it lead to the production of a novel beta cell specific protein? We show that the INS-IGF2 antibody as well as our proinsulin-specific antibody recognize an expected band of MW 22,000 in INS-IGF2 transduced cells (Hek293 and EndoC-βH1), as well as the native proinsulin band from the human beta cell line

Read more

Summary

Introduction

In gene expression analysis, overlapping genes, splice variants, and fusion transcripts are potential sources of data analysis artefacts, depending on how the observed intensity is assigned to one, or more genes. In the case of INS the issue is that a fusion transcript, INS-IGF2, is spanning the entire region (see Fig. 1 and Additional file 1: Figure S1), including both insulin exons as well as part of the adjacent IGF2-locus. This will lead to all signal in the region being mapped to INS-IGF2 and that INS will “disappear” from the analysis, and in certain cases so would IGF2 (Ensembl: ENSG00000167244). This will happen even if there are no probes targeting the unique part of the fusion transcript

Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.