Systematic identification of intron retention associated variants from massive publicly available transcriptome sequencing data

Yuichi Shiraishi,Ai Okada,Kenichi Chiba,Asuka Kawachi,Ikuko Omori,Raúl Nicolás Mateos,Naoko Iida,Hirofumi Yamauchi,Kenjiro Kosaki,Akihide Yoshimi

doi:10.1038/s41467-022-32887-9

Yuichi Shiraishi, Ai Okada + Show 8 more

Open Access

https://doi.org/10.1038/s41467-022-32887-9

Copy DOI

Abstract

Many disease-associated genomic variants disrupt gene function through abnormal splicing. With the advancement of genomic medicine, identifying disease-associated splicing associated variants has become more important than ever. Most bioinformatics approaches to detect splicing associated variants require both genome and transcriptomic data. However, there are not many datasets where both of them are available. In this study, we develop a methodology to detect genomic variants that cause splicing changes (more specifically, intron retention), using transcriptome sequencing data alone. After evaluating its sensitivity and precision, we apply it to 230,988 transcriptome sequencing data from the publicly available repository and identified 27,049 intron retention associated variants (IRAVs). In addition, by exploring positional relationships with variants registered in existing disease databases, we extract 3,000 putative disease-associated IRAVs, which range from cancer drivers to variants linked with autosomal recessive disorders. The in-silico screening framework demonstrates the possibility of near-automatically acquiring medical knowledge, making the most of massively accumulated publicly available sequencing data. Collections of IRAVs identified in this study are available through IRAVDB (https://iravdb.io/).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Nature communications	Publication Date: Sep 29, 2022
Citations: 13	License type: open-access

R Discovery Prime

R Discovery Prime

Systematic identification of intron retention associated variants from massive publicly available transcriptome sequencing data

Abstract

Talk to us

Similar Papers

More From: Nature communications

Lead the way for us

Similar Papers

The importance of genomic predictors for clinical outcome of hematological malignancies
Cunte Chen ... Chengwu Zeng
Blood Science | VOL. 3
Cunte Chen, et. al.Cunte Chen ... Chengwu Zeng
01 Jul 2021
Blood Science | VOL. 3

An improved de novo genome assembly of the common marmoset genome yields improved contiguity and increased mapping rates of sequence data
Vasanthan Jayakumar ... Yasubumi Sakakibara
BMC Genomics | VOL. 21
Vasanthan Jayakumar, et. al.Vasanthan Jayakumar ... Yasubumi Sakakibara
01 Apr 2020
BMC Genomics | VOL. 21

Abstract 2285: Regtools: Integrated analysis of genomic and transcriptomic data for discovery of mutations associated with aberrant splicing in cancer
Yang-Yang Feng ... Malachi Griffith
Cancer Research | VOL. 78
Yang-Yang Feng, et. al.Yang-Yang Feng ... Malachi Griffith
01 Jul 2018
Cancer Research | VOL. 78

Abstract 5647: Intron retention as a novel source of tumor neoantigens associated with response to checkpoint inhibitor therapy
Alicia C Smart ... Jihye Park
Cancer Research | VOL. 77
Alicia C Smart, et. al.Alicia C Smart ... Jihye Park
01 Jul 2017
Cancer Research | VOL. 77

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Systematic identification of intron retention associated variants from massive publicly available transcriptome sequencing data

Abstract

Talk to us

Similar Papers

More From: Nature communications