Bioinformatics pipeline for the systematic mining genomic and proteomic variation linked to rare diseases: The example of monogenic diabetes.

Ksenia G Kuznetsova, Dafni Skiadopoulou, Alisa Manning, Jakub Vašíček, Janne Molnes, Stefan Johansson, Miriam Udler, Pål Rasmus Njølstad, Marc Vaudel

doi:10.1371/journal.pone.0300350

Journal: PLOS ONE
Publication Date: Apr 18, 2024
Citations: 2
License type: CC BY 4.0

Abstract

Monogenic diabetes is a group of diseases caused by rare variants in single genes. As with other rare diseases, multiple genes have been linked to monogenic diabetes with different measures of pathogenicity, but information on these genes and variants is not unified across resources, which makes it difficult to process computationally. We have developed an automated pipeline for collecting and harmonizing data on genetic variants linked to monogenic diabetes. Furthermore, we have translated the variant gene sequences into protein sequences, accounting for all protein isoforms and their variants. This allows researchers to consolidate information on variant genes and proteins linked to monogenic diabetes and facilitates their study using proteomics or structural biology. Our open and flexible implementation in Jupyter notebooks allows the pipeline to be tailored, modified, and applied to other rare diseases.
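
The step of translating variant gene sequences into protein sequences for every isoform can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names (`translate`, `apply_snv`, `variant_proteins`), the toy coding sequences, and the isoform labels are all hypothetical, and the sketch assumes a single-nucleotide substitution given as a 1-based position within each isoform's coding sequence.

```python
# Minimal sketch (hypothetical names and toy data, not the published pipeline):
# apply one single-nucleotide variant to each isoform's coding sequence (CDS)
# and translate the result with the standard genetic code.

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard codon table built in TCAG order; '*' marks a stop codon.
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def apply_snv(cds: str, pos: int, ref: str, alt: str) -> str:
    """Substitute one base at a 1-based CDS position after checking the reference."""
    if cds[pos - 1].upper() != ref.upper():
        raise ValueError(f"reference mismatch at position {pos}")
    return cds[:pos - 1] + alt + cds[pos:]


def variant_proteins(isoforms: dict, pos: int, ref: str, alt: str) -> dict:
    """Return the variant protein sequence for every isoform carrying the site."""
    result = {}
    for name, cds in isoforms.items():
        if pos <= len(cds) and cds[pos - 1].upper() == ref.upper():
            result[name] = translate(apply_snv(cds, pos, ref, alt))
    return result


# Toy isoforms (hypothetical sequences, for illustration only).
toy_isoforms = {
    "isoform_1": "ATGGCCAAGTGA",
    "isoform_2": "ATGGCCAAGGGCTGA",
}
variants = variant_proteins(toy_isoforms, pos=5, ref="C", alt="T")
```

In practice each isoform has its own coordinate system, so a genomic variant must first be mapped to a CDS position per transcript; the sketch skips that mapping and reuses one position for both toy isoforms.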

Full Text