Abstract The Gabriella Miller Kids First Pediatric Research Program (Kids First) aims at facilitating researchers to uncover new insights into the biology of childhood cancer and structural birth defects, including the discovery of shared genetic pathways between these disorders. Kids First Data Resource Center developed the Kids First Data Resource Portal (KFDRP), which is a centralized data platform for both Kids First and collaborative cohorts. Currently, KFDRP facilitates data access for 40 studies with over 35,500 participants. To assist researchers and clinicians in identifying potential disease-causing germline variants in the human genome, KFDRP released the Variant WorkBench (VWB), which facilitates variant querying, manipulation, analysis, and visualization. Cavatica is a widely used bioinformatics analysis platform developed by Velsera. Compared to a previous version, the Cavatica-based VWB has better support for data import/export, more convenient sharing of notebooks and data, and easier billing details, with the same level of scalable, cloud-based computing. Additionally, Cavatica-based VWB has incorporated multiple public reference databases, including Cancer Hotspots, ClinVar, COSMIC, dbNSFP, gnomAD, TOPMed, as well as gene-phenotype links provided by OMIM, HPO, Orphanet, and the Deciphering Developmental Disorders Project. These databases have been converted to parquet files, reducing process time and memory usage compared to regular text files, thus speeding up data retrieval and querying. Furthermore, Cavatica empowers users to create and share custom analysis workflows tailored to specific research questions and datasets, enhancing data utility and analysis accuracy. Here we present a gene-based variant filtering workflow that aggregates information from dbNSFP, HGMD, TOPMed, gnomAD, and ClinVar. We identified 1202 pathogenic/likely pathogenic variants in NF1, a gene that helps regulate cell growth, across 582 pediatric brain tumor samples within 6 mins. Among the variants, there are 21 protein-altering variants. Overall, the upgraded VWB is more powerful, user-friendly, and adaptable.
Read full abstract