Phenotype Ontology Terms Research Articles

BackgroundThere are approximately 8,000 different rare diseases that affect roughly 400 million people worldwide. Many of them suffer from delayed diagnosis. Ciliopathies are rare monogenic disorders characterized by a significant phenotypic and genetic heterogeneity that raises an important challenge for clinical diagnosis. Diagnosis support systems (DSS) applied to electronic health record (EHR) data may help identify undiagnosed patients, which is of paramount importance to improve patients’ care. Our objective was to evaluate three online-accessible rare disease DSSs using phenotypes derived from EHRs for the diagnosis of ciliopathies.MethodsTwo datasets of ciliopathy cases, either proven or suspected, and two datasets of controls were used to evaluate the DSSs. Patient phenotypes were automatically extracted from their EHRs and converted to Human Phenotype Ontology terms. We tested the ability of the DSSs to diagnose cases in contrast to controls based on Orphanet ontology.ResultsA total of 79 cases and 38 controls were selected. Performances of the DSSs on ciliopathy real world data (best DSS with area under the ROC curve = 0.72) were not as good as published performances on the test set used in the DSS development phase. None of these systems obtained results which could be described as “expert-level”. Patients with multisystemic symptoms were generally easier to diagnose than patients with isolated symptoms. Diseases easily confused with ciliopathy generally affected multiple organs and had overlapping phenotypes. Four challenges need to be considered to improve the performances: to make the DSSs interoperable with EHR systems, to validate the performances in real-life settings, to deal with data quality, and to leverage methods and resources for rare and complex diseases.ConclusionOur study provides insights into the complexities of diagnosing highly heterogenous rare diseases and offers lessons derived from evaluation existing DSSs in real-world settings. These insights are not only beneficial for ciliopathy diagnosis but also hold relevance for the enhancement of DSS for various complex rare disorders, by guiding the development of more clinically relevant rare disease DSSs, that could support early diagnosis and finally make more patients eligible for treatment.

Read full abstract

Abstract Background and Aims We aim to derive and validate a score estimating the probability of identifying a variant explicative of the renal phenotype using Exome Sequencing (ES) in adults with CKD of uncertain origin. Method Participants: prospective cohort study including all consecutive index patients with a CKD from uncertain origin in three French nephrology units (metropolitan: Sorbonne University Hospitals, Paris, and Conception Hospital, Marseille; overseas: La Réunion University Hospital, Réunion island) who underwent ES between October 11th, 2017 and May 31st, 2023. Outcome measure: identification of a causal variant using ES data, according to the American College of Medical Genetics and Genomics diagnostic criteria. Candiate variables and feature engineering: raw data regarding patients’ phenotypes was prospectively entered in a collection form at the time of ES by the prescribing physician. Structured data (yes/no questions and lists) was mapped to Human Phenotype Ontology (HPO) terms. Unstructured data (free text) was manually translated into HPO terms by the same investigator (nephrologist), which was blinded to ES results. First, second and third-order ancestor HPO terms were included in the analysis. Score's derivation: an optimal weighted average of multiple models (“ensemble model”) was specified, following guidelines provided in a recently published article (PMID:36905602). Score's validation: internal validation using 10-fold cross validation, followed by internal-external validation (data from 2/3 centers was used to develop a model which was then tested using data from the remaining center; this process was repeated in each center). Results We included 2,490 patients, 560/2,490 (22.5%) of whom had a causal variant identified by ES. We collected 1,028 distinct HPO terms describing the patients’ phenotypes. The most common term was “Hypertension”, occurring in 2,079/2,490 (83.5%) patients. In internal validation, the score showed accurate calibration and discrimination (area under the receiver operating characteristics curve (AUC): 0.71, 95% CI 0.68 to 0.73; index of precision accuracy (IPA): 0.11, 95% CI 0.09 to 0.13). Performances were moderately diminished in internal-external validation but remained satisfactory (Tenon: AUC 0.64, IPA 0.04; Marseille: AUC 0.67, IPA 0.04; La Réunion: 0.68, IPA 0.09). Conclusion We derived, internally and internally-externally validated a clinical score which accurately predicts the probability of obtaining a genetic diagnosis using Exome Sequencing. Feature engineering could further increase the score's performances. Automatic computation by clinicians using an interactive tool is achievable and could be of clinical utility, especially to guide resource allocation in case of a low pretest probability.

Read full abstract

Phenotype Ontology Terms Research Articles

Related Topics

Articles published on Phenotype Ontology Terms

Enhancing human phenotype ontology term extraction through synthetic case reports and embedding-based retrieval: A novel approach for improved biomedical data annotation

Leveraging Clinical Intuition to Improve Accuracy of Phenotype-Driven Prioritization

Phenotypic spectrum of dual diagnoses in developmental disorders

Quantitative proteomics of patient fibroblasts reveal biomarkers and diagnostic signatures of mitochondrial disease.

NeoGx: Machine-Recommended Rapid Genome Sequencing for Neonates.

Objectivizing issues in the diagnosis of complex rare diseases: lessons learned from testing existing diagnosis support systems on ciliopathies

#958 Development and Validation of a Score Estimating the Probability of obtaining a genetic diagnosis in adults with CKD of uncertain origin

Variants in mitochondrial disease genes are common causes of inherited peripheral neuropathies

Diagnostic yield after next-generation sequencing in pediatric cardiovascular disease

An AI-based approach driven by genotypes and phenotypes to uplift the diagnostic yield of genetic diseases.

Multi-center implementation of rapid whole genome sequencing provides additional evidence of its utility in the pediatric inpatient setting.

A step-by-step, multidisciplinary strategy to maximize the yield of genetic testing in pediatric patients with chronic kidney diseases.

Aspirin-exacerbated respiratory disease is associated with variants in filaggrin, epithelial integrity, and cellular interactions

Unbiased phenotype and genotype matching maximizes gene discovery and diagnostic yield

MSeqDR Quick-Mitome (QM): Combining Phenotype-Guided Variant Interpretation and Machine Learning Classifiers to Aid Primary Mitochondrial Disease Genetic Diagnosis.

Gene editing and cardiac disease modelling for the interpretation of genetic variants of uncertain significance in congenital heart disease

Term-BLAST-like alignment tool for concept recognition in noisy clinical texts.

Natural language processing and expert follow-up establishes tachycardia association with CDKL5 deficiency disorder

Optimal Protocols and Management of Clinical and Genomic Data Collection to Assist in the Early Diagnosis and Treatment of Multiple Congenital Anomalies.

ClinPrior: an algorithm for diagnosis and novel gene discovery by network-based prioritization

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Phenotype Ontology Terms Research Articles

Related Topics

Articles published on Phenotype Ontology Terms

Enhancing human phenotype ontology term extraction through synthetic case reports and embedding-based retrieval: A novel approach for improved biomedical data annotation

Leveraging Clinical Intuition to Improve Accuracy of Phenotype-Driven Prioritization

Phenotypic spectrum of dual diagnoses in developmental disorders

Quantitative proteomics of patient fibroblasts reveal biomarkers and diagnostic signatures of mitochondrial disease.

NeoGx: Machine-Recommended Rapid Genome Sequencing for Neonates.

Objectivizing issues in the diagnosis of complex rare diseases: lessons learned from testing existing diagnosis support systems on ciliopathies

#958 Development and Validation of a Score Estimating the Probability of obtaining a genetic diagnosis in adults with CKD of uncertain origin

Variants in mitochondrial disease genes are common causes of inherited peripheral neuropathies

Diagnostic yield after next-generation sequencing in pediatric cardiovascular disease

An AI-based approach driven by genotypes and phenotypes to uplift the diagnostic yield of genetic diseases.

Multi-center implementation of rapid whole genome sequencing provides additional evidence of its utility in the pediatric inpatient setting.

A step-by-step, multidisciplinary strategy to maximize the yield of genetic testing in pediatric patients with chronic kidney diseases.

Aspirin-exacerbated respiratory disease is associated with variants in filaggrin, epithelial integrity, and cellular interactions

Unbiased phenotype and genotype matching maximizes gene discovery and diagnostic yield

MSeqDR Quick-Mitome (QM): Combining Phenotype-Guided Variant Interpretation and Machine Learning Classifiers to Aid Primary Mitochondrial Disease Genetic Diagnosis.

Gene editing and cardiac disease modelling for the interpretation of genetic variants of uncertain significance in congenital heart disease

Term-BLAST-like alignment tool for concept recognition in noisy clinical texts.

Natural language processing and expert follow-up establishes tachycardia association with CDKL5 deficiency disorder

Optimal Protocols and Management of Clinical and Genomic Data Collection to Assist in the Early Diagnosis and Treatment of Multiple Congenital Anomalies.

ClinPrior: an algorithm for diagnosis and novel gene discovery by network-based prioritization