We propose an approach for the identification of mutant genes for rare diseases in single cases of unknown etiology. All genes with rare biologically significant variants sorted from individual exome data are tested further for profiling of their spatial-temporal and cell/tissue specific expression compared to that of their paralogs. We developed a simple bioinformatics tool ("Essential Paralogue by Expression" (EPbE)) for such analysis. Here, we present rare clinical forms of early ataxia with cerebellar hypoplasia. Using whole-exome sequencing and the EPbE tool, we identified two novel mutant genes previously not associated with congenital human diseases. In Family I, the unique missense mutation (p.Lys258Glu) was found in the LRCH2 gene inherited in an X-linked manner. p.Lys258Glu occurs in the evolutionarily invariant site of the leucine-rich repeat domain of LRCH2. In Family II and Family III, the identical genetic variant was found in the CSMD1 gene inherited as an autosomal-recessive trait. The variant leads to amino acid substitution p.Gly2979Ser in a highly conserved region of the complement-interacting domain of CSMD1. The LRCH2 gene for Family I patients (in which congenital cerebellar hypoplasia was associated with demyelinating polyneuropathy) is expressed in Schwann and precursor Schwann cells and predominantly over its paralogous genes in the developing cerebellar cortex. The CSMD1 gene is predominantly expressed over its paralogous genes in the cerebellum, specifically in the period of late childhood. Thus, the comparative spatial-temporal expression of the selected genes corresponds to the neurological manifestations of the disease.
Read full abstract