Beyond Homozygosity Mapping: Family-Control analysis based on Hamming distance for prioritizing variants in exome sequencing.

Atsuko Imai,Somayyeh Fahiminiya,Martine Tétreault,Jurg Ott,Mark Lathrop,Yasushi Sakata,Akihiro Nakaya,Seiji Takashima,Jacek Majewski

doi:10.1038/srep12028

Abstract

A major challenge in current exome sequencing in autosomal recessive (AR) families is the lack of an effective method to prioritize single-nucleotide variants (SNVs). AR families are generally too small for linkage analysis, and length of homozygous regions is unreliable for identification of causative variants. Various common filtering steps usually result in a list of candidate variants that cannot be narrowed down further or ranked. To prioritize shortlisted SNVs we consider each homozygous candidate variant together with a set of SNVs flanking it. We compare the resulting array of genotypes between an affected family member and a number of control individuals and argue that, in a family, differences between family member and controls should be larger for a pathogenic variant and SNVs flanking it than for a random variant. We assess differences between arrays in two individuals by the Hamming distance and develop a suitable test statistic, which is expected to be large for a causative variant and flanking SNVs. We prioritize candidate variants based on this statistic and applied our approach to six patients with known pathogenic variants and found these to be in the top 2 to 10 percentiles of ranks.

Highlights

A major challenge in current exome sequencing in autosomal recessive (AR) families is the lack of an effective method to prioritize single-nucleotide variants (SNVs)
Homozygosity mapping is often applied to identify long runs of homozygosity[3], which may be interpreted as harboring segments of DNA identical by descent (IBD), but length alone is known to be a poor statistic for this purpose[4]
We developed a novel method to prioritize candidate variants in AR families based on direct comparison of segments of sequence variants between an affected family member and control individuals from the same population, that is, our approach works by comparing a single affected individual with a number of control individuals

Summary

Introduction

A major challenge in current exome sequencing in autosomal recessive (AR) families is the lack of an effective method to prioritize single-nucleotide variants (SNVs). We assess differences between arrays in two individuals by the Hamming distance and develop a suitable test statistic, which is expected to be large for a causative variant and flanking SNVs. We prioritize candidate variants based on this statistic and applied our approach to six patients with known pathogenic variants and found these to be in the top 2 to 10 percentiles of ranks. Because of paucity of crossovers very close to the disease locus, SNVs in its vicinity tend to be IBD and, homozygous[3] For this reason, we want to see whether distances between affected and control individuals are larger for true candidate variants than other candidate variants.

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Scientific reports	Publication Date: Jul 6, 2015
Citations: 11	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Beyond Homozygosity Mapping: Family-Control analysis based on Hamming distance for prioritizing variants in exome sequencing.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific reports

Lead the way for us

Similar Papers

HDR: a statistical two-step approach successfully identifies disease genes in autosomal recessive families.
Atsuko Imai ... Akira Ohtake
Journal of Human Genetics | VOL. 61
Atsuko Imai, et. al.Atsuko Imai ... Akira Ohtake
30 Jun 2016
Journal of Human Genetics | VOL. 61

Causality in Genetics
Ali J Marian
Circulation Research | VOL. 114
Ali J MarianAli J Marian
16 Jan 2014
Circulation Research | VOL. 114

Use of Clinical Exome Sequencing in Isolated Congenital Heart Disease.
Laura Zahavich ... Seema Mital
Circulation: Cardiovascular Genetics | VOL. 10
Laura Zahavich, et. al.Laura Zahavich ... Seema Mital
04 May 2017
Circulation: Cardiovascular Genetics | VOL. 10

Novel candidate genes and variants underlying autosomal recessive neurodevelopmental disorders with intellectual disability.
...
Human Genetics | VOL. 137
, et. al. ...
22 Aug 2018
Human Genetics | VOL. 137

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Beyond Homozygosity Mapping: Family-Control analysis based on Hamming distance for prioritizing variants in exome sequencing.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific reports