Abstract

Next-generation sequencing (NGS) is a new approach for biomedical research, useful for the diagnosis of genetic diseases in extremely heterogeneous conditions. In this work, we describe how data generated by high-throughput NGS experiments can be analyzed to find single nucleotide polymorphisms (SNPs) in DNA samples of patients affected by neuromuscular disorders. In particular, we consider untagged pooled NGS data, where DNA samples of different individuals are combined in a single experiment, still providing information with an uncertainty limited to only two patients. At the moment, only few publications address the problem of SNPs detection in pooled experiments, and existing tools are often inaccurate. We propose a computational procedure consisting of two parts. In the first, data are filtered by means of decision rules. The second phase is based on a supervised classification technique. In the present work, we compare different de facto standard supervised and unsupervised procedures to identify and classify variants potentially related to muscular diseases, and we discuss results in terms of statistical and biological validation.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.