Optimal Sparse Segment Identification With Application in Copy Number Variation Analysis

X Jessie Jeng,T Tony Cai,Hongzhe Li

doi:10.1198/jasa.2010.tm10083

Abstract

Motivated by DNA copy number variation (CNV) analysis based on high-density single nucleotide polymorphism (SNP) data, we consider the problem of detecting and identifying sparse short segments in a long one-dimensional sequence of data with additive Gaussian white noise, where the number, length, and location of the segments are unknown. We present a statistical characterization of the identifiable region of a segment where it is possible to reliably separate the segment from noise. An efficient likelihood ratio selection (LRS) procedure for identifying the segments is developed, and the asymptotic optimality of this method is presented in the sense that the LRS can separate the signal segments from the noise as long as the signal segments are in the identifiable regions. The proposed method is demonstrated with simulations and analysis of a real dataset on identification of copy number variants based on high-density SNP data. The results show that the LRS procedure can yield greater gain in power for detecting the true segments than some standard signal identification methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Optimal Sparse Segment Identification With Application in Copy Number Variation Analysis

Abstract

Talk to us

Similar Papers

More From: Journal of the American Statistical Association

Lead the way for us

Journal: Journal of the American Statistical Association	Publication Date: Sep 1, 2010
Citations: 72

Similar Papers

Impact of linkage disequilibrium heterogeneity along the genome on genomic prediction and heritability estimation
Duanyang Ren ... Jinyan Teng
Genetics Selection Evolution | VOL. 54
Duanyang Ren, et. al.Duanyang Ren ... Jinyan Teng
27 Jun 2022
Genetics Selection Evolution | VOL. 54

Impact of Marker Pruning Strategies Based on Different Measurements of Marker Distance on Genomic Prediction in Dairy Cattle.
Duanyang Ren ... Jiaqi Li
Animals | VOL. 11
Duanyang Ren, et. al.Duanyang Ren ... Jiaqi Li
02 Jul 2021
Animals | VOL. 11

Innovative technology for cancer risk analysis
S Tommas ... S De Summa
Annals of Oncology | VOL. 22
S Tommas, et. al.S Tommas ... S De Summa
01 Jan 2010
Annals of Oncology | VOL. 22

A genome-wide scan for copy number variations using high-density single nucleotide polymorphism array in Simmental cattle.
Yang Wu ... Hongyan Ren
Animal Genetics | VOL. 46
Yang Wu, et. al.Yang Wu ... Hongyan Ren
27 Apr 2015
Animal Genetics | VOL. 46

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Optimal Sparse Segment Identification With Application in Copy Number Variation Analysis

Abstract

Talk to us

Similar Papers

More From: Journal of the American Statistical Association