Abstract
Estimating the proportion of signals hidden in a large number of noise variables is a pervasive objective in scientific research. In this paper, we consider realistic, yet theoretically challenging scenarios with arbitrary covariance dependencies between variables. We quantify the overall level of covariance dependence using mean absolute correlation (MAC), and investigate the performance of a family of estimators across the full range of MAC values. We explore the joint effect of MAC dependence, signal sparsity, and signal intensity on estimator performance, and find that no single estimator in the family performs optimally across all MAC dependence levels. Based on this theoretical insight, we propose a new estimator that is better suited to arbitrary covariance dependencies. Our method compares favorably to several existing methods in a variety of finite-sample settings, including those with strong or weak covariance dependencies and real dependence structures from genetic association studies.
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.