Resource profile and user guide of the Polygenic Index Repository.

Joël Becker ,Kathleen Mullan Harris ,Peter M Visscher ,Matthew H Mcintyre ,Katarzyna Bryc ,Babak Alipanahi ,Steven J Pitts ,Andrew Steptoe ,Nadia K Litterman ,Janie F Shelton ,Adam Auton ,Benjamin Williams ,Nancy Wang ,Jonathan P Beauchamp ,Grant Goldman ,Philipp Koellinger ,Richie Poulton ,Daniel W Belsky ,Vladimir Vacic ,Daniel J Benjamin ,Patrick Turley ,Michelle N Meyer ,Matt Mcgue ,Magnus Johannesson ,Joanna L Mountain ,Suyash Shringarpure ,Hariharan Jayashankar ,David Laibson ,Carrie A M Northover ,Sarah L Elson ,Joyce Y Tung ,Catherine H Wilson ,Patrik K E Magnusson ,Olesya Ajnakina ,Jennifer C Mccreight ,Elliot M Tucker‐Drob ,Lili Milani ,Richard Karlsson Linnér ,Terrie E Moffitt ,Avshalom Caspi ,Robert K Bell ,J Fah Sathirapongsasuti ,Sven Oskarsson ,Aaron Kleinman ,Chao Tian ,Olga V Sazonova ,David L Corcoran ,Jeremy Freese ,Nicholas A Furlotte ,Tõnu Esko ,David A Hinds ,Pierre Fontanillas ,Pamela Herd ,William G Iacono ,Rafael Ahlskog ,David Cesarini ,K Paige Harden ,Michael H Bennett ,Casper A.p Burik ,Karen E Huber ,Travis T Mallard ,Alexander I Young ,Karen Sugden ,Michelle Agee ,Aysu Okbay

doi:10.1038/s41562-021-01119-3

Abstract

Polygenic indexes (PGIs) are DNA-based predictors. Their value for research in many scientific disciplines is growing rapidly. As a resource for researchers, we used a consistent methodology to construct PGIs for 47 phenotypes in 11 datasets. To maximize the PGIs' prediction accuracies, we constructed them using genome-wide association studies-some not previously published-from multiple data sources, including 23andMe and UK Biobank. We present a theoretical framework to help interpret analyses involving PGIs. A key insight is that a PGI can be understood as an unbiased but noisy measure of a latent variable we call the 'additive SNP factor'. Regressions in which the true regressor is this factor but the PGI is used as its proxy therefore suffer from errors-in-variables bias. We derive an estimator that corrects for the bias, illustrate the correction, and make a Python tool for implementing it publicly available.

Full Text