Rare germline copy number variants (CNVs) and breast cancer risk

Joe Dennis, Anna González‐Neira, Celine M Vachon, Wei Zheng, Taru Muranen, Jose E Castelao, Rachel A Murphy, Heather Thorne, Emmanouil Saloustros, Desmond Yip, Jacques Simard, Argyrios Ziogas, Logan C Walker, Renske Keeman, Kristine Kleivi Sahlberg, Georgia Chenevix‐Trench, Stig E Bojesen, Fergus J Couch, Rita K Schmutzler, Audrey Jung, Thérèse Truong, Yon‐Dschun Ko, Marike Gabrielson, Alicja Ogrodniczak, Elana Martinez, Irene L Andrulis, Dinny Graham, Melissa A Troester, A Heather Eliassen, Rudolf Kaaks, Javier Benítez, Veli‐Matti Kosma, Håkan Olsson, Grethe I.g Alnæs, Arto Mannermaa, Rodney J Scott, Cari M Kitahara, Dimitrios Mavroudis, Janet E Olson, Kyriaki Michailidou, Manuela Gago-Domínguez, Montserrat García‐Closas, Inger Torhild Gram, John L Hopper, Douglas F Easton, Sara Margolin, Reiner Hoppe, Peter T Simpson, Gad Rennert, Mythily Sachchithananthan, Tjoung‐Won Park‐Simon, Peter A Fasching, Jenny Chang‐Claude, Camilla Wendt, Penny Soucy, Hoda Anton‐Culver, Diether Lambrechts, Xiaohong R Yang, Katri Pylkäs, Esther M John, Robert Winqvist, Lauren R Teras, Natalia Bogdanova, Natalia Antonenkova, Henrik Flyger, J Lacey, Antoinette Hollestelle, Nirmala Pathmanathan, Hermann Brenner, Elinor J Sawyer, Angela Cox, Siranoush Manoukian, Jane Carpenter, Jonathan P Tyrer, Stella Koutros, Judy Kirk, D Gareth Evans, Melissa C Southey, Ian Tomlinson, Stephen B Fox, Marina Bermisheva, Anne‐Lise Børresen‐Dale, Anna Defazio, Elza Khusnutdinova, Dale P Sandler, J Margriet Collée, Allison W Kurian, Marjanka K Schmidt, Leila Dorling, Alicja Wolk, Charles M Perou, Olivia Fletcher, Roger L Milne, Paolo Peterlongo, Bjørn Naume, Alison M Dunning, Ian G Campbell, Peter Kraft, Peter Devilee, Matthias W Beckmann, Qin Wang, Nichola Johnson, Mikael Eriksson, Heli Nevanlinna, Christopher A Haiman, Lin Fritschi, Rulla M Tamimi, Sabine Behrens, Ann Smeets, Kristan J Aronson, Robert N Baldassano, Rana Shibli, Jack A Taylor, Dijana Plaseska‐Karanfilska, Alison Davis, Deborah J Marsh, Paul Pharoah, Pascal Guénel, Martha S Linet, Anthony J Swerdlow, Graham G Giles, Michael E Jones, Thomas U Ahearn, Olav Engebråten, Anthony Howell, Jürgen Geisler, Agnes Jager, Mary Beth Terry, Christine L Clarke, Nicole L Larson, Vessela N Kristensen, Thilo Dörk, Eric Hahnen, Jonine D Figueroa, Katerina Kubelka-Sabit, Amanda B Spurdle, Kamila Czene, Per Hall, Anna Jakubowska, Geoffrey J Lindeman, Laura E Beane Freeman, Laure Dossus, Simon S Cross, Manjeet K Bolla, Volker Arndt

doi:10.1038/s42003-021-02990-6

Abstract

Germline copy number variants (CNVs) are pervasive in the human genome, but potential disease associations with rare CNVs have not been comprehensively assessed in large datasets. We analysed rare CNVs in genes and non-coding regions for 86,788 breast cancer cases and 76,122 controls of European ancestry with genome-wide array data. Gene burden tests detected the strongest association for deletions in BRCA1 (P = 3.7E−18). Nine other genes were associated with a p-value < 0.01, including the known susceptibility genes CHEK2 (P = 0.0008), ATM (P = 0.002) and BRCA2 (P = 0.008). Outside the known genes, we detected associations with p-values < 0.001 for either overall or subtype-specific breast cancer at nine deletion regions and four duplication regions. Three of the deletion regions were in established common susceptibility loci. To the best of our knowledge, this is the first genome-wide analysis of rare CNVs in a large breast cancer case-control dataset. We detected associations with exonic deletions in established breast cancer susceptibility genes. We also detected suggestive associations, with large effect sizes, for non-coding CNVs in known and novel loci. Larger sample sizes will be required to reach robust levels of statistical significance.

Highlights

Germline copy number variants (CNVs) are pervasive in the human genome but potential disease associations with rare CNVs have not been comprehensively assessed in large datasets
We recently developed a new CNV calling method, CamCNV[16], which focuses on rare CNVs and identifies outlier samples that may have a CNV, based on the intensity distribution across all samples at each probe
Duplications tended to be longer than deletions: for example, deletions called on OncoArray covered a mean of 45 Kilobases (Kb) (SD 106 Kb) over 9.8 probes (SD 17.2), while duplications covered a mean of 109 Kb (SD 202 Kb) over 18.9 probes (SD 36.5)

Summary

Introduction

Germline copy number variants (CNVs) are pervasive in the human genome, but potential disease associations with rare CNVs have not been comprehensively assessed in large datasets. We analysed rare CNVs in genes and non-coding regions for 86,788 breast cancer cases and 76,122 controls of European ancestry with genome-wide array data. Rare loss-of-function variants in susceptibility genes such as BRCA1 and CHEK2 are associated with a large increase in risk[6]. Large-scale genome-wide association studies (GWAS) have established breast cancer associations with common variants at more than 150 loci, mostly in non-coding regions[8,9,10,11]. We recently developed a new CNV calling method, CamCNV[16], which focuses on rare CNVs and identifies outlier samples that may have a CNV, based on the intensity distribution across all samples at each probe. We showed that this approach is able to detect CNVs using as few as three probes[16].
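
The idea of flagging outlier samples from the per-probe intensity distribution can be illustrated with a minimal Python sketch. This is not the CamCNV implementation: the function name `call_outlier_segments`, the z-score cut-off of -3 and the simple run-length rule are assumptions chosen for illustration only. The three-probe minimum echoes the figure quoted above. The sketch standardises each probe's intensity across all samples and reports runs of consecutive probes at which one sample is unusually low, as might be expected for a deletion.

```python
import numpy as np

def call_outlier_segments(intensity, z_cut=-3.0, min_probes=3):
    """Flag deletion-like runs of low-intensity probes (illustrative only).

    intensity: 2-D array, rows = samples, columns = probes in genomic order.
    Returns a list of (sample_index, first_probe, last_probe) tuples.
    z_cut and min_probes are hypothetical defaults, not published settings.
    """
    intensity = np.asarray(intensity, dtype=float)
    # Standardise each probe against the distribution across all samples.
    mean = intensity.mean(axis=0)
    sd = intensity.std(axis=0)
    sd[sd == 0] = np.nan  # probes with no variation can never be flagged
    flagged = (intensity - mean) / sd < z_cut

    n_samples, n_probes = flagged.shape
    calls = []
    for s in range(n_samples):
        run_start = None
        # Iterate one step past the last probe so a trailing run is closed.
        for p in range(n_probes + 1):
            is_flagged = p < n_probes and flagged[s, p]
            if is_flagged and run_start is None:
                run_start = p
            elif not is_flagged and run_start is not None:
                if p - run_start >= min_probes:
                    calls.append((s, run_start, p - 1))
                run_start = None
    return calls

# Synthetic example: 200 samples, 10 probes, one simulated deletion.
data = np.tile(np.linspace(-0.1, 0.1, 200)[:, None], (1, 10))
data[5, 2:6] -= 1.0  # sample 5 loses intensity at probes 2 to 5
print(call_outlier_segments(data))
```

Because the mean and standard deviation are computed across all samples, a rule of this kind only suits rare events: a common CNV would shift the reference distribution itself and mask the carriers.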

Methods

Results

Conclusion