BackgroundRelatively few studies have evaluated the role of host (germline) genetics in predicting outcome in diffuse large B-cell lymphoma (DLBCL), and of these, most were small candidate gene studies without replication and/or from the pre-R-CHOP era. We report the first genome-wide association study (GWAS) of outcome in DLBCL patients treated with rituximab and anthracycline-based chemotherapy. We used two independent studies for discovery, and validated the results in two additional independent studies. MethodsIn the discovery phase, we conducted a meta-analysis of two GWAS studies of DLBCL patients treated with rituximab and anthracycline-based chemotherapy, the first study as part of the prospective clinical trials of the LNH2003B program (N=540) of the Lymphoma Study Association (LYSA) and the second as part of the Molecular Epidemiology Resource (N=316), a prospective observational study from the University of Iowa/Mayo Clinic Lymphoma Specialized Program of Research Excellence (SPORE). Genotyping for the GWAS was performed using the Illumina 610 (LYSA) and 660 Quad (SPORE) BeadArrays (Illumina, San Diego, USA). HapMap2 was used to impute additional SNPs. Association testing was conducted assuming a log-additive genetic model and using Cox regression analyses. We calculated the hazard ratios (HRs) and 95% CIs for event-free survival (EFS), with adjustment for age, sex and age-adjusted IPI. The two studies were then combined using the fixed effects inverse variance method based on the beta estimates and standard errors from each study. From the meta-analysis, 89 SNPs were selected for validation, based on a minor allele frequency (MAF) >0.05 and LD pattern for multiple SNPs from the same region. Validation cohorts were an independent set of DLBCL patients from the SPORE (SPORE-2, N=366) and DLBCL patients from the prospective GOELAMS 075 phase III study (N=294), and validation genotyping was conducted using the Illumina VeraCode platform. We then conducted a meta-analysis of the 4 studies. We report SNPs with both a statistical significance of p<5x10-6 and effect sizes in the same direction across the 4 studies in the meta-analysis. For the latter SNPs, we also report the results for overall survival (OS), adjusted for age, sex and age-adjusted IPI. ResultsIn the final meta-analysis of all four studies for the prediction of EFS, the top SNP was rs7765004 (MAF=0.30; HR=1.45; p=9.7 x 10-8) in the 6q21 region, which is near the MARCKS (myristoylated alanine-rich protein kinase C substrate) and HDAC2 genes. MARCKS encodes for a protein involved in cell mobility, while HDAC2 belongs to the histone deacetylase family. Additional SNPs from this region included rs6918103 (MAF=0.25; HR=1.38; p=7.6 x 10-6), which marks an EZH2 histone binding site, and rs11969684 (MAF=0.12; HR=1.43; p=3.3 x 10-5). The next hit was rs7712513 (MAF=0.31; HR=1.40; p=3.2 x 10-7), which is near arginine-fifty homeobox pseudogene 1 (ARGFXP1) on chromosome 5q23.2. This was followed by a region on chromosome 8q13.3, with the SNP rs9298183 (MAF=0.29; HR=1.41; p=1.4 x 10-6) from LOC100132891 showing the strongest association, along with 3 other SNPs from this region (rs2035376, rs6472637, and rs1600797; all p<5 x 10-6). These SNPs are near the musculin gene (MSC) a transcriptional repressor of E2A frequently silenced by promoter methylation in DLBCL. Finally, rs2253986 (MAF=0.27; HR=1.39; p=3.4 x 10-6) at chromosome 16q12.2, along with rs2253971 and rs6499810, were associated with EFS. These SNPs are located in CES5A (carboxylesterase 5A), which encodes a member of the carboxylesterase family. The family members are responsible for the hydrolysis or transesterification of various xenobiotics. The top SNPs associated with EFS also predicted OS, including rs7765004 (HR=1.41; p=1.2 x 10-5), rs7712513 (HR=1.39; p=8.4 x 10-6), rs9298183 (HR=1.36; p=3.5 x 10-5), and rs2253986 (HR=1.35; p=5.2 x 10-5). ConclusionsAlthough our top SNPs on did not reach genome-wide significance (p<5x10-8), they showed clear consistency across the four large prospective studies and predicted both EFS and OS independently of IPI. The strongest findings were from loci on 6q21, 8q13.3, 5q23.2, and 16q12.2, and several genes from these regions are involved in methylation and xenobiotic metabolism, supporting a role for host genetic variation in these pathways in DLBCL prognosis. Disclosures:No relevant conflicts of interest to declare.