This study aimed to develop an ancestry-specific polygenic risk score (PRS) for predicting breast cancer events in Japanese females and to validate it in a longitudinal cohort study. Using publicly available summary statistics from genome-wide association studies (GWASs) of female breast cancer in Japanese and European ancestries, we developed 31 candidate genome-wide PRSs from each GWAS using the pruning and thresholding (P + T) and LDpred methods with varying parameters. Among the candidate PRS models, the best model was selected using a case-cohort dataset (63 breast cancer cases and 2213 sub-cohort members among Japanese females, with a median follow-up of 11.9 years) on the basis of the highest predictive ability as measured by Harrell's C-statistic. The best-performing PRS for each derivation GWAS was then evaluated in an independent case-cohort dataset (260 breast cancer cases and 7845 sub-cohort members among Japanese females, with a median follow-up of 16.9 years). For the best PRS model derived from the Japanese-ancestry GWAS, which involved 46,861 single nucleotide polymorphisms (SNPs; P + T method with PT = 0.05 and R² = 0.2), Harrell's C-statistic was 0.598 ± 0.018 in the evaluation dataset. The age-adjusted hazard ratio for breast cancer in females in the highest PRS quintile compared with those in the lowest PRS quintile was 2.47 (95% confidence interval, 1.64-3.70). The PRS constructed from the Japanese-ancestry GWAS showed better predictive performance for breast cancer in Japanese females than the PRS constructed from the European-ancestry GWAS (Harrell's C-statistic, 0.598 versus 0.586). This study developed a breast cancer PRS for Japanese females and demonstrated its usefulness for breast cancer risk stratification.
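As an illustration of the two quantities the abstract relies on, the following is a minimal sketch, not the study's pipeline. It assumes a PRS is computed as the effect-size-weighted sum of effect-allele dosages over SNPs that pass a P-value threshold, and it computes an unweighted Harrell's C-statistic by counting concordant pairs. The LD pruning step of P + T, the LDpred method, and the case-cohort weighting are all omitted. The function names and the toy data are hypothetical.

```python
import numpy as np


def threshold_snps(p_values, p_threshold=0.05):
    """Thresholding step only: keep SNPs whose GWAS P-value is below the cutoff.

    LD pruning (the "P" in P + T) is NOT performed here; this is an assumption
    made to keep the sketch short.
    """
    return np.asarray(p_values, dtype=float) < p_threshold


def polygenic_risk_score(dosages, betas, mask):
    """Weighted sum of effect-allele dosages (0, 1, or 2) over the selected SNPs."""
    dosages = np.asarray(dosages, dtype=float)
    betas = np.asarray(betas, dtype=float)
    return dosages[:, mask] @ betas[mask]


def harrell_c(time, event, score):
    """Unweighted Harrell's C-statistic.

    A pair (i, j) is comparable when subject i had an event and subject j was
    still under follow-up after that event time. The pair is concordant when
    the subject with the earlier event has the higher score; tied scores count
    as half. Ties in time are ignored in this sketch.
    """
    time = np.asarray(time, dtype=float)
    event = np.asarray(event, dtype=bool)
    score = np.asarray(score, dtype=float)
    concordant = 0.0
    comparable = 0
    for i in range(len(time)):
        if not event[i]:
            continue
        for j in range(len(time)):
            if time[j] > time[i]:
                comparable += 1
                if score[i] > score[j]:
                    concordant += 1.0
                elif score[i] == score[j]:
                    concordant += 0.5
    return concordant / comparable


# Hypothetical toy data: 4 individuals, 3 SNPs.
dosages = [[0, 1, 2], [2, 1, 0], [1, 1, 1], [2, 2, 0]]
betas = [0.2, -0.1, 0.3]        # per-allele log odds ratios from a GWAS
p_values = [0.01, 0.2, 0.04]    # GWAS P-values for the same SNPs

mask = threshold_snps(p_values, p_threshold=0.05)
scores = polygenic_risk_score(dosages, betas, mask)

follow_up_years = [2, 10, 5, 8]
had_event = [1, 0, 1, 0]
c_stat = harrell_c(follow_up_years, had_event, scores)
```

A C-statistic of 0.5 corresponds to no discrimination and 1.0 to perfect ranking, which is the scale on which the reported values of 0.598 and 0.586 should be read.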