BackgroundEarly menarche is an established risk factor for breast cancer but its molecular contribution to tumor biology and prognosis remains unclear.MethodsWe profiled transcriptome-wide gene expression in breast tumors (N = 846) and tumor-adjacent normal tissues (N = 666) from women in the Nurses’ Health Studies (NHS) to investigate whether early menarche (age < 12) is associated with tumor molecular and prognostic features in women with breast cancer. Multivariable linear regression and pathway analyses using competitive gene set enrichment analysis were conducted in both tumor and adjacent-normal tissue and externally validated in TCGA (N = 116). Subgroup analyses stratified on ER-status based on the tumor were also performed. PAM50 signatures were used for tumor molecular subtyping and to generate proliferation and risk of recurrence scores. We created a gene expression score using LASSO regression to capture early menarche based on 28 genes from FDR-significant pathways in breast tumor tissue in NHS and tested its association with 10-year disease-free survival in both NHS (N = 836) and METABRIC (N = 952).ResultsEarly menarche was significantly associated with 369 individual genes in adjacent-normal tissues implicated in extracellular matrix, cell adhesion, and invasion (FDR ≤ 0.1). Early menarche was associated with upregulation of cancer hallmark pathways (18 significant pathways in tumor, 23 in tumor-adjacent normal, FDR ≤ 0.1) related to proliferation (e.g. Myc, PI3K/AKT/mTOR, cell cycle), oxidative stress (e.g. oxidative phosphorylation, unfolded protein response), and inflammation (e.g. pro-inflammatory cytokines IFNα\\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{mathrsfs} \\usepackage{upgreek} \\setlength{\\oddsidemargin}{-69pt} \\begin{document}$$\\alpha$$\\end{document} and IFNγ\\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{mathrsfs} \\usepackage{upgreek} \\setlength{\\oddsidemargin}{-69pt} \\begin{document}$$\\gamma$$\\end{document}). Replication in TCGA confirmed these trends. Early menarche was associated with significantly higher PAM50 proliferation scores (β = 0.082 [0.02–0.14]), odds of aggressive molecular tumor subtypes (basal-like, OR = 1.84 [1.18–2.85] and HER2-enriched, OR = 2.32 [1.46–3.69]), and PAM50 risk of recurrence score (β = 4.81 [1.71–7.92]). Our NHS-derived early menarche gene expression signature was significantly associated with worse 10-year disease-free survival in METABRIC (N = 952, HR = 1.58 [1.10–2.25]).ConclusionsEarly menarche is associated with more aggressive molecular tumor characteristics and its gene expression signature within tumors is associated with worse 10-year disease-free survival among women with breast cancer. As the age of onset of menarche continues to decline, understanding its relationship to breast tumor characteristics and prognosis may lead to novel secondary prevention strategies.