Breast cancer gene expression datasets do not reflect the disease at the population level

Yanping Xie,David A Cameron,Andrew H Sims,Jonine D Figueroa,Nicholas Moir,Brittny C Davis Lynn

doi:10.1038/s41523-020-00180-x

Abstract

Publicly available tumor gene expression datasets are widely reanalyzed, but it is unclear how representative they are of clinical populations. Estimations of molecular subtype classification and prognostic gene signatures were calculated for 16,130 patients from 70 breast cancer datasets. Collated patient demographics and clinical characteristics were sparse for many studies. Considerable variations were observed in dataset size, patient/tumor characteristics, and molecular composition. Results were compared with Surveillance, Epidemiology, and End Results Program (SEER) figures. The proportion of basal subtype tumors ranged from 4 to 59%. Date of diagnosis ranged from 1977 to 2013, originating from 20 countries across five continents although European ancestry dominated. Publicly available breast cancer gene expression datasets are a great resource, but caution is required as they tend to be enriched for high grade, ER-negative tumors from European-ancestry patients. These results emphasize the need to derive more representative and annotated molecular datasets from diverse populations.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: npj Breast Cancer	Publication Date: Aug 25, 2020
Citations: 6	License type: open-access

R Discovery Prime

R Discovery Prime

Breast cancer gene expression datasets do not reflect the disease at the population level

Abstract

Talk to us

Similar Papers

More From: npj Breast Cancer

Lead the way for us

Similar Papers

Molecular profiling of a real-world breast cancer cohort with genetically inferred ancestries reveals actionable tumor biology differences between European ancestry and African ancestry patient populations
Minoru Miyashita ... Frederick M Howard
Breast Cancer Research | VOL. 25
Minoru Miyashita, et. al.Minoru Miyashita ... Frederick M Howard
25 May 2023
Breast Cancer Research | VOL. 25

Abstract SS1-04: Comprehensive genomic and transcriptomic profiling of molecular subtypes reveal ancestral differences in the activity of signaling pathways between patients with African and European ancestry
Minoru Miyashita ... Jean Baptiste Reynier
Cancer Research | VOL. 81
Minoru Miyashita, et. al.Minoru Miyashita ... Jean Baptiste Reynier
15 Feb 2021
Cancer Research | VOL. 81

Abstract B123: Using eHealth and data science to dissect breast cancer heterogeneity in the Chicago Multi-Ethnic (ChiMEC) Breast Cancer Cohort
Dezheng Huo ... Kevin White
Cancer Epidemiology, Biomarkers & Prevention | VOL. 29
Dezheng Huo, et. al.Dezheng Huo ... Kevin White
01 Jun 2020
Cancer Epidemiology, Biomarkers & Prevention | VOL. 29

Abstract PO-150: Comparison of RUNX1, RUNX2, RUNX3 and CBFβ gene expression in breast tumors Indicate ethnic differences and similarities by receptor status
Uzoamaka A Okoli
Cancer Epidemiology, Biomarkers & Prevention | VOL. 31
Uzoamaka A OkoliUzoamaka A Okoli
01 Jan 2021
Cancer Epidemiology, Biomarkers & Prevention | VOL. 31

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Breast cancer gene expression datasets do not reflect the disease at the population level

Abstract

Talk to us

Similar Papers

More From: npj Breast Cancer