Large External Data Set Research Articles

The current histoclinical breast cancer classification is simple but imprecise. Several molecular classifications of breast cancers based on expression profiling have been proposed as alternatives. However, their reliability and clinical utility have been repeatedly questioned, notably because most of them were derived from relatively small initial patient populations. We analyzed the transcriptomes of 537 breast tumors using three unsupervised classification methods. A core subset of 355 tumors was assigned to six clusters by all three methods. These six subgroups overlapped with previously defined molecular classes of breast cancer, but also showed important differences, notably the absence of an ERBB2 subgroup and the division of the large luminal ER+ group into four subgroups, two of them being highly proliferative. Of the six subgroups, four were ER+/PR+/AR+, one was ER−/PR−/AR+ and one was triple negative (AR−/ER−/PR−). ERBB2-amplified tumors were split between the ER−/PR−/AR+ subgroup and the highly proliferative ER+ LumC subgroup. Importantly, each of these six molecular subgroups showed specific copy-number alterations. Gene expression changes were correlated to specific signaling pathways. Each of these six subgroups showed very significant differences in tumor grade, metastatic sites, relapse-free survival or response to chemotherapy. All these findings were validated on large external datasets including more than 3000 tumors. Our data thus indicate that these six molecular subgroups represent well-defined clinico-biological entities of breast cancer. Their identification should facilitate the detection of novel prognostic factors or therapeutical targets in breast cancer.

The University of California, San Francisco (UCSF) Cancer of the Prostate Risk Assessment (CAPRA) is a novel preoperative index which predicts the risk of biochemical recurrence after radical prostatectomy. The performance of the index is at least as good as the best available instruments based on clinical variables, and the 0 to 10 score is simple to calculate for both clinical and research purposes. This study used a large external dataset to validate CAPRA. Data were abstracted from the Shared Equal Access Regional Cancer Hospital (SEARCH) database, a registry of men who underwent radical prostatectomy at 4 Veterans Affairs and 1 active military medical center. Of 2096 men in the database, 1346 (64%) had full data available to calculate the CAPRA score. Performance of the CAPRA score was assessed with proportional hazards regression, survival analysis, and the concordance (c) index. Of the studied patients, 41% were non-Caucasian, and their mean age was 62 years. Twenty-six percent suffered recurrence; median follow-up among patients who did not recur was 34 months. The hazard ratio (HR) for each 1-point increase in CAPRA was 1.39 (95% CI [confidence interval], 1.31-1.46). The 5-year recurrence-free survival rate ranged from 86% for CAPRA 0-1 patients to 21% for CAPRA 7-10 patients. Increasing CAPRA scores were significantly associated with increasing risk of adverse pathologic outcomes. The c-index for CAPRA for the validation set was 0.68, compared with 0.66 for the original development set. The UCSF-CAPRA accurately predicted both biochemical and pathologic outcomes after radical prostatectomy among a large, diverse, cohort of men. These results validated the effectiveness of this powerful and straightforward instrument.

Large External Data Set Research Articles

Related Topics

Articles published on Large External Data Set

Analyzing Distillation Process of Hidden Terms in Web Documents for IR

A refined molecular taxonomy of breast cancer

A Hidden Topic-Based Framework toward Building Applications with Short Web Documents

External validation of QDSCORE® for predicting the 10‐year risk of developing Type 2 diabetes

Prediction of passive blood–brain partitioning: Straightforward and effective classification models based on in silico derived physicochemical descriptors

Waterlow score to predict patients at risk of developing Clostridium difficile-associated disease

Performance of multicomponent self-organizing regression (MCSOR) in QSAR, QSPR, and multivariate calibration: comparison with partial least-squares (PLS) and validation with large external data sets

Multiinstitutional validation of the UCSF cancer of the prostate risk assessment for prediction of recurrence after radical prostatectomy

Quantifying Peptide Signal in MALDI-TOF Mass Spectrometry Data

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Large External Data Set Research Articles

Related Topics

Articles published on Large External Data Set

Analyzing Distillation Process of Hidden Terms in Web Documents for IR

A refined molecular taxonomy of breast cancer

A Hidden Topic-Based Framework toward Building Applications with Short Web Documents

External validation of QDSCORE® for predicting the 10‐year risk of developing Type 2 diabetes

Prediction of passive blood–brain partitioning: Straightforward and effective classification models based on in silico derived physicochemical descriptors

Waterlow score to predict patients at risk of developing Clostridium difficile-associated disease

Performance of multicomponent self-organizing regression (MCSOR) in QSAR, QSPR, and multivariate calibration: comparison with partial least-squares (PLS) and validation with large external data sets

Multiinstitutional validation of the UCSF cancer of the prostate risk assessment for prediction of recurrence after radical prostatectomy

Quantifying Peptide Signal in MALDI-TOF Mass Spectrometry Data