Interobserver Variability Study Research Articles

Abstract

Introduction: Antibody-drug conjugates (ADCs) are designed to effectively deliver cytotoxic agents directly to malignant cells. Trastuzumab deruxtecan (T-DXd), an ADC composed of trastuzumab, an enzyme-cleavable linker, and a cytotoxic topoisomerase I inhibitor, has been shown to have antitumor activity in patients with breast cancer with low levels of HER2. However, the current companion diagnostic tests for HER2-targeting therapies, immunohistochemistry (IHC) and fluorescence in situ hybridization (FISH), were optimized for high (gene-amplified) levels of HER2. We hypothesize that the assays in common clinical use do not efficiently differentiate between patients whose cancers have “0” HER2 expression and those with “1+” HER2 expression, and thus could miss patients who would benefit from treatment with this drug.

Methods: We evaluated two years of HER2 surveys (2019 and 2020) from the College of American Pathologists’ (CAP) Proficiency Testing Surveys for HER2 expression in breast cancer. Participating laboratories received two tissue microarrays (TMAs) of 10 breast cancer cores each. Laboratories stained for HER2 using the standard IHC assay used in their labs. The scores were returned to the CAP as part of their quality assessment program. Each survey dataset covers the scores from around 1400 labs for 20 cores each, as well as supplemental questions regarding the methodology used. We summarized the relative frequency and distribution of the scores given to each core. A second, independent analytic dataset was selected from the archives of the Department of Pathology at Yale, from breast biopsies in 2018. The set, enriched in HER2 2+ and 3+ cases, was read by eighteen board-certified pathologists, most with over 5 years’ experience, participating in an interobserver variability study. Hematoxylin and eosin (H&E) and HER2 IHC digitally scanned images of 170 independent cases were provided. Pathologists scored the cases as 0, 1+, 2+, or 3+. Fisher’s exact test was used to compare the 0/1+ concordant cases to 2+/3+ cases. All tests were two-sided at a significance level of 0.05. Statistical analysis was performed using GraphPad Prism Version 9.0.1 and the dplyr package in R Version 1.0.143. This study was approved by Yale Human Investigation IRB protocol ID 9505008219.

Results: We found that 65% of the 80 cores evaluated in the CAP survey (52/80) had a concordance rate ≥90%. This high concordance was limited to scores of 0 and 3+. The lowest concordance was found between 0 and 1+. Of the 80 cores, 56 were considered negative (HER2 score of 0 or 1+). In 25% of those cores there was <70% concordance (n=15; 6 in 2019 and 9 in 2020). Analytic concordance was assessed in the independent Yale cohort, where we found that of the 170 cases, 92 were read as 0 by at least one pathologist. Of these 92, 24 were concordant (26%), defined as ≥90% agreement. In comparison, 45/170 were read as 3+ by at least one pathologist. Again using the ≥90% definition of concordance, 26 of 45 cases (58%) were concordant. Comparison of 0/1+ concordant cases versus 2+/3+ concordant cases showed a significant difference (χ² = 12.07, p<0.0005).

Conclusions: In an assessment of the laboratory performance of around 1400 CAP labs using common current HER2 assays on CAP survey specimens, there was significant discordance in the evaluation of 0 vs 1+ cases. In a separate selected breast biopsy cohort examined by 18 breast pathologists, we showed that discordance between scores of 0 vs 1+ is significantly larger than that between 2+ and 3+. Given the efficacy of T-DXd, we believe patients may be misassigned to treatment or no treatment if the decision depends on the performance of the current standard HER2 assays.

Citation Format: Aileen I Fernandez, Matthew Liu, Andrew Bellizzi, Jane Brock, Oluwole Fadare, Krisztina Hanley, Malini Harigopal, Julie M. Jorns, M. Gabriela Kuba, Amy Ly, Mirna Podoll, Kimmie Rabe, Mary Ann Sanders, Kamaljeet Singh, Olivia L Snir, Rinda Soong, Shi Wei, Hannah Wen, Serena Wong, Esther Yoon, Lajos Pusztai, Emily Reisenbichler, David L. Rimm. Examination of low Her2 expression in breast cancer [abstract]. In: Proceedings of the 2021 San Antonio Breast Cancer Symposium; 2021 Dec 7-10; San Antonio, TX. Philadelphia (PA): AACR; Cancer Res 2022;82(4 Suppl):Abstract nr P1-02-02.
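
The ≥90% concordance criterion used in the abstract above can be illustrated with a minimal Python sketch. The abstract does not spell out how agreement was computed per case, so the sketch assumes it is the share of readers who gave the most common score. The function names and the three example cases are hypothetical and are not study data. The last lines only re-derive the percentages reported in the abstract from the counts it states.

```python
from collections import Counter


def modal_agreement(scores):
    """Return (most common score, fraction of readers who gave it)."""
    counts = Counter(scores)
    score, n = counts.most_common(1)[0]
    return score, n / len(scores)


def is_concordant(scores, threshold=0.90):
    """Assumed definition: a case is concordant when at least `threshold`
    of the readers agree on the same HER2 score."""
    return modal_agreement(scores)[1] >= threshold


# Hypothetical reads from 18 pathologists (illustration only, not study data).
cases = {
    "case_A": ["0"] * 17 + ["1+"],      # 17/18 agree on 0
    "case_B": ["0"] * 10 + ["1+"] * 8,  # 10/18 agree on 0
    "case_C": ["3+"] * 18,              # unanimous 3+
}

for name, reads in cases.items():
    score, frac = modal_agreement(reads)
    print(f"{name}: modal score {score}, agreement {frac:.0%}, "
          f"concordant={is_concordant(reads)}")

# Percentages reported in the abstract, recomputed from its stated counts.
reported = {"CAP cores >=90%": (52, 80), "read as 0": (24, 92), "read as 3+": (26, 45)}
for label, (k, n) in reported.items():
    print(f"{label}: {k}/{n} = {k / n:.0%}")
```

Under this assumed definition, a single dissenting reader out of 18 still leaves a case concordant (17/18 is about 94%), while two dissenters (16/18, about 89%) do not.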

Personalized radiotherapy planning depends on high-quality delineation of target tumors and surrounding organs at risk (OARs). This process puts additional time burdens on oncologists and introduces variability among both experts and institutions. This study aimed to explore clinically acceptable autocontouring solutions that can be integrated into existing workflows and used in different domains of radiotherapy. This quality improvement study used a multicenter imaging data set comprising 519 pelvic and 242 head and neck computed tomography (CT) scans from 8 distinct clinical sites and patients diagnosed with either prostate or head and neck cancer. The scans were acquired as part of treatment dose planning from patients who received intensity-modulated radiation therapy between October 2013 and February 2020. Fifteen different OARs were manually annotated by expert readers and radiation oncologists. The models were trained on a subset of the data set to automatically delineate OARs and evaluated on both internal and external data sets. Data analysis was conducted October 2019 to September 2020. The autocontouring solution was evaluated on external data sets, and its accuracy was quantified with volumetric agreement and surface distance measures. Models were benchmarked against expert annotations in an interobserver variability (IOV) study. Clinical utility was evaluated by measuring time spent on manual corrections and annotations from scratch. A total of 519 participants' (519 [100%] men; 390 [75%] aged 62-75 years) pelvic CT images and 242 participants' (184 [76%] men; 194 [80%] aged 50-73 years) head and neck CT images were included. The models achieved levels of clinical accuracy within the bounds of expert IOV for 13 of 15 structures (eg, left femur, κ = 0.982; brainstem, κ = 0.806) and performed consistently well across both external and internal data sets (eg, mean [SD] Dice score for left femur, internal vs external data sets: 98.52% [0.50] vs 98.04% [1.02]; P = .04). 
The correction time of autogenerated contours on 10 head and neck and 10 prostate scans was measured as a mean of 4.98 (95% CI, 4.44-5.52) min/scan and 3.40 (95% CI, 1.60-5.20) min/scan, respectively, to ensure clinically accepted accuracy. Manual segmentation of the head and neck took a mean 86.75 (95% CI, 75.21-92.29) min/scan for an expert reader and 73.25 (95% CI, 68.68-77.82) min/scan for a radiation oncologist. The autogenerated contours represented a 93% reduction in time. In this study, the models achieved levels of clinical accuracy within expert IOV while reducing manual contouring time and performing consistently well across previously unseen heterogeneous data sets. With the availability of open-source libraries and reliable performance, this creates significant opportunities for the transformation of radiation treatment planning.
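
As a rough illustration of the volumetric agreement measure mentioned above, the following Python sketch computes a Dice score between two small sets of voxel coordinates, then recomputes the time saving from the mean times reported for head and neck scans. The voxel sets and function names are hypothetical. The abstract also does not state which manual baseline the 93% figure refers to, so both baselines are shown.

```python
def dice_score(a, b):
    """Dice coefficient between two voxel sets: 2*|A ∩ B| / (|A| + |B|)."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0  # convention assumed here: two empty contours agree fully
    return 2 * len(a & b) / (len(a) + len(b))


def time_reduction(corrected_min, manual_min):
    """Fraction of manual contouring time saved by correcting an autocontour."""
    return 1 - corrected_min / manual_min


# Hypothetical voxel coordinates (z, y, x) for an automatic and a manual contour.
auto_contour = {(0, 0, 0), (0, 0, 1), (0, 1, 0), (0, 1, 1)}
manual_contour = {(0, 0, 1), (0, 1, 0), (0, 1, 1), (1, 1, 1)}
print(f"Dice: {dice_score(auto_contour, manual_contour):.2f}")  # 3 shared voxels -> 0.75

# Mean times reported in the abstract for head and neck scans (min/scan).
correction = 4.98
expert_reader = 86.75
radiation_oncologist = 73.25
print(f"vs expert reader: {time_reduction(correction, expert_reader):.1%}")
print(f"vs radiation oncologist: {time_reduction(correction, radiation_oncologist):.1%}")
```

With these reported means, the saving is about 94% against the expert reader and about 93% against the radiation oncologist, so the reported 93% is consistent with the radiation oncologist baseline under this reading.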

Articles published on Interobserver Variability Study

Interobserver variability studies in diagnostic imaging: a methodological systematic review.

Reproducibility and repeatability of biventricular function/volume and strain parameters by 2D and 4D stress echocardiography in adult patients with repaired TOF

MRI tagging of colonic chyme mixing in healthy subjects: Inter-observer variability and reliability of the measurement with time.

Global uncertainty in the diagnosis of neurological complications of SARS-CoV-2 infection by both neurologists and non-neurologists: An international inter-observer variability study

Interobserver variation of clinical oncologists compared to therapeutic radiographers (RTT) prostate contours on T2 weighted MRI.

Evaluation of therapeutic radiographer contouring for magnetic resonance image guided online adaptive prostate radiotherapy

Automatic 3D MRI-Ultrasound Registration for Image Guided Arthroscopy

Abstract P1-02-02: Examination of low Her2 expression in breast cancer

A multicenter study of interobserver variability in pathologic diagnosis of papillary breast lesions on core needle biopsy with WHO classification

Feasibility and Performance of Elastin Trichrome as a Primary Stain in Colorectal Cancer Resection Specimens: Results of an Interobserver Variability Study.

An international study of interobserver variability of "string sign" of pancreatic cysts among experienced endosonographers.

Evaluation of Deep Learning to Augment Image-Guided Radiotherapy for Head and Neck and Prostate Cancers

S0126 An International Study of Interobserver Variability of “String Sign” of Pancreatic Cysts Among Experienced Endosonographers

Visual and quantitative evaluation of [18F]FES and [18F]FDHT PET in patients with metastatic breast cancer: an interobserver variability study

Geometrical and dosimetric evaluation of breast target volume auto-contouring.

EP1.09-03 Interobserver Variability Study of PD-L1 Immunostaining in Non-Small Cell Lung Cancer

Adaptation of a deprescription intervention to the medication management of older people living in long-term care facilities

Tumor budding is a prognostic factor linked to epithelial mesenchymal transition in pancreatic ductal adenocarcinoma. Study report and literature review

Goblet cell carcinoid of the appendix - An interobserver variability study using two proposed classification systems.

A pattern‐based risk‐stratification scheme for salivary gland cytology: A multi‐institutional, interobserver variability study to determine applicability
