Although the diagnostic criteria for follicular-pattern thyroid tumors are well established, diagnosing these lesions remains challenging in some cases. In the recent World Health Organization Classification of Endocrine and Neuroendocrine Tumors (5th edition), the invasive encapsulated follicular variant of papillary thyroid carcinoma was reclassified as its own entity. Because this variant shares morphological characteristics with low-risk follicular-pattern tumors, it is crucial to differentiate the two. Proteomics holds significant promise for detecting and quantifying protein biomarkers. We investigated the potential value of a protein biomarker panel defined by machine learning for identifying the invasive encapsulated follicular variant of papillary thyroid carcinoma, initially using formalin-fixed paraffin-embedded samples. We developed a supervised machine-learning model and tested its performance using proteomics data from 46 thyroid tissue samples. We applied a random forest classifier utilizing five protein biomarkers (ZEB1, NUP98, C2C2L, NPAP1, and KCNJ3). This classifier achieved areas under the curve (AUCs) of 1.00 and accuracy rates of 1.00 in training samples for distinguishing the invasive encapsulated follicular variant of papillary thyroid carcinoma from non-malignant samples. Additionally, we analyzed single-protein/gene receiver operating characteristic (ROC) performance in differentiating the invasive encapsulated follicular variant of papillary thyroid carcinoma from other samples within The Cancer Genome Atlas projects, which yielded an AUC > 0.5. We demonstrated that the integration of high-throughput proteomics with machine learning can effectively differentiate the invasive encapsulated follicular variant of papillary thyroid carcinoma from other follicular-pattern thyroid tumors.
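To make the two reported AUC figures easier to interpret, the following is a minimal sketch of how an AUC can be computed from classifier scores, using the rank-based definition (the probability that a randomly chosen positive sample scores higher than a randomly chosen negative one). The function name `roc_auc` and the toy labels and scores are hypothetical and are not taken from the study; this is not the authors' pipeline.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Return the AUC as the fraction of (positive, negative) pairs in which
    the positive sample has the higher score; ties count as half a win."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data (1 = variant of interest, 0 = other); not study data.
labels = [1, 1, 1, 0, 0, 0]
perfect_scores = [0.9, 0.8, 0.7, 0.3, 0.2, 0.1]   # every positive outranks every negative
weak_scores = [0.6, 0.4, 0.7, 0.5, 0.3, 0.65]     # only partial separation

auc_perfect = roc_auc(labels, perfect_scores)
auc_weak = roc_auc(labels, weak_scores)
print(f"perfect separation AUC: {auc_perfect:.2f}")
print(f"partial separation AUC: {auc_weak:.2f}")
```

Under this definition, an AUC of 1.00 means every positive sample is ranked above every negative one, whereas an AUC of 0.5 corresponds to chance-level ranking, so a value only slightly above 0.5 indicates weak discrimination.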