Macro-averaged F1-score Research Articles

Abstract: Anticancer therapy changes tumor physiology and genomics, making it a key variable in cancer studies. Although antineoplastics given at a single institution may be available in research-ready format, treatment at external institutions prior to receiving care at academic medical centers, common among patients at these centers, is often only described in free-text clinical notes, necessitating manual curation for downstream analysis. To overcome this bottleneck, we trained and validated natural language processing (NLP) models using initial consult notes to identify whether patients had received treatment at external institutions and studied the impact of these putative treatments on tumor genomics. Training data were derived from the AACR Project GENIE Biopharma Collaborative (BPC) for 2,663 patients at Memorial Sloan Kettering (MSK) across four cancer types. For each patient, we selected initial visits with medical and radiation oncologists based on an a priori note prioritization scheme and determined “ground-truth” prior external medications based on manually curated BPC administration records, whitelisting MSK-given medications. We trained logistic regression and clinical longformer models to identify external treatment receipt and evaluated model performance with 5-fold cross-validation. The clinical longformer model performed best across evaluation metrics, with an average area under the receiver operating characteristic curve of 0.972, macro-averaged precision/recall of 0.854/0.902 and macro-averaged F1 score of 0.876. Re-review of discrepant cases suggested that 75% of “false positives” may be due to curation error. We used our model to infer treatment status in a pan-cancer cohort with tumor genomic profiling using our institutional sequencing platform. Out of 48,447 patients, 11,900 were predicted to have received external treatment. Patients with putative external treatment had higher alteration frequencies in resistance-related genes than untreated patients and comparable to known pre-treated patients, including ESR1 in patients with breast cancer, AR in patients with prostate cancer, and EGFR T790M in patients with EGFR-mutated non-small cell lung cancer. Patients with putative external treatments, similar to known pre-treated patients, had shorter survival compared to treatment-naïve patients of the same cancer type. NLP can abstract external treatment status from clinical notes. When applied at scale, our model could help mitigate confounding variables and identify relationships between clinicogenomic variables and anticancer therapy.

Citation Format: Thinh N. Tran, Karl B. Pichotta, Si-Yang Liu, Christopher Fong, Anisha Luthra, Brooke Mastrogiacomo, Steven Maron, Deborah Schrag, Sohrab P. Shah, Pedram Razavi, Bob T. Li, Gregory J. Riely, Nikolaus Schultz, Justin Jee. Identification of anti-neoplastic therapy given before initial visit at a referral center using natural language processing applied to medical oncology initial consultation notes [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2023; Part 1 (Regular and Invited Abstracts); 2023 Apr 14-19; Orlando, FL. Philadelphia (PA): AACR; Cancer Res 2023;83(7_Suppl):Abstract nr 4259.
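The macro-averaged precision, recall and F1 score quoted in the abstract above can be illustrated with a minimal sketch. It assumes the common definition in which each metric is computed per class and the per-class values are then averaged with equal weight; the abstract does not state which definition was used. The function names and the toy labels are hypothetical and are not the study's data.

```python
from typing import Dict, Sequence, Tuple


def per_class_prf(
    y_true: Sequence[int], y_pred: Sequence[int]
) -> Dict[int, Tuple[float, float, float]]:
    """Return {label: (precision, recall, f1)} from one-vs-rest counts."""
    labels = sorted(set(y_true) | set(y_pred))
    pairs = list(zip(y_true, y_pred))
    scores = {}
    for label in labels:
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        scores[label] = (precision, recall, f1)
    return scores


def macro_average(
    scores: Dict[int, Tuple[float, float, float]]
) -> Tuple[float, float, float]:
    """Unweighted mean of each metric over classes (every class counts equally)."""
    n = len(scores)
    macro_p = sum(s[0] for s in scores.values()) / n
    macro_r = sum(s[1] for s in scores.values()) / n
    macro_f1 = sum(s[2] for s in scores.values()) / n
    return macro_p, macro_r, macro_f1


# Toy, made-up labels: 1 = external treatment, 0 = no external treatment.
y_true = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 1, 1, 0, 0, 0, 0, 0]

scores = per_class_prf(y_true, y_pred)
macro_p, macro_r, macro_f1 = macro_average(scores)
print(f"macro precision={macro_p:.3f} recall={macro_r:.3f} F1={macro_f1:.3f}")
# macro precision=0.800 recall=0.857 F1=0.792
```

Under this definition the macro F1 is the mean of the per-class F1 scores, so it generally differs from the harmonic mean of macro precision and macro recall (about 0.828 for the toy labels above).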

Detecting where an organ starts and where it ends is achievable and, since this information can be delivered in real time, it could be important for several reasons. First, by knowing when the Wireless Endoscopic Capsule (WEC) transitions through an organ's domain, we are able to align and control the endoscopic operation with any other possible protocol, e.g., delivering some form of treatment on the spot. Second, it provides greater anatomical topography information per session, allowing the individual to be treated in detail (not "in general"). Gathering more accurate information for a patient merely by implementing clever software procedures is itself a task worth pursuing, since the problems we have to overcome in real-time processing of the capsule findings (i.e., wireless transfer of images to another unit that will apply the necessary real-time computations) are still challenging. This study proposes a computer-aided detection (CAD) tool, a CNN algorithm deployed to run on a field-programmable gate array (FPGA), able to automatically track the capsule transitions through the entrance (gate) of the esophagus, stomach, small intestine and colon in real time. The input data are the wirelessly transmitted image shots of the capsule's camera (while the endoscopy capsule is operating). We developed and evaluated three distinct multiclass classification CNNs, trained on the same dataset of 5,520 images extracted from 99 capsule videos (1,380 frames from each organ of interest). The proposed CNNs differ in size and number of convolution filters. The confusion matrix is obtained by training each classifier and evaluating the trained model on an independent test dataset comprising 496 images extracted from 39 capsule videos, 124 from each GI organ. The test dataset was further evaluated by one endoscopist, and his findings were compared with the CNN-based results. The statistical significance of the predictions across the four classes of each model, and of the comparison between the three distinct models, is evaluated by calculating p-values with a chi-square test for multiple classes. The comparison between the three models is carried out by calculating the macro-averaged F1 score and the Matthews correlation coefficient (MCC). The quality of the best CNN model is estimated by calculating sensitivity and specificity. Our independent validation results demonstrate that the best of our developed models addressed this topological problem, achieving a sensitivity of 96.55% and a specificity of 94.73% in the esophagus, 81.08% sensitivity and 96.55% specificity in the stomach, 89.65% sensitivity and 97.89% specificity in the small intestine, and 100% sensitivity and 98.94% specificity in the colon. The macro-averaged accuracy is 95.56% and the macro-averaged sensitivity is 91.82%.
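The per-organ sensitivity and specificity, the macro-averaged sensitivity, and the MCC mentioned above can all be derived from a multiclass confusion matrix. The following is a minimal sketch under stated assumptions: rows are true classes, columns are predicted classes, each class is scored one-vs-rest, and the MCC uses the standard multiclass generalization. The function names are hypothetical and the 4x4 matrix is made up for illustration; it is not the study's confusion matrix.

```python
import math
from typing import List, Tuple


def one_vs_rest_rates(cm: List[List[int]]) -> List[Tuple[float, float]]:
    """Return (sensitivity, specificity) per class; rows = true, cols = predicted."""
    k = len(cm)
    total = sum(sum(row) for row in cm)
    rates = []
    for i in range(k):
        tp = cm[i][i]
        fn = sum(cm[i]) - tp                        # true class i, predicted other
        fp = sum(cm[r][i] for r in range(k)) - tp   # other class, predicted i
        tn = total - tp - fn - fp
        sensitivity = tp / (tp + fn) if tp + fn else 0.0
        specificity = tn / (tn + fp) if tn + fp else 0.0
        rates.append((sensitivity, specificity))
    return rates


def multiclass_mcc(cm: List[List[int]]) -> float:
    """Multiclass Matthews correlation coefficient from a confusion matrix."""
    k = len(cm)
    s = sum(sum(row) for row in cm)                 # total samples
    c = sum(cm[i][i] for i in range(k))             # correctly classified samples
    t = [sum(cm[i]) for i in range(k)]              # true count per class
    p = [sum(cm[r][i] for r in range(k)) for i in range(k)]  # predicted count per class
    numerator = c * s - sum(tk * pk for tk, pk in zip(t, p))
    denominator = math.sqrt(
        (s * s - sum(pk * pk for pk in p)) * (s * s - sum(tk * tk for tk in t))
    )
    return numerator / denominator if denominator else 0.0


# Made-up matrix; class order: esophagus, stomach, small intestine, colon.
cm = [
    [9, 1, 0, 0],
    [1, 8, 1, 0],
    [0, 1, 9, 0],
    [0, 0, 0, 10],
]

rates = one_vs_rest_rates(cm)
macro_sensitivity = sum(sens for sens, _ in rates) / len(rates)
mcc = multiclass_mcc(cm)
print(f"macro sensitivity={macro_sensitivity:.3f} MCC={mcc:.3f}")
# macro sensitivity=0.900 MCC=0.867
```

Because the toy classes are balanced, macro-averaged sensitivity here equals overall accuracy; with imbalanced classes the two diverge, which is one reason macro averages and MCC are often reported together.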

Articles published on Macro-averaged F1-score

Abstract 4259: Identification of anti-neoplastic therapy given before initial visit at a referral center using natural language processing applied to medical oncology initial consultation notes

Zero-shot stance detection via multi-perspective contrastive learning with unlabeled data

Transformer-based structuring of free-text radiology report databases

LexID: The Metadata and Semantic Knowledge Graph Construction of Indonesian Legal Document

Revealing the Boundaries of Selected Gastro-Intestinal (GI) Organs by Implementing CNNs in Endoscopic Capsule Images.

Combining molecular and cell painting image data for mechanism of action prediction

RECA: Related Tables Enhanced Column Semantic Type Annotation Framework

Spanish Corpora of tweets about COVID-19 vaccination for automatic stance detection

Automatic machine learning-based classification of mandibular third molar impaction status

Estimating building energy efficiency from street view imagery, aerial imagery, and land surface temperature data

Accuracy Analysis of the End-to-End Extraction of Related Named Entities from Russian Drug Review Texts by Modern Approaches Validated on English Biomedical Corpora

Automatic detection of spongiosis associated with oral lichenoid lesions using machine learning

VulEye: A Novel Graph Neural Network Vulnerability Detection Approach for PHP Application

Handling severe data imbalance in chest X-Ray image classification with transfer learning using SwAV self-supervised pre-training

Feasibility Study of the Prediction of Radiologist's Instructions with the Bi-LSTM Model Trained with Descriptions of MR Imaging Order-statement

Neutron-gamma events discrimination under complex circumstances using ResNet

Unsupervised anomaly detection in unbalanced time series data from screw driving processes using k-means clustering

A predictive decision support system for coronavirus disease 2019 response management and medical logistic planning.

DeepEmotionNet: Emotion mining for corporate performance analysis and prediction

FF-MR: A DoH-Encrypted DNS Covert Channel Detection Method Based on Feature Fusion
