Abstract

e19297 Background: Obtaining clinical outcomes for analysis has historically been a critical barrier to cancer genomics research. EHRs could constitute an important data source to bridge this gap, but EHRs rarely capture structured outcomes such as cancer progression. Novel, robust methods are needed to capture clinically relevant outcomes from EHRs. Methods: Among patients with lung adenocarcinoma whose tumors were sequenced via the Dana Farber Cancer Institute/Brigham and Women’s PROFILE study from 2013-2018, imaging reports following first palliative-intent systemic therapy were annotated using natural language processing (NLP) models trained to capture cancer progression according to the structured “PRISSMM” framework. NLP-based cancer progression and imaging report frequency were jointly modeled using inverse-intensity weighted generalized estimated equations, censored at six months, to explore associations between alterations in lung cancer biomarkers (ALK, EGFR, ROS1, BRAF, KRAS, SMARCA4) and progression. Among patients with KRAS mutations who received immunotherapy, we also analyzed the association between STK11 mutations and progression. The novel outcome generated by the model – imaging report-based progression (iPROG) – corresponded to the difference in the mean log odds of progression per inverse-intensity weighted report associated with a given biomarker; it was reported as adjusted mean probability and in exponentiated form as an odds ratio (OR). Results: Among 690 patients with lung adenocarcinoma, associations between tumor mutations and the iPROG outcome are listed in the Table. Conclusions: A deep NLP model applied to EHR data can capture a novel cancer progression outcome, which is associated with known prognostic markers in lung cancer. Application of this method to large “real world” datasets, with attention to interactions between treatment and genomics, could speed biomarker discovery. [Table: see text]

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call