Abstract

You have accessJournal of UrologyProstate Cancer: Advanced (including Drug Therapy) II1 Apr 2017PD37-06 A NEW ERA: AUTOMATED EXTRACTION OF DETAILED PROSTATE CANCER INFORMATION FROM NARRATIVELY WRITTEN HEALTH RECORDS. PIONEER WORK FROM A EUROPEAN TERTIARY CARE CENTER Sami-Ramzi Leyh-Bannurah, Zhe Tian, Pierre Karakiewicz, Dirk Pehrke, Hartwig Huland, Markus Graefen, and Lars Budäus Sami-Ramzi Leyh-BannurahSami-Ramzi Leyh-Bannurah More articles by this author , Zhe TianZhe Tian More articles by this author , Pierre KarakiewiczPierre Karakiewicz More articles by this author , Dirk PehrkeDirk Pehrke More articles by this author , Hartwig HulandHartwig Huland More articles by this author , Markus GraefenMarkus Graefen More articles by this author , and Lars BudäusLars Budäus More articles by this author View All Author Informationhttps://doi.org/10.1016/j.juro.2017.02.1568AboutPDF ToolsAdd to favoritesDownload CitationsTrack CitationsPermissionsReprints ShareFacebookTwitterLinked InEmail INTRODUCTION AND OBJECTIVES Detailed pathological information are necessary for follow-up analyses and new prediction tool development. Most institutional databases harbour a lack of either quantity or quality of such data. Expensive manpower needs to be manually invested for this purpose in daily clinical practice. Natural language (NLP) processing presents immense potential to automate the information-gathering process in the field of urology. To propose and validate a novel tool to extract specific detailed pathological information from written health records that contain continuous, narrative text in an automated and precise way. METHODS Overall, 1500 postoperative narrative pathology reports of patients undergoing radical prostatectomy in 2015 were analyzed. Of these, 750 reports were randomly selected as training data and the remaining 750 reports were used as validation data. Using domain knowledge from clinical experts and Stanford treebank parser for German, rule based extraction algorithms were created for pathological staging, Gleason percentages, prostate dimension and laterality of the tumor by iterative review of misclassified reports until there are no longer any misclassified reports in the training data. Each number found in the reports was also verified by clinical experts. RESULTS By applying the developed NLP system on the validation data, we assed the accuracy of each information extracted. The NLP derived accuracy for pT-stage, pN-stage, Gleason percentage, prostate weight, prostate volume and laterality of the tumour were 100%, 100%, 100%,100% and 97%, respectively. CONCLUSIONS We developed a novel NLP method that could extract detailed pathological information from a narrative, written pathological report with very high accuracy. This automated method can be implemented with the aim to greatly increase the efficiency and accessibility of research data in academic centers. © 2017FiguresReferencesRelatedDetails Volume 197Issue 4SApril 2017Page: e676 Advertisement Copyright & Permissions© 2017MetricsAuthor Information Sami-Ramzi Leyh-Bannurah More articles by this author Zhe Tian More articles by this author Pierre Karakiewicz More articles by this author Dirk Pehrke More articles by this author Hartwig Huland More articles by this author Markus Graefen More articles by this author Lars Budäus More articles by this author Expand All Advertisement Advertisement PDF downloadLoading ...

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call