Target Concept Guided Medical Concept Normalization in Noisy User-Generated Texts

Katikapalli Subramanyam Kalyan,Sivanesan Sangeetha

doi:10.18653/v1/2020.deelio-1.8

Abstract

Medical concept normalization (MCN) i.e., mapping of colloquial medical phrases to standard concepts is an essential step in analysis of medical social media text. The main drawback in existing state-of-the-art approach (Kalyan and Sangeetha, 2020b) is learning target concept vector representations from scratch which requires more number of training instances. Our model is based on RoBERTa and target concept embeddings. In our model, we integrate a) target concept information in the form of target concept vectors generated by encoding target concept descriptions using SRoBERTa, state-of-the-art RoBERTa based sentence embedding model and b) domain lexicon knowledge by enriching target concept vectors with synonym relationship knowledge using retrofitting algorithm. It is the first attempt in MCN to exploit both target concept information as well as domain lexicon knowledge in the form of retrofitted target concept vectors. Our model outperforms all the existing models with an accuracy improvement up to 1.36% on three standard datasets. Further, our model when trained only on mapping lexicon synonyms achieves up to 4.87% improvement in accuracy.

Highlights

Medical concept normalization (MCN) involves learning a model which can assign medical concept from a standard lexicon for the given health related mention
We deal with medical concept normalization in noisy usergenerated texts like tweets and online discussion forum posts
As social media text is highly noisy with irregular grammar and colloquial words, medical concept normalization in social media text is more challenging

Summary

Introduction

Medical concept normalization (MCN) involves learning a model which can assign medical concept from a standard lexicon for the given health related mention. We deal with medical concept normalization in noisy usergenerated texts like tweets and online discussion forum posts. With the rising popularity of social media platforms, common public are using these platforms to share information. In twitter people share their health experiences and in websites like AskAPatient.com, public post reviews for the drugs they consume. This valuable health information available in social media platforms can be exploited in applications like pharmacovigilance, public health monitoring etc (Kalyan and Sangeetha, 2020c). As social media text is highly noisy with irregular grammar and colloquial words, medical concept normalization in social media text is more challenging

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Target Concept Guided Medical Concept Normalization in Noisy User-Generated Texts

Abstract

Highlights

Summary

Talk to us

Similar Papers

Lead the way for us

Publication Date: Jan 1, 2020
Citations: 33	License type: cc-by

Similar Papers

Medical Concept Normalization in User-Generated Texts by Learning Target Concept Embeddings
Katikapalli Subramanyam Kalyan ... Sivanesan Sangeetha
-
Katikapalli Subramanyam Kalyan, et. al.Katikapalli Subramanyam Kalyan ... Sivanesan Sangeetha
01 Jan 2020
01 Jan 2020

A Practical Approach to Feature Selection
Kenji Kira ... Larry A Rendell
Machine Learning Proceedings 1992 | VOL. -
Kenji Kira, et. al.Kenji Kira ... Larry A Rendell
01 Jan 1992
Machine Learning Proceedings 1992 | VOL. -

Specialists, Scientists, and Sentiments: Word2Vec and Doc2Vec in Analysis of Scientific and Medical Texts.
Qufei Chen ... Marina Sokolova
SN Computer Science | VOL. 2
Qufei Chen, et. al.Qufei Chen ... Marina Sokolova
15 Aug 2021
SN Computer Science | VOL. 2

Discourse communities and their writing styles: A case study of Robert Boyle
Lilo Moessner ... Rwth Aachen
-
Lilo Moessner, et. al.Lilo Moessner ... Rwth Aachen
01 Jan 2013
01 Jan 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Target Concept Guided Medical Concept Normalization in Noisy User-Generated Texts

Abstract

Highlights

Summary

Talk to us

Similar Papers