Formalizing the Recognition of Medical Domain Multiword Units

Kristina Kocijan, Krešimir Šojat

doi:10.1201/9781003138013-5

Abstract

This chapter discusses the problem of recognizing multiword units (MWUs) in medical domain texts written in Croatian. MWUs have been studied by many authors since before the Natural Language Processing era, which has only widened interest in MWUs across multiple dimensions and directions. Spasic et al. give an overview of rule-based approaches to different levels of analysis of medical-related texts, ranging from simple regular expressions to commercial healthcare-domain-oriented tools such as ClearForest, LEXIMER, and AeroText. Free-form medical texts are abundant in health care, yet they are almost impossible to obtain, even for research purposes. The creation of this lexicon is an ongoing project divided into several phases. In previous phases, the Croatian medical corpus was collected, and it is now being made available through the Sketch Engine interface as the documents are tagged with domain and subdomain markers.

Similar Papers

Recognizing Verb-Based Croatian Idiomatic MWUs
Kristina Kocijan ... Sara Librenjak
01 Jan 2015

Typographic enhancement of multiword units in second language text
Frank Boers ... Lin He
International Journal of Applied Linguistics | VOL. 27
25 Apr 2016

Typical Phraseological Units in Poetic Texts
Michael Pace-Sigge
01 Jan 2019

The quest for Croatian idioms as multiword units
Kristina Kocijan ... Sara Librenjak
12 Jul 2018
