Abstract

Although there are many clinical and biomedical language models for the English language, there is still a significant lack of models for Spanish. In this presentation, we introduce Clinical Flair, the first Spanish language model trained on real diagnoses from the Chilean public healthcare system. Taking the Named Entity Recognition task as a case study, we show that contextualized embeddings retrieved from our domain-specific language model outperform the results of the general domain model by a wide margin. In addition, we show that training a language model on real diagnoses may be more beneficial than training on electronic health records.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.