German BERT Model for Legal Named Entity Recognition

Harshil Darji,Jelena Mitrović,Michael Granitzer

doi:10.5220/0011749400003393

Abstract

The use of BERT, one of the most popular language models, has led to improvements in many Natural Language Processing (NLP) tasks. One such task is Named Entity Recognition (NER) i.e. automatic identification of named entities such as location, person, organization, etc. from a given text. It is also an important base step for many NLP tasks such as information extraction and argumentation mining. Even though there is much research done on NER using BERT and other popular language models, the same is not explored in detail when it comes to Legal NLP or Legal Tech. Legal NLP applies various NLP techniques such as sentence similarity or NER specifically on legal data. There are only a handful of models for NER tasks using BERT language models, however, none of these are aimed at legal documents in German. In this paper, we fine-tune a popular BERT language model trained on German data (German BERT) on a Legal Entity Recognition (LER) dataset. To make sure our model is not overfitting, we performed a stratified 10-fold cross-validation. The results we achieve by fine-tuning German BERT on the LER dataset outperform the BiLSTM-CRF+ model used by the authors of the same LER dataset. Finally, we make the model openly available via HuggingFace.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

German BERT Model for Legal Named Entity Recognition

Abstract

Talk to us

Similar Papers

Lead the way for us

Publication Date: Jan 1, 2023
Citations: 5	License type: cc-by-nc-nd

Similar Papers

Automatic Extraction of Comprehensive Drug Safety Information from Adverse Drug Event Narratives in the Korea Adverse Event Reporting System Using Natural Language Processing Techniques.
Siun Kim ... Yesol Hong
Drug Safety | VOL. 46
Siun Kim, et. al.Siun Kim ... Yesol Hong
17 Jun 2023
Drug Safety | VOL. 46

Optimizing Small BERTs Trained for German NER
Jochen Zöllner ... Roger Labahn
Information | VOL. 12
Jochen Zöllner, et. al.Jochen Zöllner ... Roger Labahn
25 Oct 2021
Information | VOL. 12

Towards a Novel Weakly Supervised Joint Approach of Named Entity Recognition and Normalization for Noisy Text
Assia Mezhar ... Mohammed Ramdani
SSRN Electronic Journal | VOL. -
Assia Mezhar, et. al.Assia Mezhar ... Mohammed Ramdani
01 Jan 2018
SSRN Electronic Journal | VOL. -

GREEK-BERT: The Greeks visiting Sesame Street
John Koutsikakis ... Ilias Chalkidis
-
John Koutsikakis, et. al.John Koutsikakis ... Ilias Chalkidis
02 Sep 2020
02 Sep 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

German BERT Model for Legal Named Entity Recognition

Abstract

Talk to us

Similar Papers