CAILIE 1.0: A dataset for Challenge of AI in Law - Information Extraction V1.0

Yu Cao, Yuanyuan Sun, Ce Xu, Chunnan Li, Jinming Du, Hongfei Lin

doi:10.1016/j.aiopen.2022.12.002

Open Access

https://doi.org/10.1016/j.aiopen.2022.12.002

Journal: AI Open
Publication Date: Jan 1, 2022
Citations: 1
License type: cc-by-nc-nd

Affiliation: Dalian University of Technology, University of Technology

Abstract

Legal information extraction requires identifying and classifying legal elements in specific legal documents. Because information extraction is generally regarded as the first step in natural language understanding, the quality of its results has a substantial impact on the performance of downstream legal artificial intelligence (AI) tasks. However, Chinese judicial information extraction datasets are scarce because of the particular nature of legal documents. In response, we constructed a dataset for Challenge of AI in Law - Information Extraction V1.0 (CAILIE 1.0). Two features of CAILIE are worth highlighting: 1) the entity definitions focus on fine-grained information in theft documents, providing more interpretability for downstream legal AI; and 2) we define entity labels with judicial attributes on top of natural attribute labels to meet the needs of Chinese judicial practice. We implement several classic models on this dataset. The experimental results show that legal information extraction remains challenging and that additional research is required to solve this task.
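
As a rough illustration of the second feature, the sketch below shows one way a label could pair a natural attribute with a judicial attribute and be converted to character-level BIO tags, a common encoding for Chinese sequence labeling. It is a minimal sketch under stated assumptions: the label names (PERSON, SUSPECT, ITEM, STOLEN_GOODS), the "/" separator, the Entity class, and the to_bio helper are all hypothetical and are not taken from the CAILIE 1.0 annotation scheme.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Entity:
    """A labeled span; all attribute names here are hypothetical."""
    start: int      # inclusive character offset
    end: int        # exclusive character offset
    natural: str    # assumed natural attribute, e.g. "PERSON"
    judicial: str   # assumed judicial attribute, e.g. "SUSPECT"

    @property
    def label(self) -> str:
        # Combine both attributes into one composite label.
        return f"{self.natural}/{self.judicial}"


def to_bio(text: str, entities: list[Entity]) -> list[str]:
    """Convert non-overlapping entity spans to character-level BIO tags."""
    tags = ["O"] * len(text)
    for ent in entities:
        tags[ent.start] = f"B-{ent.label}"
        for i in range(ent.start + 1, ent.end):
            tags[i] = f"I-{ent.label}"
    return tags


# Toy English sentence standing in for a theft-case document.
text = "Zhang stole a phone"
entities = [
    Entity(0, 5, "PERSON", "SUSPECT"),
    Entity(14, 19, "ITEM", "STOLEN_GOODS"),
]
tags = to_bio(text, entities)
```

Keeping the two attributes as separate fields means a model or an evaluation script could score the natural attribute alone or the full composite label, depending on what the downstream task needs.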

Similar Papers

Towards Constructing a Chinese Information Extraction System to Support Innovations in Library Services
Zhang Zhixiong ... Li Sa
IFLA Journal | VOL. 33
01 Dec 2007

Identifying Temporal Components in a Chinese Temporal Information System
Wenjie Li ... Kam-Fai Wong
International Journal of Computer Processing of Languages | VOL. 13
01 Jun 2000

A large-scale Chinese patent dataset for information extraction
Qian Zheng ... Lin Xu
Systems Science & Control Engineering | VOL. 12
31 Dec 2025

Research on Comprehensive Information Based Chinese Information Extraction System
Lei Li ... Jinghua Wang
30 Oct 2005
