A comparative study of automated legal text classification using random forests and deep learning

Haihua Chen, Lei Wu, Jiangping Chen, Wei Lu, Junhua Ding

doi:10.1016/j.ipm.2021.102798

Open Access

https://doi.org/10.1016/j.ipm.2021.102798

Abstract

Automated legal text classification is a prominent research topic in the legal field and lays the foundation for building intelligent legal systems. Current literature focuses on international legal texts, such as Chinese, European, and Australian cases; little attention has been paid to text classification for U.S. legal texts. Deep learning has been applied to improve text classification performance, but its effectiveness needs further exploration in domains such as the legal field. This paper investigates legal text classification on a large collection of labeled U.S. case documents by comparing the effectiveness of different text classification techniques. We propose a machine learning algorithm that uses domain concepts as features and random forests as the classifier. Our experimental results on 30,000 full U.S. case documents in 50 categories demonstrate that our approach significantly outperforms a deep learning system built on multiple pre-trained word embeddings and deep neural networks. In addition, using only the top 400 domain concepts as features for building the random forests achieves the best performance. This study provides a reference for selecting machine learning techniques when building high-performance text classification systems in the legal domain or other fields.
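
The feature-construction idea in the abstract (keep only the top-ranked domain concepts and describe each document by those concepts) can be illustrated with a minimal sketch. The abstract does not say how concepts are extracted or ranked, so everything here is an assumption made for illustration: the candidate concept list, the toy documents, the ranking by document frequency, and the helper names `top_k_concepts` and `vectorize` are all hypothetical and are not the authors' method.

```python
from collections import Counter


def top_k_concepts(docs, concepts, k):
    """Rank candidate concepts by document frequency and keep the k most common.

    Assumption: document frequency is used as the ranking criterion purely
    for illustration; the abstract does not specify the ranking method.
    """
    doc_freq = Counter()
    for text in docs:
        lowered = text.lower()
        for concept in concepts:
            if concept in lowered:
                doc_freq[concept] += 1
    # Sort by descending document frequency, then alphabetically so ties
    # are broken deterministically.
    ranked = sorted(doc_freq, key=lambda c: (-doc_freq[c], c))
    return ranked[:k]


def vectorize(text, selected):
    """Represent a document as occurrence counts of the selected concepts."""
    lowered = text.lower()
    return [lowered.count(concept) for concept in selected]


# Toy data (hypothetical, not from the paper's corpus).
docs = [
    "The court found a breach of contract and awarded damages.",
    "Defendant appealed the breach of contract ruling.",
    "The patent infringement claim was dismissed.",
]
labels = ["contract", "contract", "patent"]  # one category label per document
concepts = ["breach of contract", "damages", "patent infringement", "negligence"]

selected = top_k_concepts(docs, concepts, k=2)
X = [vectorize(d, selected) for d in docs]
```

The resulting `X` and `labels` are in the shape a random forest implementation expects; with scikit-learn installed, for example, they could be passed to `RandomForestClassifier().fit(X, labels)`. In the study itself the cutoff reported as best is the top 400 concepts, rather than the `k=2` used for this toy data.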

Journal: Information Processing & Management
Publication Date: Nov 17, 2021
Citations: 80
License type: publisher-specific-oa
