Measuring Similarity among Legal Court Case Documents

Arpan Mandal, Arindam Pal, Raktim Chaki, Sarbajit Saha, Saptarshi Ghosh, Kripabandhu Ghosh

doi:10.1145/3140107.3140119

Abstract

Computing the similarity between two legal documents is an important challenge in the Legal Information Retrieval domain. Efficient calculation of this similarity has useful applications in tasks such as identifying relevant prior cases for a given case document. Prior works have proposed network-based and text-based methods for measuring similarity between legal documents, but these methods have limitations. Network-based measures are not always meaningfully applicable, since legal citation networks are usually very sparse. On the other hand, only primitive text-based similarity measures, such as TF-IDF based approaches, have been tried to date. In this work, we focus on improving text-based methodologies for computing the similarity between two legal documents. In addition to TF-IDF based measures, we use advanced similarity measures (such as topic modeling) and neural network models (such as word embeddings and document embeddings). We perform extensive experiments on a large dataset of Indian Supreme Court cases and compare various methodologies for measuring the textual similarity of legal documents. Our experiments show that embedding-based approaches perform better than other approaches. We also demonstrate that the proposed embedding-based methodologies significantly outperform a baseline hybrid methodology involving both network-based and text-based similarity.
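To make the TF-IDF baseline mentioned in the abstract concrete, the following is a minimal sketch of TF-IDF cosine similarity between documents. It is not the authors' implementation: the tokenizer, the smoothed IDF formula, the function names (`tokenize`, `tfidf_vectors`, `cosine`), and the three toy documents are all assumptions chosen for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split on non-alphanumeric characters."""
    return [tok for tok in re.split(r"[^a-z0-9]+", text.lower()) if tok]


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (a dict of term -> weight) per document."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)

    # Document frequency: number of documents containing each term.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    # Smoothed IDF (an assumed variant): log((1 + N) / (1 + df)) + 1.
    idf = {t: math.log((1 + n_docs) / (1 + c)) + 1.0 for t, c in df.items()}

    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)  # raw term counts
        vectors.append({t: count * idf[t] for t, count in tf.items()})
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical toy "case documents", not taken from the paper's dataset.
docs = [
    "The appellant challenged the land acquisition order before the court.",
    "The court upheld the land acquisition order against the appellant.",
    "The accused was convicted of murder under the penal code.",
]
vecs = tfidf_vectors(docs)
sim_01 = cosine(vecs[0], vecs[1])  # two land-acquisition cases
sim_02 = cosine(vecs[0], vecs[2])  # land acquisition vs. criminal case
```

In this toy setup the two land-acquisition documents score higher than the unrelated pair because they share several weighted terms. An embedding-based variant of the kind the abstract favors would replace `tfidf_vectors` with dense document vectors and keep the same cosine comparison.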

