Abstract

With the rapid developments of Internet technology, a mass of law cases is constantly occurring and needs to be dealt with in time. Automatic classification of law text is the most basic and critical process in the online law advice platform. Deep neural network-based natural language processing (DNN-NLP) is one of the most promising approaches to implement text classification. Meanwhile, as the convolutional neural network-based (CNN-based) methods developed, CNN-based text classification has already achieved impressive results. However, previous work applied amounts of manually-annotated data, which increased the labor cost and reduced the adaptability of the approach. Hence, we present a new semi-supervised model to solve the problem of data annotation. Our method learns the embedding of small text regions from unlabeled data and then integrates the learned embedding into the supervised training. More specifically, the learned embedding regions with the two-view-embedding model are used as an additional input to the CNN’s convolution layer. In addition, to implement the multi-task learning task, we propose the multi-label classification algorithm to assign multiple labels to an instance. The proposed method is evaluated experimentally subject to a law case description dataset and English standard dataset RCV1 . On Chinese data, the simulation results demonstrate that, compared with the existing methods such as linear SVM, our scheme respectively improves by 7.76%, 7.86%, 9.19%, and 2.96% the precision, recall, F-1, and Hamming loss. Analogously, the results suggest that compared to CNN, our scheme respectively improves by 4.46%, 5.76%, 5.14% and 0.87% in terms of precision, recall, F-1, and Hamming loss. It is worth mentioning that the robustness of this method makes it suitable and effective for automatic classification of law text. Furthermore, the design concept proposed is promising, which can be utilized in other real-world applications such as news classification and public opinion monitoring.

Highlights

  • With the progress of information technology and the rapid development of Web2.0 network technology, the incredible development of the Internet has changed people’s lives, entertainment, and consumption pattern greatly

  • The system can carry out online law advice, law case handling, and other services, which greatly reduces the workload of lawyers, while bringing convenience for people who lack law knowledge

  • We found that the network depth improved the model performance, that is the training loss rate of the model would decrease as the number of hidden layers increased within a certain range

Read more

Summary

Introduction

With the progress of information technology and the rapid development of Web2.0 network technology, the incredible development of the Internet has changed people’s lives, entertainment, and consumption pattern greatly. As more and more users are willing to carry out online law advice, law case handling, and other services, several kinds of law service patterns were shown in [1,2]. How to find useful information quickly and accurately from law case description becomes very important for law text classification [3,4]. An decision support online law consulting platform is. A user enters text into the system for data preprocessing. The processed data are input to the semi-supervised model for semantic analysis. The semi-supervised model automatically recognizes and assigns the predefined law label. The user gets the information that he/she needs on the consulting system. We note that the system greatly reduces the workload of lawyers and brings convenience for people who lack law knowledge

Objectives
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call