Deep neural network for hierarchical extreme multi-label text classification

Francesco Gargiulo,Stefano Silvestri,Mario Ciampi,Giuseppe De Pietro

doi:10.1016/j.asoc.2019.03.041

Abstract

The classification of natural language texts has gained a growing importance in many real world applications due to its significant implications in relation to crucial tasks, such as Information Retrieval, Question Answering, Text Summarization, Natural Language Understanding. In this paper we present an analysis of a Deep Learning architecture devoted to text classification, considering the extreme multi-class and multi-label text classification problem, when a hierarchical label set is defined. The paper presents a methodology named Hierarchical Label Set Expansion (HLSE), used to regularize the data labels, and an analysis of the impact of different Word Embedding (WE) models that explicitly incorporate grammatical and syntactic features. We evaluate the aforementioned methodologies on the PubMed scientific articles collection, where a multi-class and multi-label text classification problem is defined with the Medical Subject Headings (MeSH) label set, a hierarchical set of 27,775 classes. The experimental assessment proves the usefulness of the proposed HLSE methodology and also provides some interesting results relating to the impact of different uses and combinations of WE models as input to the neural network in this kind of application.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Deep neural network for hierarchical extreme multi-label text classification

Abstract

Talk to us

Similar Papers

More From: Applied Soft Computing

Lead the way for us

Journal: Applied Soft Computing	Publication Date: Mar 29, 2019
Citations: 107

Similar Papers

A reproducible survey on word embeddings and ontology-based methods for word similarity: Linear combinations outperform the state of the art
Juan J Lastra-Díaz ... Eneko Agirre
Engineering Applications of Artificial Intelligence | VOL. 85
Juan J Lastra-Díaz, et. al.Juan J Lastra-Díaz ... Eneko Agirre
01 Aug 2019
Engineering Applications of Artificial Intelligence | VOL. 85

An empirical assessment of different word embedding and deep learning models for bug assignment
Rongcun Wang ... Rubing Huang
The Journal of Systems & Software | VOL. 210
Rongcun Wang, et. al.Rongcun Wang ... Rubing Huang
06 Jan 2024
The Journal of Systems & Software | VOL. 210

Boosting ICD multi-label classification of health records with contextual embeddings and label-granularity
Alberto Blanco ... Arantza Casillas
Computer Methods and Programs in Biomedicine | VOL. 188
Alberto Blanco, et. al.Alberto Blanco ... Arantza Casillas
10 Dec 2019
Computer Methods and Programs in Biomedicine | VOL. 188

Practical Significance of GA PartCC in Multi-Label Classification
Annapuna P Patil ... Mridul Tiwary
-
Annapuna P Patil, et. al.Annapuna P Patil ... Mridul Tiwary
01 Oct 2019
01 Oct 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep neural network for hierarchical extreme multi-label text classification

Abstract

Talk to us

Similar Papers

More From: Applied Soft Computing