Hierarchical Text Classification and Its Foundations: A Review of Current Research

Alessandro Zangari,Matteo Marcuzzo,Andrea Albarelli,Andrea Gasparetto,Matteo Rizzo,Lorenzo Giudice

doi:10.3390/electronics13071199

Abstract

While collections of documents are often annotated with hierarchically structured concepts, the benefits of these structures are rarely taken into account by classification techniques. Within this context, hierarchical text classification methods are devised to take advantage of the labels’ organization to boost classification performance. In this work, we aim to deliver an updated overview of the current research in this domain. We begin by defining the task and framing it within the broader text classification area, examining important shared concepts such as text representation. Then, we dive into details regarding the specific task, providing a high-level description of its traditional approaches. We then summarize recently proposed methods, highlighting their main contributions. We also provide statistics for the most commonly used datasets and describe the benefits of using evaluation metrics tailored to hierarchical settings. Finally, a selection of recent proposals is benchmarked against non-hierarchical baselines on five public domain-specific datasets. These datasets, along with our code, are made available for future research.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Hierarchical Text Classification and Its Foundations: A Review of Current Research

Abstract

Talk to us

Similar Papers

More From: Electronics

Lead the way for us

Journal: Electronics	Publication Date: Mar 25, 2024
License type: CC BY 4.0

Similar Papers

Hierarchical Text Classification Methods and Their Specification
Aixin Sun ... Wee-Keong Ng
-
Aixin Sun, et. al.Aixin Sun ... Wee-Keong Ng
01 Jan 2003
01 Jan 2003

Effective Seed-Guided Topic Labeling for Dataless Hierarchical Short Text Classification
Yi Yang ... Jiawen Zhang
-
Yi Yang, et. al.Yi Yang ... Jiawen Zhang
01 Jan 2020
01 Jan 2020

A Hierarchical Fine-Tuning Approach Based on Joint Embedding of Words and Parent Categories for Hierarchical Multi-label Text Classification
Yinglong Ma ... Beihong Jin
-
Yinglong Ma, et. al.Yinglong Ma ... Beihong Jin
01 Jan 2020
01 Jan 2020

F-HMTC: Detecting Financial Events for Investment Decisions Based on Neural Hierarchical Multi-Label Text Classification
Xin Liang ... Fangzhou Yang
-
Xin Liang, et. al.Xin Liang ... Fangzhou Yang
01 Jul 2020
01 Jul 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Hierarchical Text Classification and Its Foundations: A Review of Current Research

Abstract

Talk to us

Similar Papers

More From: Electronics