Exploring named‐entity recognition techniques for academic books

Pablo Calleja Ibañez,Elea Giménez‐Toledo

doi:10.1002/leap.1610

Abstract

Recent advances in the natural language processing (NLP) field have achieved impressive results in various tasks. However, NLP techniques are underrepresented in the analysis of Humanities and Social Science texts and in languages other than English. In particular, academic books are a highly valuable source of information that has not been exploited by these techniques at all. The recognition of named entities (person names, organizations or locations) and their semantic annotation over books could enrich the visibility and discoverability of the information by users. This is an opportunity for academia and the academic publishing industry in which semantic search is a central task and now books can be queried by named entities of interest that are in their content. This work proposes a methodology to apply named‐entity recognition to publish the results into an ontological semantic‐web format. The work has been performed over a corpus of academic books provided by UNE (Unión de Editoriales Universitarias Españolas, Union of Spanish University Presses). Results show an enrichment of the information extracted over the books and of the possibilities of querying them at the individual level but also within the whole set of books, increasing the possibilities for books to be discovered or retrieved beyond metadata.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Exploring named‐entity recognition techniques for academic books

Abstract

Talk to us

Similar Papers

More From: Learned Publishing

Lead the way for us

Journal: Learned Publishing	Publication Date: May 17, 2024
License type: CC BY-NC 4.0

Similar Papers

Macao's academic book publishing industry: A SWOT and PEST analysis
Li Jiagui ... Johnny F I Lam
Learned Publishing | VOL. 37
Li Jiagui, et. al.Li Jiagui ... Johnny F I Lam
26 Feb 2024
Learned Publishing | VOL. 37

Analysis of Audio-Based News Classification Using Machine Learning Techniques
S Divya ... S Mohanavalli
-
S Divya, et. al.S Divya ... S Mohanavalli
01 Jan 2020
01 Jan 2020

Graph-Based Natural Language Processing and Information Retrieval Rada Mihalcea and Dragomir Radev (University of North Texas and University of Michigan) Cambridge, UK: Cambridge University Press, 2011, viii+192 pp; hardbound, ISBN 978-0-521-89613-9, $65.00
Chris Biemann
Computational Linguistics | VOL. 38
Chris BiemannChris Biemann
01 Mar 2012
Computational Linguistics | VOL. 38

Natural Language Processing Utilisation in Healthcare
S Vani ... Palvadi Srinivas Kumar
-
S Vani, et. al.S Vani ... Palvadi Srinivas Kumar
04 Feb 2022
04 Feb 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Exploring named‐entity recognition techniques for academic books

Abstract

Talk to us

Similar Papers

More From: Learned Publishing