Why are these publications missing? Uncovering the reasons behind the exclusion of documents in free‐access scholarly databases

Lorena Delgado‐Quirós, Alberto Martín‐Martín, Isidro F Aguillo, Emilio Delgado López‐Cózar, Enrique Orduña‐Malea, José Luis Ortega

doi:10.1002/asi.24839

Abstract

This study analyses the coverage of seven free‐access bibliographic databases (Crossref, the non‐subscription version of Dimensions, Google Scholar, Lens, Microsoft Academic, Scilit, and Semantic Scholar) to identify the potential reasons for the exclusion of scholarly documents and how those reasons could influence coverage. To do this, 116 k randomly selected bibliographic records from Crossref were used as a baseline. API endpoints and web scraping were used to query each database. The results show that coverage differences are mainly caused by the way each service builds its database. While classic bibliographic databases ingest almost exactly the same content from Crossref (Lens and Scilit miss 0.1% and 0.2% of the records, respectively), academic search engines present lower coverage (Google Scholar does not find 9.8% of the records, Semantic Scholar 10%, and Microsoft Academic 12%). Coverage differences are mainly attributed to external factors, such as web accessibility and robot exclusion policies (39.2%–46%), and to internal requirements that exclude secondary content (6.5%–11.6%). In the case of Dimensions, the classic bibliographic database with the lowest coverage (7.6% of records missing), internal selection criteria such as the indexation of full books instead of book chapters (65%) and the exclusion of secondary content (15%) are the main reasons for missing publications.
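The missing-record percentages reported in the abstract compare a baseline sample against what each database returns. The following is a minimal sketch of that comparison, not the authors' code: the function names, the DOI-based matching, and the toy data are all assumptions made for illustration, and no real API is queried.

```python
def normalize_doi(doi: str) -> str:
    """Lowercase a DOI and strip a leading resolver URL or 'doi:' prefix.

    DOIs are case-insensitive, so lowercasing avoids false mismatches.
    """
    doi = doi.strip().lower()
    for prefix in ("https://doi.org/", "http://doi.org/", "doi:"):
        if doi.startswith(prefix):
            doi = doi[len(prefix):]
            break
    return doi


def missing_share(baseline_dois, found_dois) -> float:
    """Return the fraction of baseline records absent from a database's results."""
    baseline = {normalize_doi(d) for d in baseline_dois}
    found = {normalize_doi(d) for d in found_dois}
    if not baseline:
        raise ValueError("baseline sample is empty")
    return len(baseline - found) / len(baseline)


# Toy data (hypothetical): a four-record baseline and two made-up databases.
baseline = ["10.1000/a", "10.1000/b", "10.1000/c", "10.1000/d"]
results = {
    "database_x": ["10.1000/A", "https://doi.org/10.1000/b", "10.1000/c", "10.1000/d"],
    "database_y": ["10.1000/a", "doi:10.1000/b", "10.1000/c"],
}

shares = {name: missing_share(baseline, found) for name, found in results.items()}
for name, share in shares.items():
    print(f"{name}: {share:.1%} of baseline records missing")
```

Matching on normalized DOIs is only one possible design choice; a study that also relies on web scraping may need title or metadata matching where a service does not expose DOIs.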


Journal: Journal of the American Society for Information Science and Technology
Publication Date: Oct 31, 2023
License type: CC BY 4.0

Similar Papers

Ranking by Relevance and Citation Counts, a Comparative Study: Google Scholar, Microsoft Academic, WoS and Scopus
Rovira ... Guerrero-Solé
Future internet | VOL. 11
19 Sep 2019

Microsoft Academic (Search): a Phoenix arisen from the ashes?
Anne-Wil Harzing
Scientometrics | VOL. 108
15 Jun 2016

Microsoft Academic is one year old: the Phoenix is ready to leave the nest
Anne-Wil Harzing ... Satu Alakangas
Scientometrics | VOL. 112
26 Jun 2017

Two new kids on the block: How do Crossref and Dimensions compare with Google Scholar, Microsoft Academic, Scopus and the Web of Science?
Anne-Wil Harzing
Scientometrics | VOL. 120
08 May 2019
