Vocabulary Based Information Retrieval: A Look at CSR Reports by European Listed Companies

Anastasiia Borisova,Paul Andre

doi:10.2139/ssrn.3811475

Vocabulary Based Information Retrieval: A Look at CSR Reports by European Listed Companies

Anastasiia Borisova, Paul Andre

https://doi.org/10.2139/ssrn.3811475

Copy DOI

Journal: SSRN

Publication Date: Mar 24, 2021

#Corporate Social Responsibility #Corporate Social Responsibility Reports + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

This paper offers a solution for the automatic retrieval of topic-specific discussions from .pdf documents. The suggested algorithm's main contribution consists of isolating topical discussions independently of the document's structure and disclosure location within the .pdf. We demonstrate this property by exploring corporate social responsibility (CSR) reporting that varies considerably across companies and countries. Our final successful extraction rate is calculated based on a randomly selected 50 annual reports where human readers identified CSR discussions. The final percentage of retrieval exceeds 90 %. Statistical validation of this approach also confirms capturing the underlying CSR construct by its high correlation with a CSR performance rating.

Full Text