Abstract

AbstractWe critically assess mainstream accounting and finance research applying methods from computational linguistics (CL) to study financial discourse. We also review common themes and innovations in the literature and assess the incremental contributions of studies applying CL methods over manual content analysis. Key conclusions emerging from our analysis are: (a) accounting and finance research is behind the curve in terms of CL methods generally and word sense disambiguation in particular; (b) implementation issues mean the proposed benefits of CL are often less pronounced than proponents suggest; (c) structural issues limit practical relevance; and (d) CL methods and high quality manual analysis represent complementary approaches to analyzing financial discourse. We describe four CL tools that have yet to gain traction in mainstream AF research but which we believe offer promising ways to enhance the study of meaning in financial discourse. The four tools are named entity recognition (NER), summarization, semantics and corpus linguistics.

Highlights

  • Information is the lifeblood of financial markets and the amount of data available to decision-makers is increasing exponentially

  • Intrinsic evaluation involves understanding the performance of a specific processing tool. It can be distinguished from extrinsic evaluation which seeks to determine how well a specific processing tool is performing as part of an entire language technology system

  • We conclude that the main weakness of prior work is its continued reliance on simple bag-of-words content analysis methods that fail to reflect context and disambiguate meaning

Read more

Summary

Introduction

Information is the lifeblood of financial markets and the amount of data available to decision-makers is increasing exponentially. Bank of England (2015) estimates that 90% of global information has been created during the last decade, the vast majority of which is unstructured data (e.g., free-form text).. Dyer et al (2017) find a 113% increase in the median length of U.S registrants’ 10-K annual reports over the period 1996-2013 and Lewis and. Young (2018) report similar results for U.K. annual reports. The volume of unstructured financial data exceeds the capacity of humans to process the content manually. Users of financial market data are turning increasingly to computational linguistics to assist with the task of analyzing large volumes of unstructured data.. Users of financial market data are turning increasingly to computational linguistics to assist with the task of analyzing large volumes of unstructured data. Academic research in accounting and finance is mirroring this trend

Objectives
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.