Abstract

We introduce the Freedom of Information Archive (FOIArchive) Database, a collection of over 3 million documents about state diplomacy. Substantively, our database focuses on the USA and provides opportunities to analyze previously classified (or publicly unavailable) corpora of internal government documents, including the raw, and often full, text of those documents. We also provide within-country diplomatic records for the USA, UK, and Brazil. The data span 1620–2013, but most of it dates from the twentieth century. Our database allows scholars to view text and associated statistics online and to download and view customized datasets via an application programming interface. We provide extensive metadata about the documents, including the countries and persons they mention, their topics, and their classification levels. The metadata includes information we extracted with domain-specific, customized natural language processing tools. To demonstrate the potential of this data, we use it to design and validate a new index for “country importance” in the context of US foreign policy priorities.
