Abstract
This paper describes the use of Latent Dirichlet Allocation (LDA), or topic modeling, to explore the discursive makeup of the18th-century Encyclopedie of Denis Diderot and Jean le Rond d’Alembert (1751-1772). Expanding upon previous work modeling the Encyclopedie’s ontology, or classification scheme, we examine the abstractions used by its editors to visualize the various ‘systems’ of knowledge that the work proposes, considered here as heuristic tools for navigating the complex information space of the Encyclopedie. Using these earlier experiments with supervised machine learning models as a point of reference, we introduce the notion of topic modeling as a ‘discourse analysis tool’ for Enlightenment studies. In so doing, we draw upon the tradition of post-structuralist French discourse analysis, one of the first fields to embrace computational approaches to discursive text analysis. Our particular use of LDA is thus aimed primarily at uncovering inter-disciplinary ‘discourses’ in the Encyclopedie that run alongside, under, above, and through the original classifications. By mapping these discourses and discursive practices we can begin to move beyond the organizational (and physical) limitations of the print edition, suggesting several possible avenues of future research. These experiments thus attest once again to the enduring relevance of the Encyclopedie as an exemplary Enlightenment text. Its rich dialogical structure, whether studied using traditional methods of close reading or through the algorithmic processes described in this paper, is perhaps only now coming fully to light thanks to recent developments in digital resources and methods.
Highlights
GRAPHS, MAPS, AND TREESIn many ways, the eighteenth century French Encyclopédie, created under the direction of Denis Diderot and Jean le Rond d’Alembert between 1751 and 1772, seems to have almost been designed as a document classification exercise
Expanding upon previous work modeling the Encyclopédie’s ontology, or classification scheme, we examine the abstractions used by its editors to visualize the various “systems” of knowledge that the work proposes, considered here as heuristic tools for navigating the complex information space of the Encyclopédie
Using these earlier experiments with supervised machine-learning models as a point of reference, we introduce the notion of topic modeling as a “discourse analysis tool” for Enlightenment studies
Summary
The eighteenth century French Encyclopédie, created under the direction of Denis Diderot and Jean le Rond d’Alembert between 1751 and 1772, seems to have almost been designed as a document classification exercise. Its structure, spread over 17 in-folio volumes of text and another 11 volumes of engravings, comes complete with a branching, hierarchical ontology, or classification scheme. Of the almost 75,000 articles contained in the Encyclopédie, some 62,000 were classified by the editors according to this ontology, while, for a variety of editorial reasons, 13,000 or so were left with no explicit classification. All references to the text, articles, and classes of the Encyclopédie are drawn from the digital edition made available by the ARTFL Project at the
Published Version (
Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have