Distributional semantic analysis in a technical corpus

Ann Bertels

doi:10.1080/2050571x.2017.1368970

Distributional semantic analysis in a technical corpus

Ann Bertels

https://doi.org/10.1080/2050571x.2017.1368970

Copy DOI

Journal: Speech, Language and Hearing

Publication Date: Aug 23, 2017

Affiliation: KU Leuven

Abstract
Full-Text PDF
Similar Papers

Abstract

ABSTRACTThis paper presents the methodology of a distributional analysis for the exploration of semantic similarity in a small technical corpus. The aim is to show the semantic issues at stake in a series of statistical experiments. First-order co-occurrences of a technical node are clustered based on shared-order second and third-order co-occurrences and on the respective association strength. The results of the statistical clustering techniques show semantic similarities and dissimilarities between first-order co-occurrences by means of proximities and distances on 2D-plots and 3D-plots. By fine-tuning the statistical analysis and by using enriched distributional data, we aim at achieving more revealing semantic interpretations.

Full Text