Chemosensory sensations are often hard to describe and quantify. Language models may facilitate a systematic understanding of sensory descriptions. We accessed consumer and expert reviews of wine, perfume, and food products (English language; about 68 million words in total) and analyzed their sensory descriptions. Using a novel data-driven method based on natural language data, we compared the three chemosensory vocabularies (wine, perfume, food) with respect to their vocabulary overlap and semantic properties, and explored their semantic spaces. The three vocabularies primarily differ with respect to domain specificity, concreteness, descriptor type preference and degree of gustatory vs. olfactory association. Wine vocabulary primarily distinguishes between white wine and red wine flavors and qualities. Food vocabulary separates drinkable and edible food products and ingredients, on the one hand, and savory and non-savory products, on the other. A salient distinction in all three vocabularies is between concrete and abstract/evaluative terms. Valence also plays a role in the semantic spaces of all three vocabularies, but valence is less prominent here than ingeneral olfactory vocabulary. Our method allows a systematic comparison of sensory descriptors in the three product domains and provides a data-driven approach to derive sensory lexicons that can be applied by sensory scientists.
Read full abstract