Automated Extraction of Typical Expressions Describing Product Features from Customer Reviews

Karel Barák, František Dařena, Jan Žižka

doi:10.11118/ejobsat.v1i2.27


Open Access

https://doi.org/10.11118/ejobsat.v1i2.27


Journal: European Journal of Business Science and Technology	Publication Date: Dec 30, 2015
Citations: 33	License type: CC BY-SA 4.0

Abstract

The paper presents a procedure that helps reveal topics hidden in large collections of textual documents (such as customer reviews) related to a certain group of products or services. Together with the identification of the groups containing the topics, lists of important expressions are presented, which help in understanding what most typically characterizes these topics from the semantic point of view. The procedure includes determining an appropriate number of groups representing the prevailing topics, partitioning the documents into the desired number of groups using clustering, extracting significant typical features of the documents in each group by applying feature selection methods, and evaluating the outcomes with the assistance of a human expert. The results show that the presented approach, consisting mostly of automated steps, is able to separate and characterize the aspects of a certain product as discussed by customers, and can later be useful, e.g., for handling customer complaints, designing promotional campaigns, or improving the products.
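The clustering and feature selection steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which clustering algorithm or feature selection method was used, so the sketch assumes a simple k-means-style clustering on cosine similarity and a chi-square score computed on term presence. The toy reviews, the stop-word list, the fixed number of groups (k = 2), and all function names are hypothetical and chosen only for illustration. Determining the number of groups and the expert evaluation are left out.

```python
import math
from collections import Counter

# Hypothetical stop-word list and toy reviews, invented for illustration only.
STOPWORDS = {"and", "the", "was", "but"}
REVIEWS = [
    "battery lasts long and battery charges fast",
    "delivery was late and the delivery box was damaged",
    "great battery life but the battery charges slowly",
    "delivery arrived early and the delivery courier was polite",
    "the battery drains fast",
    "late delivery again",
]

def tokenize(text):
    return [w for w in text.lower().split() if w.isalpha() and w not in STOPWORDS]

def normalize(v):
    norm = math.sqrt(sum(x * x for x in v)) or 1.0
    return [x / norm for x in v]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def vectorize(docs):
    """Return the vocabulary and one unit-length term-count vector per document."""
    vocab = sorted({w for d in docs for w in tokenize(d)})
    index = {w: i for i, w in enumerate(vocab)}
    vectors = []
    for d in docs:
        v = [0.0] * len(vocab)
        for w, count in Counter(tokenize(d)).items():
            v[index[w]] = float(count)
        vectors.append(normalize(v))
    return vocab, vectors

def cluster(vectors, k, iterations=10):
    """k-means-style clustering on cosine similarity, deterministic start."""
    centroids = [vectors[0]]
    while len(centroids) < k:
        # Farthest-first: take the document least similar to the chosen centroids.
        centroids.append(min(vectors, key=lambda v: max(dot(v, c) for c in centroids)))
    labels = []
    for _ in range(iterations):
        labels = [max(range(k), key=lambda j: dot(v, centroids[j])) for v in vectors]
        for j in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == j]
            if members:
                centroids[j] = normalize([sum(col) / len(members) for col in zip(*members)])
    return labels

def typical_terms(docs, labels, target, top_n=3):
    """Rank terms positively associated with one cluster by a chi-square score."""
    n = len(docs)
    doc_terms = [set(tokenize(d)) for d in docs]
    in_cluster = sum(1 for lab in labels if lab == target)
    scores = {}
    for term in set().union(*doc_terms):
        a = sum(1 for t, lab in zip(doc_terms, labels) if term in t and lab == target)
        b = sum(1 for t, lab in zip(doc_terms, labels) if term in t and lab != target)
        c = in_cluster - a          # cluster documents without the term
        d = n - in_cluster - b      # other documents without the term
        denom = (a + b) * (c + d) * (a + c) * (b + d)
        if denom and a * d > b * c:
            scores[term] = n * (a * d - b * c) ** 2 / denom
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:top_n]

vocab, vectors = vectorize(REVIEWS)
labels = cluster(vectors, k=2)
for j in range(2):
    print(j, typical_terms(REVIEWS, labels, j))
```

On these toy reviews the two groups separate battery-related from delivery-related texts, and the highest-scoring expression in each group is "battery" and "delivery" respectively. On real review collections the number of groups would have to be determined first, and the resulting term lists would still need the human inspection the abstract mentions.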

Full Text