A Bayesian Sampling Method for Product Feature Extraction From Large-Scale Textual Data

Sunghoon Lim,Conrad S Tucker

doi:10.1115/1.4033238

Abstract

The authors of this work propose an algorithm that determines optimal search keyword combinations for querying online product data sources in order to minimize identification errors during the product feature extraction process. Data-driven product design methodologies based on acquiring and mining online product-feature-related data are presented with two fundamental challenges: (1) determining optimal search keywords that result in relevant product related data being returned and (2) determining how many search keywords are sufficient to minimize identification errors during the product feature extraction process. These challenges exist because online data, which is primarily textual in nature, may violate several statistical assumptions relating to the independence and identical distribution of samples relating to a query. Existing design methodologies have predetermined search terms that are used to acquire textual data online, which makes the resulting data acquired, a function of the quality of the search term(s) themselves. Furthermore, the lack of independence and identical distribution of text data from online sources impacts the quality of the acquired data. For example, a designer may search for a product feature using the term “screen,” which may return relevant results such as “the screen size is just perfect,” but may also contain irrelevant noise such as “researchers should really screen for this type of error.” A text mining algorithm is introduced to determine the optimal terms without labeled training data that would maximize the veracity of the data acquired to make a valid conclusion. A case study involving real-world smartphones is used to validate the proposed methodology.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Bayesian Sampling Method for Product Feature Extraction From Large-Scale Textual Data

Abstract

Talk to us

Similar Papers

More From: Journal of Mechanical Design

Lead the way for us

Journal: Journal of Mechanical Design	Publication Date: Apr 20, 2016
Citations: 27

Similar Papers

Implicit feature detection by ontology aided feature-based opinion summarization
Derviş Kanbur ... Mehmet S Aktaş
-
Derviş Kanbur, et. al.Derviş Kanbur ... Mehmet S Aktaş
01 Oct 2017
01 Oct 2017

Understanding what concerns consumers: a semantic approach to product feature extraction from consumer reviews
Chih-Ping Wei ... Chin-Sheng Yang
Information Systems and e-Business Management | VOL. 8
Chih-Ping Wei, et. al.Chih-Ping Wei ... Chin-Sheng Yang
16 Apr 2009
Information Systems and e-Business Management | VOL. 8

Product feature extraction from Chinese online reviews: application to product improvement
Lili Shi ... Guoquan Liu
RAIRO - Operations Research | VOL. 57
Lili Shi, et. al.Lili Shi ... Guoquan Liu
01 May 2023
RAIRO - Operations Research | VOL. 57

Product Feature Extraction via Topic Model and Synonym Recognition Approach
Jun Feng ... Xiaodong Li
-
Jun Feng, et. al.Jun Feng ... Xiaodong Li
01 Jan 2019
01 Jan 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Bayesian Sampling Method for Product Feature Extraction From Large-Scale Textual Data

Abstract

Talk to us

Similar Papers

More From: Journal of Mechanical Design