Large Set Of Attributes Research Articles

Objectives: Feature selection is an important task that helps alleviate various machine learning and data mining issues. The main objective of a feature selection method is to build simpler and more understandable classifier models in order to improve data mining and processing performance. To that end, a comparative evaluation of the Chi-Square method, the Recursive Feature Elimination (RFE) method, and the tree-based method (using Random Forest), applied to three common machine learning methods (K-Nearest Neighbor, naïve Bayesian classifier, and decision tree classifier), is performed to select the most relevant features from a large set of attributes. The most suitable pair (i.e., feature selection method and machine learning method), the one that provides the best performance, is also determined.

Materials and methods: This paper first provides an overview of the most common feature selection techniques: the Chi-Square method, the Recursive Feature Elimination (RFE) method, and the tree-based method (using Random Forest). It then compares the improvement that these feature selection methods bring to three common machine learning methods (K-Nearest Neighbor, naïve Bayesian classifier, and decision tree classifier). For evaluation, micro-F1, accuracy, and root mean square error are measured on the stroke disease data set.

Results: The results show that the proposed approach (i.e., the tree-based method using Random Forest, TBM-RF, combined with the decision tree classifier, DTC) achieves an accuracy higher than 85% and an F1-score higher than 88%, outperforming KNN and NB combined with the Chi-Square, RFE, and TBM-RF methods.

Conclusion: This study shows that the pair formed by the tree-based method using Random Forest (TBM-RF) and the decision tree classifier successfully and efficiently helps find the most relevant features and predict and classify patients suffering from stroke disease.
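As a rough illustration of the Chi-Square method named in the abstract above, the following minimal sketch scores categorical features by their Pearson chi-square statistic against the class label and ranks them. The abstract does not describe the authors' implementation, so the function names (`chi_square_score`, `rank_features`), the feature names, and the toy data are all hypothetical.

```python
from collections import Counter


def chi_square_score(feature, labels):
    """Pearson chi-square statistic of one categorical feature vs. the class labels."""
    n = len(labels)
    observed = Counter(zip(feature, labels))  # contingency table cell counts
    feature_totals = Counter(feature)         # row totals
    label_totals = Counter(labels)            # column totals
    score = 0.0
    for f_value, f_count in feature_totals.items():
        for l_value, l_count in label_totals.items():
            # Expected count if feature and label were independent
            expected = f_count * l_count / n
            diff = observed[(f_value, l_value)] - expected
            score += diff * diff / expected
    return score


def rank_features(columns, labels):
    """Return (feature name, score) pairs sorted from most to least relevant."""
    scores = {name: chi_square_score(values, labels) for name, values in columns.items()}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical toy data: 8 patients, binary class label (1 = stroke).
labels = [1, 1, 1, 1, 0, 0, 0, 0]
columns = {
    "hypertension": [1, 1, 1, 0, 0, 0, 0, 1],  # associated with the label
    "smoker":       [1, 0, 1, 0, 1, 0, 1, 0],  # independent of the label
}

ranking = rank_features(columns, labels)
for name, score in ranking:
    print(f"{name}: chi2 = {score:.2f}")
```

A higher score indicates a stronger departure from independence between the feature and the class, so a selection step would keep the top-ranked features before training a classifier such as KNN, naïve Bayes, or a decision tree.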

Purpose: The purpose of this paper is to develop a comprehensive set of grocery store attributes that can be standardized and used in empirical research aiming at increasing retailers’ understanding of determinants of grocery store choice, and assessing how the relative importance of the attributes is affected by consumer socio-demographic characteristics and shopping behaviour.

Design/methodology/approach: An internet survey of 1,575 Swedish consumers was conducted. A large set of attributes was rated by the participants on seven-point scales with respect to their importance for choice of grocery store. Principal component analysis (PCA) resulted in a reduced set of reliably measured aggregated attributes. This set included the attractiveness attributes price level, supply range, supply quality, service quality, storescape quality, facilities for childcare, and closeness to other stores, and the accessibility attributes easy access by car, easy access by other travel modes, and availability (closeness to store and opening hours).

Findings: The results showed that accessibility by car is the most important grocery store attribute, storescape quality and availability the next most important, and facilities for childcare the least important. It was also found that socio-demographic factors and shopping behaviour have an impact on the importance of the store attributes.

Originality/value: A comprehensive set of attractiveness and accessibility attributes of grocery stores that can be standardized and used in empirical research is established. The results are valid for the Swedish-European conditions that differ from the conditions in North America where most previous research has been conducted. The results reveal the relative importance grocery-shopping consumers place on controllable attractiveness attributes compared to uncontrollable accessibility attributes as well as the relative importance of the attributes within each category.

Articles published on Large Set Of Attributes

Stroke Treatment Prediction Using Features Selection Methods and Machine Learning Classifiers

The q-Rung orthopair fuzzy Hamacher generalized Shapley Choquet integral operator and its application to multiattribute decision making

Aggressive driving behavior prediction considering driver’s intention based on multivariate-temporal feature data

Artificial Neuro Probit Regression Based Associative Frequent Pattern Mining for Heart Disease Prediction

Conditional Preference Networks for Cloud Service Selection and Ranking With Many Irrelevant Attributes

Tie-formation process within the communities of the Japanese production network: application of an exponential random graph model

Mini-Batch Normalized Mutual Information: A Hybrid Feature Selection Method

A hierarchical mixture modeling framework for population synthesis

Upgraded data envelopment analysis model application for total productivity comparison in major airports of the European Union

Scaled conjugate gradient back-propagation algorithm for selection of industrial robots

Importance ratings of grocery store attributes

Re-heat simulated annealing algorithm for rough set attribute reduction

Modeling households activity participation decisions in a rule-based system of travel demand
