Abstract

Customer Choice Modeling aims to model the decision-making process of customers, or segments of customers, through their choices and preferences identified by the analysis of their behaviors in one or more specific contexts. Clustering techniques are used in this context to identify patterns in their choices and preferences, to define segments of customers with similar behaviors, and to model how customers of different segments respond to competing products and offers. However, data clustering is an unsupervised learning task by nature, that is the grouping of customers with similar behaviors in clusters must be performed without prior knowledge about the nature and the number of intrinsic groups of data instances, i.e., customers, in the data space. Thus, the choice of both the clustering algorithm used and its parameterization, and of the evaluation method used to assess the relevance of the resulting clusters are central issues. Consensus clustering, or ensemble clustering, aims to solve these issues by combining the results of different clustering algorithms and parameterizations to generate a more robust and relevant final clustering result. We present a Multi-level Consensus Clustering approach combining the results of several clustering algorithmic configurations to generate a hierarchy of consensus clusters in which each cluster represents an agreement between different clustering results. A closed sets based approach is used to identified relevant agreements, and a graphical hierarchical representation of the consensus cluster construction process and their inclusion relationships is provided to the end-user. This approach was developed and experimented in travel industry context with Amadeus SAS. Experiments show how it can provide a better segmentation, and refine the customer segments by identifying relevant sub-segments represented as sub-clusters in the hierarchical representation, for Customer Choice Modeling. The clustering of travelers was able to distinguish relevant segments of customers with similar needs and desires (i.e., customers purchasing tickets according to different criteria, like price, duration of flight, lay-over time, etc.) and at different levels of precision, which is a major issue for improving the personalization of recommendations in flight search queries.

Highlights

  • This article is an extended version of article [1] published in the iCETiC’2020 international conference during which he received the best paper award

  • The Multi-level Consensus Clustering framework presented is extended here with the description of the algorithmic processes involved by the implementation of the framework in the general context of Customer Choice Modelling, considering both the context of unsupervised clustering, where no background information is used in the process, and semi-supervised clustering, where background knowledge can be introduced in the process to improve the relevance of the results

  • The techniques developed during this project first aim to solve central issues for the Customer Choice Modeling data clustering steps by providing a multi-level consensus clustering based solution that:

Read more

Summary

Introduction

This article is an extended version of article [1] published in the iCETiC’2020 international conference during which he received the best paper award. Customer segments are identified as clusters, i.e. groups with similar properties, of customers in the data space of travel search queries. The most frequently used measures are the Adjusted Rand Index (ARI) and the Normalized Mutual Information (NMI) that evaluates the relevance of the consensus clustering as its average similarity with all base clusterings in the ensemble [19,20,21,22] Such consensus clustering validation measures provide an efficient solution to identify and rank the best agreements among all the base clusterings regarding the possible different data distribution models, e.g., densitybased or centroid-based, in sub-spaces of the data space corresponding to clusters.

Multiple Consensus Clustering Framework
Multiple Consensus Clustering Approach
Traveler Choice Modelling Problem Decomposition
Identify traveler segments
Understand traveler choice patterns
Optimize bookings for each segment
Multi-Level Consensus Clustering Framework for Customer Choice Modelling
Multiple Consensus Clustering Process
Data Exploration and Preprocessing
Multi-level Ensemble Consensus Clustering
Clusters to Classes Learning
Technical and Scientific Challenges
C-1 C-2-1 C-2-2 C-3
Definition of Base Clustering Algorithmic Configurations
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call