A comparative study of dimensionality reduction techniques to enhance trace clustering performances

M Song,H Yang,S.H Siadat,M Pechenizkiy

doi:10.1016/j.eswa.2012.12.078

Abstract

Process mining techniques have been used to analyze event logs from information systems in order to derive useful patterns. However, in the big data era, real-life event logs are huge, unstructured, and complex so that traditional process mining techniques have difficulties in the analysis of big logs. To reduce the complexity during the analysis, trace clustering can be used to group similar traces together and to mine more structured and simpler process models for each of the clusters locally. However, a high dimensionality of the feature space in which all the traces are presented poses different problems to trace clustering. In this paper, we study the effect of applying dimensionality reduction (preprocessing) techniques on the performance of trace clustering. In our experimental study we use three popular feature transformation techniques; singular value decomposition (SVD), random projection (RP), and principal components analysis (PCA), and the state-of-the art trace clustering in process mining. The experimental results on the dataset constructed from a real event log recorded from patient treatment processes in a Dutch hospital show that dimensionality reduction can improve trace clustering performance with respect to the computation time and average fitness of the mined local process models.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A comparative study of dimensionality reduction techniques to enhance trace clustering performances

Abstract

Talk to us

Similar Papers

More From: Expert Systems with Applications

Lead the way for us

Journal: Expert Systems with Applications	Publication Date: Jan 3, 2013
Citations: 66

Similar Papers

How Can Interactive Process Discovery Address Data Quality Issues in Real Business Settings? Evidence from a Case Study in Healthcare
Elisabetta Benevento ... Wil M.P Van Der Aalst
Journal of Biomedical Informatics | VOL. 130
Elisabetta Benevento, et. al.Elisabetta Benevento ... Wil M.P Van Der Aalst
30 Apr 2022
Journal of Biomedical Informatics | VOL. 130

Multi-Perspective Clustering of Process Execution Traces
...
-
, et. al. ...
05 Feb 2019
05 Feb 2019

Active Trace Clustering for Improved Process Discovery
Jochen De Weerdt ... Seppe Vanden Broucke
IEEE Transactions on Knowledge and Data Engineering | VOL. 25
Jochen De Weerdt, et. al.Jochen De Weerdt ... Seppe Vanden Broucke
01 Dec 2013
IEEE Transactions on Knowledge and Data Engineering | VOL. 25

Generalized Alignment-Based Trace Clustering of Process Behavior
Mathilde Boltenhagen ... Josep Carmona
-
Mathilde Boltenhagen, et. al.Mathilde Boltenhagen ... Josep Carmona
01 Jan 2019
01 Jan 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A comparative study of dimensionality reduction techniques to enhance trace clustering performances

Abstract

Talk to us

Similar Papers

More From: Expert Systems with Applications