Abstract

Conformance checking is a subarea of process mining that studies relations between designed processes, also called process models, and records of observed processes, also called event logs. In the last decade, research in conformance checking has proposed a plethora of techniques for characterizing the discrepancies between process models and event logs. Often, these techniques are also applied to measure the quality of process models automatically discovered from event logs. Recently, the process mining community has initiated a discussion on the desired properties of such measures. This discussion witnesses the lack of measures with the desired properties and the lack of properties intended for measures that support partially matching processes, i.e., processes that are not identical but differ in some steps. The paper at hand addresses these limitations. Firstly, it extends the recently introduced precision and recall conformance measures between process models and event logs that possess the desired property of monotonicity with the support of partially matching processes. Secondly, it introduces new intuitively desired properties of conformance measures that support partially matching processes and shows that our measures indeed possess them. The new measures have been implemented in a publicly available tool. The reported qualitative and quantitative evaluations based on our implementation demonstrate the feasibility of using the proposed measures in industrial settings.

Highlights

  • Process mining aims to discover, monitor, and improve processes observed in the real world using the knowledge accumulated in event logs produced by executions of designed processes implemented in modern information systems [1], where an event log is a collection of recorded traces each capturing an instance of an executed business process

  • We demonstrate that the extended measures possess intuitively desired properties for precision and recall in the presence of partially matching traces, viz. the more and the longer are the partial matches between the traces captured in an event log and process model, the greater are the precision and recall values between them

  • We experiment with a synthetic event log and a set of corresponding process models described in [14], [15]

Read more

Summary

INTRODUCTION

Process mining aims to discover, monitor, and improve processes observed in the real world using the knowledge accumulated in event logs produced by executions of designed processes implemented in modern information systems [1], where an event log is a collection of recorded traces each capturing an instance of an executed business process. The precision and recall values of zero represent a situation of no similarities between the traces of the event log and the model traces. The precision and recall values of one represent the situation when the event log and process model capture the same behavior. The more (in the number of traces) and the longer (in the number of matched process steps) are the partial matches between the traces captured in an event log and process model, the greater are the precision and recall values between them. Extends the entropy-based precision and recall measures proposed in [8] to support the partial matching between traces via comparisons of their sub-traces;.

MOTIVATING EXAMPLE
Finite Automata
ENTROPY-BASED CONFORMANCE CHECKING
Topological Entropy
Short-circuit Entropy
Precision and Recall
PARTIALLY MATCHING APPROACH
Entropy and τ -closure of Regular Languages
Synthetic Dataset
Real-life Event Data
CONCLUSION
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.