Skip to main navigation Skip to search Skip to main content

Automating mixture model fitting of task durations for process conformance checking

Research output: Contribution to journalArticlepeer-review

29 Downloads (Pure)

Abstract

Process task duration data often exhibit multiple peaks, indicating differences in, for example, customer ages and preferences, resource capabilities or the day/hour of a week. This heterogeneous data, which captures diverse customer patterns, should be represented using different models, resulting in an overall mixture model. This paper introduces gamma mixture models to represent various customer patterns in task duration data, with a focus on automating the fitting process. The approach involves a two-stage procedure: first, divide-and-conquer using peak-, equidistance- and cluster-based techniques to partition data, and automatically fit gamma distributions to each subset. The second stage then improves the fitted mixture model by directly searching the log-likelihood surface. The method is compared with the expectation–maximization (EM) algorithm and an open tool (HyperStar), using both artificially generated datasets and a publicly available hospital billing dataset, demonstrating its effectiveness and time efficiency in modelling heterogeneous process duration data. Furthermore, a case study on process conformance checking is conducted using the hospital billing dataset, highlighting a potential application area for the method in process mining.
Original languageEnglish
Article number53
Pages (from-to)1-35
Number of pages35
JournalData Mining and Knowledge Discovery
Volume39
Issue number5
Early online date21 Jul 2025
DOIs
Publication statusPublished online - 21 Jul 2025

Bibliographical note

Publisher Copyright:
© The Author(s) 2025.

Data Availability Statement

data that support the findings of this study have been deposited in the 4tu research data with the following URL, https://data.4tu.nl/articles/dataset/Hospital_Billing_-_Event_Log/12705113/1

Funding

This research is supported by BTIIC (the British Telecom Ireland Innovation Centre), funded by British Telecom and Invest Northern Ireland.

Funders
Invest Northern Ireland

    Keywords

    • Process duration modelling
    • Gamma mixture model
    • Process conformance checking
    • Process mining
    • Nelder-Mead optimisation
    • Divide-and-conquer fitting

    Fingerprint

    Dive into the research topics of 'Automating mixture model fitting of task durations for process conformance checking'. Together they form a unique fingerprint.

    Cite this