Detecting process duration drift using gamma mixture models in a left-truncated and right-censored environment

Lingkai Yang, Sally McClean, Mark Donnelly, Kashaf Khan, Kevin Burke

Research output: Contribution to journal > Article > peer-review

Abstract

In a business context, process duration is the time customers spend between successive activities. This temporal perspective offers important insight into customer behaviour, highlights potential bottlenecks, and influences business management decisions. The distribution of these process durations often changes over time due to factors such as seasonality, emerging legislation, and changes to supply chains and customer demand. Referred to as concept drift, these variations pose challenges for robust process modelling, understanding, and refinement. Gamma mixture models are widely employed to model durations. The source data can, however, become left-truncated and right-censored within any specific observation window, thereby necessitating a (well-known) modification to the likelihood function. The approach reported in this paper applied this adapted likelihood across a series of observation windows, using the likelihood ratio test to identify duration changes, i.e. concept drift. Because of its flexibility in modelling any duration distribution, the gamma mixture model was used, with its likelihood for the left-truncated and right-censored data maximized by Nelder-Mead optimization. The number of gamma components was determined by the Bayesian information criterion. The proposed framework was validated on simulated exponential samples, leading to recommendations for its practical application. We then applied the methodology to three real-life event logs exhibiting diverse characteristics. Experimental results show the effectiveness of our approach in terms of data fitting, as compared to Kaplan-Meier curves, and in detecting instances of drift. This validation supports the practical utility and reliability of our framework for dynamic business scenarios.
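The pipeline summarised in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the parameterisation, the starting values, the simulator, the pooled-versus-separate form of the likelihood ratio test, and every function name (`unpack`, `ltrc_negloglik`, `fit`, `drift_test`, `simulate_ltrc`) are assumptions made for illustration. Each duration contributes f(t)/S(entry) if it completes inside the window and S(c)/S(entry) if it is censored at c, where entry is the time already elapsed when the window opens.

```python
import numpy as np
from scipy import optimize, stats


def unpack(theta, k):
    """Map an unconstrained vector of length 3k-1 to weights, shapes, scales."""
    logits = np.concatenate(([0.0], theta[: k - 1]))
    weights = np.exp(logits - logits.max())
    weights /= weights.sum()
    shapes = np.exp(np.clip(theta[k - 1 : 2 * k - 1], -10, 10))
    scales = np.exp(np.clip(theta[2 * k - 1 :], -10, 10))
    return weights, shapes, scales


def ltrc_negloglik(theta, k, entry, exit_, observed):
    """Negative log-likelihood of a k-component gamma mixture under LTRC."""
    w, a, s = unpack(theta, k)
    pdf = (w * stats.gamma.pdf(exit_[:, None], a, scale=s)).sum(axis=1)
    sf_exit = (w * stats.gamma.sf(exit_[:, None], a, scale=s)).sum(axis=1)
    sf_entry = (w * stats.gamma.sf(entry[:, None], a, scale=s)).sum(axis=1)
    tiny = 1e-300  # guards the logs against underflow to zero
    # Completed durations use the density, censored ones the survival function.
    contrib = np.where(observed, np.log(pdf + tiny), np.log(sf_exit + tiny))
    # Dividing by S(entry) conditions on surviving until the window opened.
    return -(contrib - np.log(sf_entry + tiny)).sum()


def fit(k, entry, exit_, observed, theta0=None):
    """Nelder-Mead fit; returns (theta, log-likelihood, BIC)."""
    if theta0 is None:
        # Assumed start: equal weights, unit shapes, spread-out scales.
        theta0 = np.concatenate(
            [np.zeros(2 * k - 1), np.log(np.mean(exit_) * np.arange(1, k + 1))]
        )
    res = optimize.minimize(
        ltrc_negloglik, theta0, args=(k, entry, exit_, observed),
        method="Nelder-Mead",
        options={"maxiter": 5000, "maxfev": 5000, "xatol": 1e-6, "fatol": 1e-6},
    )
    loglik = -res.fun
    bic = (3 * k - 1) * np.log(len(exit_)) - 2.0 * loglik
    return res.x, loglik, bic


def drift_test(win_a, win_b, k=1):
    """Likelihood ratio test: one shared model versus one model per window."""
    pooled = tuple(np.concatenate(pair) for pair in zip(win_a, win_b))
    theta_p, ll_p, _ = fit(k, *pooled)
    # Warm-starting from the pooled fit keeps the statistic non-negative.
    ll_a = fit(k, *win_a, theta0=theta_p)[1]
    ll_b = fit(k, *win_b, theta0=theta_p)[1]
    stat = max(2.0 * (ll_a + ll_b - ll_p), 0.0)
    return stat, stats.chi2.sf(stat, df=3 * k - 1)


def simulate_ltrc(rng, n, scale, window=10.0):
    """Hypothetical simulator: exponential durations seen through [0, window]."""
    start = rng.uniform(-window, window, n)
    dur = rng.exponential(scale, n)
    keep = start + dur > 0  # cases that ended before the window are never seen
    start, dur = start[keep], dur[keep]
    entry = np.maximum(-start, 0.0)  # left-truncation time
    limit = window - start           # right-censoring time
    return entry, np.minimum(dur, limit), dur <= limit


rng = np.random.default_rng(0)
win_a = simulate_ltrc(rng, 2000, scale=2.0)
win_b = simulate_ltrc(rng, 2000, scale=4.0)
theta_a, _, _ = fit(1, *win_a)
_, shape_hat, scale_hat = unpack(theta_a, 1)
stat, p_value = drift_test(win_a, win_b)
print(shape_hat, scale_hat, stat, p_value)
```

In this sketch the number of components would be chosen by calling `fit` for several values of `k` and keeping the one with the smallest BIC. The chi-square reference distribution with 3k-1 degrees of freedom is a standard large-sample approximation and is assumed here rather than taken from the paper.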
Original language: English
Journal: ACM Transactions on Knowledge Discovery from Data
Early online date: 12 Jun 2024
Publication status: Published (in print/issue) - 12 Jun 2024

Keywords

  • business process duration
  • concept drift detection
  • left-truncated and right-censored
  • gamma mixture models
  • likelihood ratio test
  • Nelder-Mead optimization
