Process Duration Modelling and Concept Drift Detection for Business Process Mining

Lingkai Yang, Sally McClean, Mark Donnelly, Kevin Burke, Kashaf Khan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Customer behaviour within business processes can change over time, making it difficult for market understanding and decision making. Detecting such variations, also referred to as concept drift, can provide insight into the evolution of the business environment, offer opportunities for model refinement and provide target-oriented services to improve customer satisfaction. Concept drift in the control-flow perspective has been extensively studied but there is a research gap in detecting process duration drift. In this paper, we use gamma mixture models (GMMs) with an expectation-maximization (EM) algorithm to fit process durations and then detect variations in their histogram, density and cumulative distributions. Specifically, three metrics: the overall difference in back-to-back histograms, the Kullback-Leibler (KL) divergence and the maximum difference in cumulative distributions are used to evaluate how different the process durations are. Furthermore, three corresponding statistical tests: the multinomial test, log-likelihood ratio (LLR) test and Kolmogorov-Smirnov (KS) test are applied to determine whether, or not, the differences are statistically significant. The approach is applied to a public real-life hospital billing process where two concept drift occurrences are discovered. The main contribution of this paper is the approach aiming for detecting process duration changes.

Original languageEnglish
Title of host publicationProceedings - 2021 IEEE SmartWorld, Ubiquitous Intelligence and Computing, Advanced and Trusted Computing, Scalable Computing and Communications, Internet of People, and Smart City Innovations, SmartWorld/ScalCom/UIC/ATC/IoP/SCI 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages653-658
Number of pages6
ISBN (Electronic)9781665412360
DOIs
Publication statusPublished - 2021
Event2021 IEEE SmartWorld, Ubiquitous Intelligence and Computing, Advanced and Trusted Computing, Scalable Computing and Communications, Internet of People, and Smart City Innovations, SmartWorld/ScalCom/UIC/ATC/IoP/SCI 2021 - Virtual, Online, United States
Duration: 18 Oct 202121 Oct 2021

Publication series

NameProceedings - 2021 IEEE SmartWorld, Ubiquitous Intelligence and Computing, Advanced and Trusted Computing, Scalable Computing and Communications, Internet of People, and Smart City Innovations, SmartWorld/ScalCom/UIC/ATC/IoP/SCI 2021

Conference

Conference2021 IEEE SmartWorld, Ubiquitous Intelligence and Computing, Advanced and Trusted Computing, Scalable Computing and Communications, Internet of People, and Smart City Innovations, SmartWorld/ScalCom/UIC/ATC/IoP/SCI 2021
Country/TerritoryUnited States
CityVirtual, Online
Period18/10/2121/10/21

Keywords

  • Business process
  • Concept drift
  • EM algorithm
  • Gamma mixture model
  • Process duration

Fingerprint

Dive into the research topics of 'Process Duration Modelling and Concept Drift Detection for Business Process Mining'. Together they form a unique fingerprint.

Cite this