TY - GEN
T1 - Process Duration Modelling and Concept Drift Detection for Business Process Mining
AU - Yang, Lingkai
AU - McClean, Sally
AU - Donnelly, Mark
AU - Burke, Kevin
AU - Khan, Kashaf
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - Customer behaviour within business processes can change over time, making it difficult for market understanding and decision making. Detecting such variations, also referred to as concept drift, can provide insight into the evolution of the business environment, offer opportunities for model refinement and provide target-oriented services to improve customer satisfaction. Concept drift in the control-flow perspective has been extensively studied but there is a research gap in detecting process duration drift. In this paper, we use gamma mixture models (GMMs) with an expectation-maximization (EM) algorithm to fit process durations and then detect variations in their histogram, density and cumulative distributions. Specifically, three metrics: the overall difference in back-to-back histograms, the Kullback-Leibler (KL) divergence and the maximum difference in cumulative distributions are used to evaluate how different the process durations are. Furthermore, three corresponding statistical tests: the multinomial test, log-likelihood ratio (LLR) test and Kolmogorov-Smirnov (KS) test are applied to determine whether, or not, the differences are statistically significant. The approach is applied to a public real-life hospital billing process where two concept drift occurrences are discovered. The main contribution of this paper is the approach aiming for detecting process duration changes.
AB - Customer behaviour within business processes can change over time, making it difficult for market understanding and decision making. Detecting such variations, also referred to as concept drift, can provide insight into the evolution of the business environment, offer opportunities for model refinement and provide target-oriented services to improve customer satisfaction. Concept drift in the control-flow perspective has been extensively studied but there is a research gap in detecting process duration drift. In this paper, we use gamma mixture models (GMMs) with an expectation-maximization (EM) algorithm to fit process durations and then detect variations in their histogram, density and cumulative distributions. Specifically, three metrics: the overall difference in back-to-back histograms, the Kullback-Leibler (KL) divergence and the maximum difference in cumulative distributions are used to evaluate how different the process durations are. Furthermore, three corresponding statistical tests: the multinomial test, log-likelihood ratio (LLR) test and Kolmogorov-Smirnov (KS) test are applied to determine whether, or not, the differences are statistically significant. The approach is applied to a public real-life hospital billing process where two concept drift occurrences are discovered. The main contribution of this paper is the approach aiming for detecting process duration changes.
KW - Business process
KW - Concept drift
KW - EM algorithm
KW - Gamma mixture model
KW - Process duration
UR - http://www.scopus.com/inward/record.url?scp=85123288658&partnerID=8YFLogxK
U2 - 10.1109/SWC50871.2021.00097
DO - 10.1109/SWC50871.2021.00097
M3 - Conference contribution
AN - SCOPUS:85123288658
T3 - Proceedings - 2021 IEEE SmartWorld, Ubiquitous Intelligence and Computing, Advanced and Trusted Computing, Scalable Computing and Communications, Internet of People, and Smart City Innovations, SmartWorld/ScalCom/UIC/ATC/IoP/SCI 2021
SP - 653
EP - 658
BT - Proceedings - 2021 IEEE SmartWorld, Ubiquitous Intelligence and Computing, Advanced and Trusted Computing, Scalable Computing and Communications, Internet of People, and Smart City Innovations, SmartWorld/ScalCom/UIC/ATC/IoP/SCI 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 IEEE SmartWorld, Ubiquitous Intelligence and Computing, Advanced and Trusted Computing, Scalable Computing and Communications, Internet of People, and Smart City Innovations, SmartWorld/ScalCom/UIC/ATC/IoP/SCI 2021
Y2 - 18 October 2021 through 21 October 2021
ER -