{"id":326,"date":"2021-04-15T14:29:50","date_gmt":"2021-04-15T14:29:50","guid":{"rendered":"http:\/\/dataninja.nrw\/?page_id=326"},"modified":"2025-02-10T15:01:34","modified_gmt":"2025-02-10T15:01:34","slug":"ml4prom-machine-learning-and-drift-detection-methods-for-predictive-prevention-technologies-in-process-mining","status":"publish","type":"page","link":"https:\/\/dataninja.nrw\/?page_id=326","title":{"rendered":"ML4ProM: Machine Learning and Drift Detection Methods for Predictive Prevention Technologies in Process Mining"},"content":{"rendered":"\n<h3 class=\"wp-block-heading\">Goal<\/h3>\n\n\n\n<p>Process mining is an interdisciplinary field that aims to bridge the gap between data science and process science\u200e. \u200eProcess discovery\u200e, \u200econformance checking and process improvement are the three main types of process mining\u200e. In Process Mining we use knowledge extracted from event logs to uncover, monitor, and improve real business processes, allowing us to understand what is really going on in a business process rather than what we assume is going on. Event data is used to define and construct symbolic process models in various fields, including logistics, finance, hospital treatment, and industrial production. Process models may be learnt in this way, and they give a formal and interpretable description of the underlying events. In order to verify an existing model or uncover problematic sections of a model, process mining tools are frequently employed retroactively. The <em>ML4ProM<\/em> project intends to supplement such retrospective analysis with predictive Machine Learning technologies in order to comprehend the processes better and provide predictions to enrich the information provided by the process model. Furthermore, the project would also produce alternative solutions by discovering deviations from the model as early as possible. \u200eCurrently our goal in the Tandem project is defined to connect these two research fields\u200e, \u200efocusing on benefiting from the machine learning techniques to improve process mining tasks.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"724\" src=\"https:\/\/dataninja.nrw\/wp-content\/uploads\/2021\/04\/01_ML4ProM_A3_draft_vs3-1024x724.jpg\" alt=\"illustration_ml4prom\" class=\"wp-image-327\" srcset=\"https:\/\/dataninja.nrw\/wp-content\/uploads\/2021\/04\/01_ML4ProM_A3_draft_vs3-1024x724.jpg 1024w, https:\/\/dataninja.nrw\/wp-content\/uploads\/2021\/04\/01_ML4ProM_A3_draft_vs3-300x212.jpg 300w, https:\/\/dataninja.nrw\/wp-content\/uploads\/2021\/04\/01_ML4ProM_A3_draft_vs3-768x543.jpg 768w, https:\/\/dataninja.nrw\/wp-content\/uploads\/2021\/04\/01_ML4ProM_A3_draft_vs3-1536x1086.jpg 1536w, https:\/\/dataninja.nrw\/wp-content\/uploads\/2021\/04\/01_ML4ProM_A3_draft_vs3-2048x1448.jpg 2048w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><figcaption class=\"wp-element-caption\">Process Mining deals with discovering a process (lower left part), e.g. detecting how patients are treated and handled in a hospital. In ML4ProM Machine Learning will be used to detect deviations from that process through predictions and propose how to adapt to changes of reality. Illustration: Christoph J Kellner, Studio Animanova<\/figcaption><\/figure>\n\n\n\n<p>Process models are simple to understand and may be used to express predictions and actions available in the historical event data. But it fails to make predictions regarding the future events, processes, bottlenecks. In other words, the majority of process enhancement works focus on post-mortem analysis; that is, conformance approaches are reactive in the sense that they discover a violation after it has occured. Predictive monitoring approaches, on the other hand, are proactive in that they provide predictions before a violation occurs, therefore increasing the process performance and reducing risks. Hence, these kind of approaches are known as &#8216;forward looking forms of process mining&#8217; which has predictive capabilities with the help of AI models being used. Recurrent Neural Networks(RNNs) are the main AI models used in predictive monitoring which is able to process sequential event data. One of the main goals of the project is to build such AI models leveraging the state-of-the-art architectures to provide predictions. Another task is to learn drifts of essential sections of (sub-)processes in the process model to prevent emerging incompatibilities. In addition, it&#8217;s critical to retain explainability in the expansion of learnt components, such as employing relevance learning to identify relevant aspects in connection to a process.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-css-opacity is-style-wide\"\/>\n\n\n\n<h3 class=\"wp-block-heading\">Project Overview<\/h3>\n\n\n\n<p><strong>Machine Learning<\/strong><\/p>\n\n\n\n<p>Every human being in the planet is unique, and how we sense and act may differ from one another. Some people like to drink the tea with milk, some not. Some people like to work in the mornings, some prefer to work in the night. And when it comes to work, regardless of which industry, company, or department we work in, we do not always follow a prescribed path, policy, or work instruction. We have our own ways to do things, so to say, and this makes managing processes really challenging and unpredictable. Process Mining sheds a light on processes and generates a symbolic representation of them, which helps from the managing perspective. But it lacks the ability to &#8216;look forward&#8217;, in other words, it can not predict the future events\/actions. This is where Machine Learning(ML) comes into play with its predictive capabilities.<\/p>\n\n\n\n<p>The use of Artificial Intelligence (AI) in process mining leads to a new type of research, that is: &#8216;forward looking forms of process mining&#8217;, which has predictive capabilities that improve the process performance and reduce risks. But such AI models are usually black-box, meaning that models may not be interpretable. However, current advances on explainable AI (XAI) continue to shed a light on the model&#8217;s decisions. By using XAI methods as concerns the explanations of the model&#8217;s predictions we achieve interpretability as well as the predictive power.<\/p>\n\n\n\n<p><strong>Process Mining<\/strong><\/p>\n\n\n\n<p>Process discovery is a type of process mining that aims to discover a process model from an event log\u200e. \u200eMany discovery techniques are introduced to discover a comprehensive process model from an event log [1]\u200e. \u200eProcess models are represented using different formalizations such as directly follows graphs\u200e, \u200eBPMN models\u200e, \u200eand Petri nets [2]\u200e. \u200eSeveral quality dimensions are proposed to check the conformance of a discovered model\u200e, \u200esuch as fitness\u200e, \u200eprecision\u200e, \u200egeneralization\u200e, \u200eand simplicity [3]. \u200eAlthough these measurements provide insights into the quality of discovered models\u200e, \u200edeciding on an appropriate model depends on applications and expectations\u200e.<\/p>\n\n\n\n<p>\u200eThe selected research topic &#8220;Discovering Process Models That Support Desired Behavior and Avoid Undesired Behavior\u200e&#8221; \u200eis relatively new in the process mining field\u200e. \u200eThe main focus of the current research project is on the development of an algorithm that has a desirable event log and an undesirable event log as inputs and discovers a Petri net model that supports desirable behavior and avoids undesirable behavior of the process\u200e. \u200eThe problem was investigated from different aspects\u200e. \u200eAlthough there are some initial efforts to consider negative examples in process mining literature\u200e, \u200estill there is no comprehensive framework to do so\u200e.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-css-opacity is-style-wide\"\/>\n\n\n\n<h3 class=\"wp-block-heading\">Preliminary Results<\/h3>\n\n\n\n<p>As the first step\u200e, \u200ewe designed some evaluation metrics to measure the improvements\u200e. We developed our algorithm based on Inductive Miner as a well-known process discovery algorithm\u200e. \u200eOur algorithm is recursive and each recursion searches for a cut\u200e, \u200ethat has the minimum cost\u200e. \u200eEach cut divides activities into two non-overlapping partitions with an operator that could be sequence cut\u200e, \u200eexclusive choice cut\u200e, \u200eparallel cut\u200e, \u200eor loop cut\u200e. \u200eThen\u200e, \u200ethe event log is projected onto the partitions to create new sub-event logs\u200e. \u200eRecursively\u200e, \u200efor each sub-event log\u200e, \u200ethe algorithm searches for the cut with minimum cost\u200e. \u200eThe recursion continues until reaching base cases\u200e. \u200eThe algorithm generates a process tree that could easily be converted to a Petri net\u200e.<\/p>\n\n\n\n<p>In our project\u200e, \u200ewe discover white-box interpretable process models\u200e. \u200eIn the Tandem partner project\u200e, \u200edesirable and undesirable event logs are used to train machine learning models that classify the traces as desirable or undesirable\u200e. \u200eThe trained machine learning models classify the traces with high accuracy but do not describe the control flow of the process from beginning to end\u200e. \u200eAn interesting question for our future research is investigating the possibility of discovering white-box interpretable process models using machine learning techniques\u200e.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-css-opacity is-style-wide\"\/>\n\n\n\n<h3 class=\"wp-block-heading\">Cooperation<\/h3>\n\n\n\n<div class=\"wp-block-columns\">\n    <div class=\"wp-block-column contrib-container\" style=\"flex-basis:20%\">\n        <a href=\"https:\/\/www.rwth-aachen.de\/cms\/~a\/root\/?lidx=1\"><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/dataninja.nrw\/wp-content\/uploads\/2021\/04\/500px-RWTH_Logo_3.png\" alt=\"RWTH Aachen\" class=\"wp-image-197\" height=\"100%\"><\/a>\n    <\/div>\n    <div class=\"wp-block-column\" style=\"margin-right:0.5cm\"><\/div>\n    <div class=\"wp-block-column\" style=\"flex-basis:80%\">\n        <a href=\"https:\/\/www.pads.rwth-aachen.de\/cms\/~pnbx\/PADS\/?lidx=1\"><b><p class=\"contrib-card-label\">Process and Data Science Group<\/p><\/b><\/a>\n        <a href=\"https:\/\/www.pads.rwth-aachen.de\/cms\/PADS\/Die-Organisationseinheit\/Team\/Professor\/~pxtb\/Wil-van-der-Aalst\/\"><p class=\"contrib-card-label\">Prof. Wil van der Aalst<\/p><\/a>\n        <p class=\"contrib-card-label\">PhD student: <a href=\"https:\/\/www.pads.rwth-aachen.de\/cms\/PADS\/Der-Lehrstuhl\/Team\/Wissenschaftliche-Mitarbeiter\/~nvxrd\/Ali-Norouzifar\/lidx\/1\/\">Ali Norouzifar<\/a><\/p>\n    <\/div>\n<\/div>\n<div class=\"wp-block-columns\">\n    <div class=\"wp-block-column contrib-container\" style=\"flex-basis:20%;\">\n        <a href=\"https:\/\/uni-bielefeld.de\/\"><img decoding=\"async\" width=\"459\" height=\"110\" loading=\"lazy\" src=\"https:\/\/dataninja.nrw\/wp-content\/uploads\/2021\/04\/uni_bi_logo-3.png\" alt=\"\" class=\"wp-image-197\" srcset=\"https:\/\/dataninja.nrw\/wp-content\/uploads\/2021\/04\/uni_bi_logo-3.png 459w, https:\/\/dataninja.nrw\/wp-content\/uploads\/2021\/04\/uni_bi_logo-3-300x72.png 300w\" sizes=\"(max-width: 459px) 100vw, 459px\" \/><\/a>\n    <\/div>\n    <div class=\"wp-block-column\" style=\"margin-right:0.5cm\"><\/div>\n    <div class=\"wp-block-column\" style=\"flex-basis:80%\">\n        <a href=\"https:\/\/www.cit-ec.de\/en\/tcs\"><b><p class=\"contrib-card-label\">Machine Learning Group<\/p><\/b><\/a>\n        <a href=\"https:\/\/www.cit-ec.de\/en\/tcs\/barbara-hammer\"><p class=\"contrib-card-label\">Prof. Dr. Barbara Hammer<\/p><\/a>\n        <p class=\"contrib-card-label\">PhD student: <a href=\"https:\/\/rizavelioglu.github.io\/\">Riza Velioglu<\/a><\/p>\n    <\/div>\n<\/div>\n\n\n\n<hr class=\"wp-block-separator has-css-opacity is-style-wide\"\/>\n\n\n\n<h3 class=\"wp-block-heading\">References<\/h3>\n\n\n\n<p>[1]: Augusto, A., Conforti, R., Dumas, M., La Rosa, M., Maggi, F.M., Marrella, A., Mecella, M. and Soo, A., 2018. Automated discovery of process models from event logs: review and benchmark. IEEE transactions on knowledge and data engineering, 31(4), pp.686-705.<\/p>\n\n\n\n<p>[2]: van der Aalst, W.M., 2016. Process mining: data science in action. Springer.<\/p>\n\n\n\n<p>[3]: Buijs, J.C., van Dongen, B.F. and van der Aalst, W.M., 2014. Quality dimensions in process discovery: The importance of fitness, precision, generalization and simplicity. International Journal of Cooperative Information Systems, 23(01), p.1440001.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Project publications<\/h3>\n\n\n\n<ul>\n<li>Artelt, Andr\u00e9, Valerie Vaquet, Riza Velioglu, Fabian Hinder, Johannes Brinkrolf, Malte Schilling, and Barbara Hammer (2021). \u2018\u2018Evaluating robustness of counterfactual explanations\u2019\u2019. In:&nbsp;IEEE Symposium Series on Computational Intelligence (SSCI).<\/li>\n\n\n\n<li>Eichenberger, Christian, Moritz Neun, Henry Martin, Pedro Herruzo, Markus Spanring, Yichao Lu, Sungbin Choi, Vsevolod Konyakhin, Nina Lukashina, Aleksei Shpilman, et al. (2022). \u2018\u2018Traffic4cast at neurips 2021-temporal and spatial few-shot transfer learning in gridded geo-spatial processes\u2019\u2019. In:&nbsp;NeurIPS 2021 Competitions and Demonstrations Track.<\/li>\n\n\n\n<li>Norouzifar, Ali and Wil M. P. van der Aalst (2023). \u2018\u2018Discovering Process Models that Support Desired Behavior and Avoid Undesired Behavior\u2019\u2019. In:Proceedings of the 38th ACM\/SIGAPP Symposium on Applied Computing, SAC 2023, Tallinn, Estonia, March 27-31, 2023. Ed. by Jiman Hong, Maart Lanperne, Juw Won Park, Tom\u00e1s Cern\u00fd, and Hossain Shahriar. ACM, pp. 365\u2013368.&nbsp;doi:&nbsp;10.1145\/3555776.3577818.&nbsp;url:<a href=\"https:\/\/doi.org\/10.1145\/3555776.3577818\">https:\/\/doi.org\/10.1145\/3555776.3577818<\/a>.<\/li>\n\n\n\n<li>Norouzifar, Ali and Wil M. P. van der Aalst (2024). \u2018\u2018Leveraging Desirable and Undesirable Event Logs in Process Mining Tasks\u2019\u2019. In:&nbsp;DataNinja sAIOnARA 2024 Conference&nbsp;abs\/2408.17316, pp. 32\u201335.&nbsp;doi:&nbsp;10.11576\/dataninja-1164.&nbsp;url:&nbsp;<a href=\"https:\/\/doi.org\/10.11576\/dataninja-1164\">https:\/\/doi.org\/10.11576\/dataninja-1164<\/a>.<\/li>\n\n\n\n<li>Norouzifar, Ali, Marcus Dees, and Wil M. P. van der Aalst (2024). \u2018\u2018Imposing Rules in Process Discovery: An Inductive Mining Approach\u2019\u2019. In:&nbsp;Research Challenges in Information Science &#8211; 18th International Conference, RCIS 2024, Guimar\u00e3es, Portugal, May 14-17, 2024, Proceedings, Part I. Ed. by Jo\u00e3o Ara\u00fajo, Jose Luis de la Vara, Maribel Yasmina Santos, and Said Assar. Vol. 513. Lecture Notes in Business Information Processing. Springer, pp. 220\u2013236.&nbsp;doi:&nbsp;10.1007\/978-3-031-59465-6\\_14.&nbsp;url: <a href=\"https:\/\/doi.org\/10.1007\/978-3-031-59465-6%5C_14\">https:\/\/doi.org\/10.1007\/978-3-031-59465-6%5C_14<\/a>.<\/li>\n\n\n\n<li>Norouzifar, Ali, Humam Kourani, Marcus Dees, and Wil M. P. van der Aalst (2024). \u2018\u2018Bridging Domain Knowledge and Process Discovery Using Large Language Models\u2019\u2019. In: Proceedings of the 22nd International Conference on Business Process Management,&nbsp;<em>in print<\/em>.<\/li>\n\n\n\n<li>Norouzifar, Ali, Majid Rafiei, Marcus Dees, and Wil M. P. van der Aalst (2024a). \u2018\u2018An Framework for Explainable Process Variant Analysis on Continuous Features\u2019\u2019. <em>submitted<\/em>.<\/li>\n\n\n\n<li>Norouzifar, Ali, Majid Rafiei, Marcus Dees, and Wil M. P. van der Aalst (2024b). \u2018\u2018Process Variant Analysis Across Continuous Features: A Novel Framework\u2019\u2019. In:&nbsp;Enterprise, Business-Process and Information Systems Modeling &#8211; 25th International Conference, BPMDS 2024, and 29th International Conference, EMMSAD 2024, Limassol, Cyprus, June 3-4, 2024, Proceedings. Ed. by Han van der Aa, Dominik Bork, Rainer Schmidt, and Arnon Sturm. Vol. 511. Lecture Notes in Business Information Processing. Springer, pp. 129\u2013142.&nbsp;doi:&nbsp;10.1007\/978-3-031-61007-3\\_11.&nbsp;url:&nbsp;<a href=\"https:\/\/doi.org\/10.1007\/978-3-031-61007-3%5C_11.\">https:\/\/doi.org\/10.1007\/978-3-031-61007-3%5C_11.<\/a><\/li>\n\n\n\n<li>\u00d6zdemir, \u00d6zg\u00fcr, Emre Salih Ak\u0131n, R\u0131za Velio\u011flu, and Tu\u011fba Dalyan (2022). \u2018\u2018A comparative study of neural machine translation models for Turkish language\u2019\u2019. In:&nbsp;Journal of Intelligent &amp; Fuzzy Systems.<\/li>\n\n\n\n<li>Velioglu, Riza, Petra Bevandic, Robin Chan, and Barbara Hammer (2024). \u2018\u2018Clothify: Redefining Fashion E-Commerce Photography with Virtual Try-Off and Diffusion Models\u2019\u2019. <em>submitted to<\/em>:&nbsp;IEEE\/CVF international conference on computer vision.<\/li>\n\n\n\n<li>Velioglu, Riza, Robin Chan, and Barbara Hammer (2024). \u2018\u2018FashionFail: Addressing Failure Cases in Fashion Object Detection and Segmentation\u2019\u2019. In:&nbsp;International Joint Conference on Neural Networks (IJCNN). IEEE. <\/li>\n\n\n\n<li>Velioglu, Riza, Jan Philip G\u00f6pfert, Andr\u00e9 Artelt, and Barbara Hammer (2022). \u2018\u2018Explainable Artificial Intelligence for Improved Modeling of Processes\u2019\u2019. In: Intelligent Data Engineering and Automated Learning (IDEAL).<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Goal Process mining is an interdisciplinary field that aims to bridge the gap between data science and process science\u200e. \u200eProcess [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":327,"parent":119,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"ub_ctt_via":"","footnotes":""},"featured_image_src":"https:\/\/dataninja.nrw\/wp-content\/uploads\/2021\/04\/01_ML4ProM_A3_draft_vs3-scaled.jpg","_links":{"self":[{"href":"https:\/\/dataninja.nrw\/index.php?rest_route=\/wp\/v2\/pages\/326"}],"collection":[{"href":"https:\/\/dataninja.nrw\/index.php?rest_route=\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/dataninja.nrw\/index.php?rest_route=\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/dataninja.nrw\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/dataninja.nrw\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=326"}],"version-history":[{"count":9,"href":"https:\/\/dataninja.nrw\/index.php?rest_route=\/wp\/v2\/pages\/326\/revisions"}],"predecessor-version":[{"id":2766,"href":"https:\/\/dataninja.nrw\/index.php?rest_route=\/wp\/v2\/pages\/326\/revisions\/2766"}],"up":[{"embeddable":true,"href":"https:\/\/dataninja.nrw\/index.php?rest_route=\/wp\/v2\/pages\/119"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dataninja.nrw\/index.php?rest_route=\/wp\/v2\/media\/327"}],"wp:attachment":[{"href":"https:\/\/dataninja.nrw\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=326"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}