NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

PMDG: Privacy for Multi-perspective Process Mining Through Data Generalization

https://doi.org/10.1007/978-3-031-34560-9_30

Hildebrant, Ryan; Fahrenkrog-Petersen, Stephan A.; Weidlich, Matthias; Ren, Shangping (June 2023, 35th International Conference on Advanced Information Systems Engineering (CAiSE) 2023)

Anonymization of event logs facilitates process mining while protecting sensitive information of process stakeholders. Existing techniques, however, focus on the privatization of the control-flow. Other process perspectives, such as roles, resources, and objects are neglected or subject to randomization, which breaks the dependencies between the perspectives. Hence, existing techniques are not suited for advanced process mining tasks, e.g., social network mining or predictive monitoring . To address this gap, we propose PMDG, a framework to ensure privacy for multi-perspective process mining through data generalization. It provides group-based privacy guarantees for an event log, while preserving the characteristic dependencies between the control-flow and further process perspectives. Unlike existing privatization techniques that rely on data suppression or noise insertion, PMDG adopts data generalization: a technique where the activities and attribute values referenced in events are generalized into more abstract ones, to obtain equivalence classes that are sufficiently large from a privacy point of view. We demonstrate empirically that PMDG outperforms state-of-the-art anonymization techniques, when mining handovers and predicting outcomes.
more » « less
Full Text Available
STEWART: STacking Ensemble for White-Box AdversaRial Attacks Towards more resilient data-driven predictive maintenance

https://doi.org/10.1016/j.compind.2022.103660

Gungor, Onat; Rosing, Tajana; Aksanli, Baris (September 2022, Computers in Industry)

Full Text Available
Process scenario discovery from event logs based on activity and timing information

https://doi.org/https://doi.org/10.1016/j.sysarc.2022.102435

Zhenyu, Zhang; Caleb, Johnson; Nalini, Venkatasubramanian; Shangping, Ren (April 2022, Journal of systems architecture)

Full Text Available
DOWELL: Diversity-Induced Optimally Weighted Ensemble Learner for Predictive Maintenance of Industrial Internet of Things Devices

https://doi.org/10.1109/JIOT.2021.3097269

Gungor, Onat; Rosing, Tajana S.; Aksanli, Baris (February 2022, IEEE Internet of Things Journal)

Full Text Available
Empirical Studies of Three Commonly Used Process Mining Algorithms

Wenyu, Peng; Zhenyu, Zhang; Ryan, Hildebrant; Shangping, Ren (January 2022, 2021 IEEE International Conference on Systems, Man, and Cybernetics (SMC))

Full Text Available
ENFES: ENsemble FEw-Shot Learning For Intelligent Fault Diagnosis with Limited Data

https://doi.org/10.1109/SENSORS47087.2021.9639633

Gungor, Onat; Rosing, Tajana; Aksanli, Baris (October 2021, 2021 IEEE Sensors)

Full Text Available
Improving Process Discovery Results by Filtering Out Outliers from Event Logs with Hidden Markov Models

https://doi.org/10.1109/CBI52690.2021.00028

Zhang, Zhenyu; Hildebrant, Ryan; Asgarinejad, Fatemeh; Venkatasubramanian, Nalini; Ren, Shangping (September 2021, 2021 IEEE 23rd Conference on Business Informatics (CBI))

Process Mining is a technique for extracting process models from event logs. Event logs contain abundant explicit information related to events, such as the timestamp and the actions that trigger the event. Much of the existing process mining research has focused on discovering the process models behind these event logs. However, Process Mining relies on the assumption that these event logs contain accurate representations of an ideal set of processes. These ideal sets of processes imply that the information contained within the log represents what is really happening in a given environment. However, many of these event logs might contain noisy, infrequent, missing, or false process information that is generally classified as outliers. Extending beyond process discovery, there are many research efforts towards cleaning the event logs to deal with these outliers. In this paper, we present an approach that uses hidden Markov models to filter out outliers from event logs prior to applying any process discovery algorithms. Our proposed filtering approach can detect outlier behavior, and consequently, help process discovery algorithms return models that better reflect the real processes within an organization. Furthermore, we show that this filtering method outperforms two commonly used filtering approaches, namely the Matrix Filter approach and the Anomaly Free Automation approach for both artificial event logs and real-life event logs.
more » « less
Full Text Available
OPELRUL: OPtimally Weighted Ensemble Learner for Remaining Useful Life Prediction

https://doi.org/10.1109/ICPHM51084.2021.9486535

Gungor, Onat; Rosing, Tajana S.; Aksanli, Baris (June 2021, 2021 IEEE International Conference on Prognostics and Health Management (ICPHM))

Full Text Available
Using Event Log Timing Information to Assist Process Scenario Discoveries

https://doi.org/10.1109/AIKE48582.2020.00017

Zhang, Zhenyu; Guo, Chunhui; Peng, Wenyu; Ren, Shangping (December 2020, 2020 IEEE Third International Conference on Artificial Intelligence and Knowledge Engineering (AIKE))

Event logs contain abundant information, such as activity names, time stamps, activity executors, etc. However, much of existing trace clustering research has been focused on applying activity names to assist process scenarios discovery. In addition, many existing trace clustering algorithms commonly used in the literature, such as k-means clustering approach, require prior knowledge about the number of process scenarios existed in the log, which sometimes are not known aprior. This paper presents a two-phase approach that obtains timing information from event logs and uses the information to assist process scenario discoveries without requiring any prior knowledge about process scenarios. We use five real-life event logs to compare the performance of the proposed two-phase approach for process scenario discoveries with the commonly used k-means clustering approach in terms of model’s harmonic mean of the weighted average fitness and precision, i.e., the F1 score. The experiment data shows that (1) the process scenario models obtained with the additional timing information have both higher fitness and precision scores than the models obtained without the timing information; (2) the two-phase approach not only removes the need for prior information related to k, but also results in a comparable F1 score compared to the optimal k-means approach with the optimal k obtained through exhaustive search.
more » « less
Full Text Available
Using Event Log Timing Information to Assist Process Scenario Discoveries

Zhenyu Zhang, Chunhui Guo (January 2020, The 3rd IEEE International Conference on Artificial Intelligence and Knowledge Engineering)
null (Ed.)
Full Text Available

Search for: All records