Clone detection for business process models

Models are key in software engineering, especially with the rise of model-driven software engineering. One such use of modeling is in business process modeling, where models are used to represent processes in enterprises. As the number of these process models grow in repositories, it leads to an inc...

Full description

Bibliographic Details
Main Authors: Mahdi Saeedi Nikoo, Önder Babur, Mark van den Brand
Format: Article
Language:English
Published: PeerJ Inc. 2022-08-01
Series:PeerJ Computer Science
Subjects:
Online Access:https://peerj.com/articles/cs-1046.pdf
_version_ 1817996872739979264
author Mahdi Saeedi Nikoo
Önder Babur
Mark van den Brand
author_facet Mahdi Saeedi Nikoo
Önder Babur
Mark van den Brand
author_sort Mahdi Saeedi Nikoo
collection DOAJ
description Models are key in software engineering, especially with the rise of model-driven software engineering. One such use of modeling is in business process modeling, where models are used to represent processes in enterprises. As the number of these process models grow in repositories, it leads to an increasing management and maintenance cost. Clone detection is a means that may provide various benefits such as repository management, data prepossessing, filtering, refactoring, and process family detection. In model clone detection, highly similar model fragments are mined from larger model repositories. In this study, we have extended SAMOS (Statistical Analysis of Models) framework for clone detection of business process models. The framework has been developed to support different types of analytics on models, including clone detection. We present the underlying techniques utilized in the framework, as well as our approach in extending the framework. We perform three experimental evaluations to demonstrate the effectiveness of our approach. We first compare our tool against the Apromore toolset for a pairwise model similarity using a synthetic model mutation dataset. As indicated by the results, SAMOS seems to outperform Apromore in the coverage of the metrics in pairwise similarity of models. Later, we do a comparative analysis of the tools on model clone detection using a dataset derived from the SAP Reference Model Collection. In this case, the results show a better precision for Apromore, while a higher recall measure for SAMOS. Finally, we show the additional capabilities of our approach for different model scoping styles through another set of experimental evaluations.
first_indexed 2024-04-14T02:28:50Z
format Article
id doaj.art-5bf5b49b13464c15af48054200d93611
institution Directory Open Access Journal
issn 2376-5992
language English
last_indexed 2024-04-14T02:28:50Z
publishDate 2022-08-01
publisher PeerJ Inc.
record_format Article
series PeerJ Computer Science
spelling doaj.art-5bf5b49b13464c15af48054200d936112022-12-22T02:17:45ZengPeerJ Inc.PeerJ Computer Science2376-59922022-08-018e104610.7717/peerj-cs.1046Clone detection for business process modelsMahdi Saeedi Nikoo0Önder Babur1Mark van den Brand2Department of Mathematics and Computer Science, Eindhoven University of Technology, Eindhoven, The NetherlandsDepartment of Mathematics and Computer Science, Eindhoven University of Technology, Eindhoven, The NetherlandsDepartment of Mathematics and Computer Science, Eindhoven University of Technology, Eindhoven, The NetherlandsModels are key in software engineering, especially with the rise of model-driven software engineering. One such use of modeling is in business process modeling, where models are used to represent processes in enterprises. As the number of these process models grow in repositories, it leads to an increasing management and maintenance cost. Clone detection is a means that may provide various benefits such as repository management, data prepossessing, filtering, refactoring, and process family detection. In model clone detection, highly similar model fragments are mined from larger model repositories. In this study, we have extended SAMOS (Statistical Analysis of Models) framework for clone detection of business process models. The framework has been developed to support different types of analytics on models, including clone detection. We present the underlying techniques utilized in the framework, as well as our approach in extending the framework. We perform three experimental evaluations to demonstrate the effectiveness of our approach. We first compare our tool against the Apromore toolset for a pairwise model similarity using a synthetic model mutation dataset. As indicated by the results, SAMOS seems to outperform Apromore in the coverage of the metrics in pairwise similarity of models. Later, we do a comparative analysis of the tools on model clone detection using a dataset derived from the SAP Reference Model Collection. In this case, the results show a better precision for Apromore, while a higher recall measure for SAMOS. Finally, we show the additional capabilities of our approach for different model scoping styles through another set of experimental evaluations.https://peerj.com/articles/cs-1046.pdfModel-driven engineeringBusiness process modelsModel analyticsModel clone detectionVector space modelClustering
spellingShingle Mahdi Saeedi Nikoo
Önder Babur
Mark van den Brand
Clone detection for business process models
PeerJ Computer Science
Model-driven engineering
Business process models
Model analytics
Model clone detection
Vector space model
Clustering
title Clone detection for business process models
title_full Clone detection for business process models
title_fullStr Clone detection for business process models
title_full_unstemmed Clone detection for business process models
title_short Clone detection for business process models
title_sort clone detection for business process models
topic Model-driven engineering
Business process models
Model analytics
Model clone detection
Vector space model
Clustering
url https://peerj.com/articles/cs-1046.pdf
work_keys_str_mv AT mahdisaeedinikoo clonedetectionforbusinessprocessmodels
AT onderbabur clonedetectionforbusinessprocessmodels
AT markvandenbrand clonedetectionforbusinessprocessmodels