Towards a Method to Enable the Selection of Physical Models within the Systems Engineering Process: A Case Study with Simulink Models

The use of different techniques and tools is a common practice to cover all stages in the development life-cycle of systems generating a significant number of work products. These artefacts are frequently encoded using diverse formats, and often require access through non-standard protocols and form...

Full description

Bibliographic Details
Main Authors: Eduardo Cibrián, Jose María Álvarez-Rodríguez, Roy Mendieta, Juan Llorens
Format: Article
Language:English
Published: MDPI AG 2023-11-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/13/21/11999
_version_ 1797632157720510464
author Eduardo Cibrián
Jose María Álvarez-Rodríguez
Roy Mendieta
Juan Llorens
author_facet Eduardo Cibrián
Jose María Álvarez-Rodríguez
Roy Mendieta
Juan Llorens
author_sort Eduardo Cibrián
collection DOAJ
description The use of different techniques and tools is a common practice to cover all stages in the development life-cycle of systems generating a significant number of work products. These artefacts are frequently encoded using diverse formats, and often require access through non-standard protocols and formats. In this context, Model-Based Systems Engineering (MBSE) emerges as a methodology to shift the paradigm of Systems Engineering practice from a document-oriented environment to a model-intensive environment. To achieve this major goal, a formalised application of modelling is employed throughout the life-cycle of systems to generate various system artefacts represented as models, such as requirements, logical models, and multi-physics models. However, the mere use of models does not guarantee one of the main challenges in the Systems Engineering discipline, namely, the reuse of system artefacts. Considering the fact that models are becoming the main type of system artefact, it is necessary to provide the capability to properly and efficiently represent and retrieve the generated models. In light of this, traditional information retrieval techniques have been widely studied to match existing software assets according to a set of capabilities or restrictions. However, there is much more at stake than the simple retrieval of models or even any piece of knowledge. An environment for model reuse must provide the proper mechanisms to (1) represent any piece of data, information, or knowledge under a common and shared data model, and (2) provide advanced retrieval mechanisms to elevate the meaning of information resources from text-based descriptions to concept-based ones. This need has led to novel methods using word embeddings and vector-based representations to semantically encode information. Such methods are applied to encode the information of physical models while preserving their underlying semantics. In this study, a text corpus from MATLAB Simulink models was preprocessed using Natural Language Processing (NLP) techniques and trained to generate word vector representations. Then, the presented method was validated using a testbed of MATLAB Simulink physical models in which verbalisations of models are transformed into vectors. The effectiveness of the proposed solution was assessed through a use case study. Evaluation of the results demonstrates a precision value of 0.925, a recall value of 0.865, and an F1 score of 0.884.
first_indexed 2024-03-11T11:33:57Z
format Article
id doaj.art-b7f9fb6b5747490387dbd64f119566f7
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-11T11:33:57Z
publishDate 2023-11-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-b7f9fb6b5747490387dbd64f119566f72023-11-10T14:59:26ZengMDPI AGApplied Sciences2076-34172023-11-0113211199910.3390/app132111999Towards a Method to Enable the Selection of Physical Models within the Systems Engineering Process: A Case Study with Simulink ModelsEduardo Cibrián0Jose María Álvarez-Rodríguez1Roy Mendieta2Juan Llorens3Computer Science and Engineering Department, Carlos III University of Madrid, 28911 Leganés, SpainComputer Science and Engineering Department, Carlos III University of Madrid, 28911 Leganés, SpainThe REUSE Company, 28919 Leganés, SpainComputer Science and Engineering Department, Carlos III University of Madrid, 28911 Leganés, SpainThe use of different techniques and tools is a common practice to cover all stages in the development life-cycle of systems generating a significant number of work products. These artefacts are frequently encoded using diverse formats, and often require access through non-standard protocols and formats. In this context, Model-Based Systems Engineering (MBSE) emerges as a methodology to shift the paradigm of Systems Engineering practice from a document-oriented environment to a model-intensive environment. To achieve this major goal, a formalised application of modelling is employed throughout the life-cycle of systems to generate various system artefacts represented as models, such as requirements, logical models, and multi-physics models. However, the mere use of models does not guarantee one of the main challenges in the Systems Engineering discipline, namely, the reuse of system artefacts. Considering the fact that models are becoming the main type of system artefact, it is necessary to provide the capability to properly and efficiently represent and retrieve the generated models. In light of this, traditional information retrieval techniques have been widely studied to match existing software assets according to a set of capabilities or restrictions. However, there is much more at stake than the simple retrieval of models or even any piece of knowledge. An environment for model reuse must provide the proper mechanisms to (1) represent any piece of data, information, or knowledge under a common and shared data model, and (2) provide advanced retrieval mechanisms to elevate the meaning of information resources from text-based descriptions to concept-based ones. This need has led to novel methods using word embeddings and vector-based representations to semantically encode information. Such methods are applied to encode the information of physical models while preserving their underlying semantics. In this study, a text corpus from MATLAB Simulink models was preprocessed using Natural Language Processing (NLP) techniques and trained to generate word vector representations. Then, the presented method was validated using a testbed of MATLAB Simulink physical models in which verbalisations of models are transformed into vectors. The effectiveness of the proposed solution was assessed through a use case study. Evaluation of the results demonstrates a precision value of 0.925, a recall value of 0.865, and an F1 score of 0.884.https://www.mdpi.com/2076-3417/13/21/11999reuse modelsphysical modelsword embeddingsemantic representationsimilarity algorithm
spellingShingle Eduardo Cibrián
Jose María Álvarez-Rodríguez
Roy Mendieta
Juan Llorens
Towards a Method to Enable the Selection of Physical Models within the Systems Engineering Process: A Case Study with Simulink Models
Applied Sciences
reuse models
physical models
word embedding
semantic representation
similarity algorithm
title Towards a Method to Enable the Selection of Physical Models within the Systems Engineering Process: A Case Study with Simulink Models
title_full Towards a Method to Enable the Selection of Physical Models within the Systems Engineering Process: A Case Study with Simulink Models
title_fullStr Towards a Method to Enable the Selection of Physical Models within the Systems Engineering Process: A Case Study with Simulink Models
title_full_unstemmed Towards a Method to Enable the Selection of Physical Models within the Systems Engineering Process: A Case Study with Simulink Models
title_short Towards a Method to Enable the Selection of Physical Models within the Systems Engineering Process: A Case Study with Simulink Models
title_sort towards a method to enable the selection of physical models within the systems engineering process a case study with simulink models
topic reuse models
physical models
word embedding
semantic representation
similarity algorithm
url https://www.mdpi.com/2076-3417/13/21/11999
work_keys_str_mv AT eduardocibrian towardsamethodtoenabletheselectionofphysicalmodelswithinthesystemsengineeringprocessacasestudywithsimulinkmodels
AT josemariaalvarezrodriguez towardsamethodtoenabletheselectionofphysicalmodelswithinthesystemsengineeringprocessacasestudywithsimulinkmodels
AT roymendieta towardsamethodtoenabletheselectionofphysicalmodelswithinthesystemsengineeringprocessacasestudywithsimulinkmodels
AT juanllorens towardsamethodtoenabletheselectionofphysicalmodelswithinthesystemsengineeringprocessacasestudywithsimulinkmodels