Efficient and fault tolerant HLA-based simulation

Distributed simulation subdivides a complex simulation (federation) into a group of simulation components (federates) and executes them in distributed manner. The High Level Architecture (HLA), an IEEE 1516 standard, provides a general framework for developing large-scale distributed simulations. Th...

Full description

Bibliographic Details
Main Author: Li, Zengxiang
Other Authors: Stephen John Turner
Format: Thesis
Language:English
Published: 2012
Subjects:
Online Access:https://hdl.handle.net/10356/48170
_version_ 1826126582965600256
author Li, Zengxiang
author2 Stephen John Turner
author_facet Stephen John Turner
Li, Zengxiang
author_sort Li, Zengxiang
collection NTU
description Distributed simulation subdivides a complex simulation (federation) into a group of simulation components (federates) and executes them in distributed manner. The High Level Architecture (HLA), an IEEE 1516 standard, provides a general framework for developing large-scale distributed simulations. The Runtime Infrastructure (RTI) is a middleware that controls the communication among federates according to the HLA interface specification. The simulation executions may involve a large number of computationally intensive federates and thus are time and resource consuming. What is worse, these federates may be subject to crash-stop and Byzantine failures and the risk of federation failure increases with the federation scale. In this thesis, we propose mechanisms to support efficient and fault tolerant HLA-based simulation by exploiting the advantages of decoupled federate architecture, in which a federate connects to federation through its corresponding Decoupled RTI Component (DRC). Workload imbalance generally leads to poor distributed simulation performance. To achieve load balancing, we propose to migrate federates from heavily-loaded computing nodes to lightly-loaded ones. Using the decoupled federate architecture, only needs the federate to be migrated to the destination computing node; whereas the DRC can stay at the same place and keep the connection to the federation. One-phase migration protocol is first proposed to illustrate the federate migration process. Then, two-phases and relay-based migration protocols are further developed to reduce migration overhead by overlapping federate migration with continuous federate execution.
first_indexed 2024-10-01T06:55:01Z
format Thesis
id ntu-10356/48170
institution Nanyang Technological University
language English
last_indexed 2024-10-01T06:55:01Z
publishDate 2012
record_format dspace
spelling ntu-10356/481702023-03-04T00:48:12Z Efficient and fault tolerant HLA-based simulation Li, Zengxiang Stephen John Turner Cai Wentong School of Computer Engineering Parallel and Distributed Computing Centre DRNTU::Engineering::Computer science and engineering::Computer systems organization::Computer system implementation Distributed simulation subdivides a complex simulation (federation) into a group of simulation components (federates) and executes them in distributed manner. The High Level Architecture (HLA), an IEEE 1516 standard, provides a general framework for developing large-scale distributed simulations. The Runtime Infrastructure (RTI) is a middleware that controls the communication among federates according to the HLA interface specification. The simulation executions may involve a large number of computationally intensive federates and thus are time and resource consuming. What is worse, these federates may be subject to crash-stop and Byzantine failures and the risk of federation failure increases with the federation scale. In this thesis, we propose mechanisms to support efficient and fault tolerant HLA-based simulation by exploiting the advantages of decoupled federate architecture, in which a federate connects to federation through its corresponding Decoupled RTI Component (DRC). Workload imbalance generally leads to poor distributed simulation performance. To achieve load balancing, we propose to migrate federates from heavily-loaded computing nodes to lightly-loaded ones. Using the decoupled federate architecture, only needs the federate to be migrated to the destination computing node; whereas the DRC can stay at the same place and keep the connection to the federation. One-phase migration protocol is first proposed to illustrate the federate migration process. Then, two-phases and relay-based migration protocols are further developed to reduce migration overhead by overlapping federate migration with continuous federate execution. DOCTOR OF PHILOSOPHY (SCE) 2012-03-26T09:04:04Z 2012-03-26T09:04:04Z 2012 2012 Thesis Li, Z. X. (2012). Efficient and fault tolerant HLA-based simulation. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/48170 10.32657/10356/48170 en 177 p. application/pdf
spellingShingle DRNTU::Engineering::Computer science and engineering::Computer systems organization::Computer system implementation
Li, Zengxiang
Efficient and fault tolerant HLA-based simulation
title Efficient and fault tolerant HLA-based simulation
title_full Efficient and fault tolerant HLA-based simulation
title_fullStr Efficient and fault tolerant HLA-based simulation
title_full_unstemmed Efficient and fault tolerant HLA-based simulation
title_short Efficient and fault tolerant HLA-based simulation
title_sort efficient and fault tolerant hla based simulation
topic DRNTU::Engineering::Computer science and engineering::Computer systems organization::Computer system implementation
url https://hdl.handle.net/10356/48170
work_keys_str_mv AT lizengxiang efficientandfaulttoleranthlabasedsimulation