Faster R-CNN-LSTM Construction Site Unsafe Behavior Recognition Model

Aiming at the problem of insufficient accuracy caused by the insufficient mining of spatiotemporal features in the process of unsafe behavior and danger identification of construction personnel, the traditional two-stream convolution model is improved, and a two-stream convolution dangerous behavior...

Full description

Bibliographic Details
Main Authors: Xu Li, Tianxuan Hao, Fan Li, Lizhen Zhao, Zehua Wang
Format: Article
Language:English
Published: MDPI AG 2023-09-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/13/19/10700
_version_ 1797576240349052928
author Xu Li
Tianxuan Hao
Fan Li
Lizhen Zhao
Zehua Wang
author_facet Xu Li
Tianxuan Hao
Fan Li
Lizhen Zhao
Zehua Wang
author_sort Xu Li
collection DOAJ
description Aiming at the problem of insufficient accuracy caused by the insufficient mining of spatiotemporal features in the process of unsafe behavior and danger identification of construction personnel, the traditional two-stream convolution model is improved, and a two-stream convolution dangerous behavior recognition model based on Faster R-CNN-LSTM is proposed. In this model, the Faster R-CNN network is connected in parallel with the LSTM network. The Faster R-CNN network is used as the spatial flow, and the human spatial motion posture is divided into static and dynamic features to extract the anchor point features, respectively. The fusion of the two is used as the output of the spatial flow. An improved sliding long-term and short-term memory network is used in the time flow to increase the extraction ability of the time series features of the construction personnel. Finally, the two branches are fused in time and space to classify and identify whether the construction personnel wear safety helmets. The results show that the MAP of the improved Faster R-CNN-LSTM network framework is increased by 15%. The original CNN-LSTM network framework detected four targets, but there was one misdetection, with an accuracy of 91.48%. The improved frame detection accuracy reaches 99.99%, and there is no error detection. The proposed method is superior to the pre-improvement and other methods that can effectively identify the unsafe behavior of construction workers on construction sites and also has a good distinction effect on fuzzy actions.
first_indexed 2024-03-10T21:49:28Z
format Article
id doaj.art-55ebd0e105984a6d8d56145ce2d45bee
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-10T21:49:28Z
publishDate 2023-09-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-55ebd0e105984a6d8d56145ce2d45bee2023-11-19T14:02:56ZengMDPI AGApplied Sciences2076-34172023-09-0113191070010.3390/app131910700Faster R-CNN-LSTM Construction Site Unsafe Behavior Recognition ModelXu Li0Tianxuan Hao1Fan Li2Lizhen Zhao3Zehua Wang4College of Safety Science and Engineering, Henan Polytechnic University, Jiaozuo 454150, ChinaCollege of Safety Science and Engineering, Henan Polytechnic University, Jiaozuo 454150, ChinaCollege of Safety Science and Engineering, Henan Polytechnic University, Jiaozuo 454150, ChinaCollege of Safety Science and Engineering, Henan Polytechnic University, Jiaozuo 454150, ChinaCollege of Safety Science and Engineering, Henan Polytechnic University, Jiaozuo 454150, ChinaAiming at the problem of insufficient accuracy caused by the insufficient mining of spatiotemporal features in the process of unsafe behavior and danger identification of construction personnel, the traditional two-stream convolution model is improved, and a two-stream convolution dangerous behavior recognition model based on Faster R-CNN-LSTM is proposed. In this model, the Faster R-CNN network is connected in parallel with the LSTM network. The Faster R-CNN network is used as the spatial flow, and the human spatial motion posture is divided into static and dynamic features to extract the anchor point features, respectively. The fusion of the two is used as the output of the spatial flow. An improved sliding long-term and short-term memory network is used in the time flow to increase the extraction ability of the time series features of the construction personnel. Finally, the two branches are fused in time and space to classify and identify whether the construction personnel wear safety helmets. The results show that the MAP of the improved Faster R-CNN-LSTM network framework is increased by 15%. The original CNN-LSTM network framework detected four targets, but there was one misdetection, with an accuracy of 91.48%. The improved frame detection accuracy reaches 99.99%, and there is no error detection. The proposed method is superior to the pre-improvement and other methods that can effectively identify the unsafe behavior of construction workers on construction sites and also has a good distinction effect on fuzzy actions.https://www.mdpi.com/2076-3417/13/19/10700behaviorsecurityrecognitionConvolutional Neural Networkslong short-term memory networksFaster R-CNN
spellingShingle Xu Li
Tianxuan Hao
Fan Li
Lizhen Zhao
Zehua Wang
Faster R-CNN-LSTM Construction Site Unsafe Behavior Recognition Model
Applied Sciences
behavior
security
recognition
Convolutional Neural Networks
long short-term memory networks
Faster R-CNN
title Faster R-CNN-LSTM Construction Site Unsafe Behavior Recognition Model
title_full Faster R-CNN-LSTM Construction Site Unsafe Behavior Recognition Model
title_fullStr Faster R-CNN-LSTM Construction Site Unsafe Behavior Recognition Model
title_full_unstemmed Faster R-CNN-LSTM Construction Site Unsafe Behavior Recognition Model
title_short Faster R-CNN-LSTM Construction Site Unsafe Behavior Recognition Model
title_sort faster r cnn lstm construction site unsafe behavior recognition model
topic behavior
security
recognition
Convolutional Neural Networks
long short-term memory networks
Faster R-CNN
url https://www.mdpi.com/2076-3417/13/19/10700
work_keys_str_mv AT xuli fasterrcnnlstmconstructionsiteunsafebehaviorrecognitionmodel
AT tianxuanhao fasterrcnnlstmconstructionsiteunsafebehaviorrecognitionmodel
AT fanli fasterrcnnlstmconstructionsiteunsafebehaviorrecognitionmodel
AT lizhenzhao fasterrcnnlstmconstructionsiteunsafebehaviorrecognitionmodel
AT zehuawang fasterrcnnlstmconstructionsiteunsafebehaviorrecognitionmodel