Faster R-CNN-LSTM Construction Site Unsafe Behavior Recognition Model
Aiming at the problem of insufficient accuracy caused by the insufficient mining of spatiotemporal features in the process of unsafe behavior and danger identification of construction personnel, the traditional two-stream convolution model is improved, and a two-stream convolution dangerous behavior...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-09-01
|
Series: | Applied Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/2076-3417/13/19/10700 |
_version_ | 1797576240349052928 |
---|---|
author | Xu Li Tianxuan Hao Fan Li Lizhen Zhao Zehua Wang |
author_facet | Xu Li Tianxuan Hao Fan Li Lizhen Zhao Zehua Wang |
author_sort | Xu Li |
collection | DOAJ |
description | Aiming at the problem of insufficient accuracy caused by the insufficient mining of spatiotemporal features in the process of unsafe behavior and danger identification of construction personnel, the traditional two-stream convolution model is improved, and a two-stream convolution dangerous behavior recognition model based on Faster R-CNN-LSTM is proposed. In this model, the Faster R-CNN network is connected in parallel with the LSTM network. The Faster R-CNN network is used as the spatial flow, and the human spatial motion posture is divided into static and dynamic features to extract the anchor point features, respectively. The fusion of the two is used as the output of the spatial flow. An improved sliding long-term and short-term memory network is used in the time flow to increase the extraction ability of the time series features of the construction personnel. Finally, the two branches are fused in time and space to classify and identify whether the construction personnel wear safety helmets. The results show that the MAP of the improved Faster R-CNN-LSTM network framework is increased by 15%. The original CNN-LSTM network framework detected four targets, but there was one misdetection, with an accuracy of 91.48%. The improved frame detection accuracy reaches 99.99%, and there is no error detection. The proposed method is superior to the pre-improvement and other methods that can effectively identify the unsafe behavior of construction workers on construction sites and also has a good distinction effect on fuzzy actions. |
first_indexed | 2024-03-10T21:49:28Z |
format | Article |
id | doaj.art-55ebd0e105984a6d8d56145ce2d45bee |
institution | Directory Open Access Journal |
issn | 2076-3417 |
language | English |
last_indexed | 2024-03-10T21:49:28Z |
publishDate | 2023-09-01 |
publisher | MDPI AG |
record_format | Article |
series | Applied Sciences |
spelling | doaj.art-55ebd0e105984a6d8d56145ce2d45bee2023-11-19T14:02:56ZengMDPI AGApplied Sciences2076-34172023-09-0113191070010.3390/app131910700Faster R-CNN-LSTM Construction Site Unsafe Behavior Recognition ModelXu Li0Tianxuan Hao1Fan Li2Lizhen Zhao3Zehua Wang4College of Safety Science and Engineering, Henan Polytechnic University, Jiaozuo 454150, ChinaCollege of Safety Science and Engineering, Henan Polytechnic University, Jiaozuo 454150, ChinaCollege of Safety Science and Engineering, Henan Polytechnic University, Jiaozuo 454150, ChinaCollege of Safety Science and Engineering, Henan Polytechnic University, Jiaozuo 454150, ChinaCollege of Safety Science and Engineering, Henan Polytechnic University, Jiaozuo 454150, ChinaAiming at the problem of insufficient accuracy caused by the insufficient mining of spatiotemporal features in the process of unsafe behavior and danger identification of construction personnel, the traditional two-stream convolution model is improved, and a two-stream convolution dangerous behavior recognition model based on Faster R-CNN-LSTM is proposed. In this model, the Faster R-CNN network is connected in parallel with the LSTM network. The Faster R-CNN network is used as the spatial flow, and the human spatial motion posture is divided into static and dynamic features to extract the anchor point features, respectively. The fusion of the two is used as the output of the spatial flow. An improved sliding long-term and short-term memory network is used in the time flow to increase the extraction ability of the time series features of the construction personnel. Finally, the two branches are fused in time and space to classify and identify whether the construction personnel wear safety helmets. The results show that the MAP of the improved Faster R-CNN-LSTM network framework is increased by 15%. The original CNN-LSTM network framework detected four targets, but there was one misdetection, with an accuracy of 91.48%. The improved frame detection accuracy reaches 99.99%, and there is no error detection. The proposed method is superior to the pre-improvement and other methods that can effectively identify the unsafe behavior of construction workers on construction sites and also has a good distinction effect on fuzzy actions.https://www.mdpi.com/2076-3417/13/19/10700behaviorsecurityrecognitionConvolutional Neural Networkslong short-term memory networksFaster R-CNN |
spellingShingle | Xu Li Tianxuan Hao Fan Li Lizhen Zhao Zehua Wang Faster R-CNN-LSTM Construction Site Unsafe Behavior Recognition Model Applied Sciences behavior security recognition Convolutional Neural Networks long short-term memory networks Faster R-CNN |
title | Faster R-CNN-LSTM Construction Site Unsafe Behavior Recognition Model |
title_full | Faster R-CNN-LSTM Construction Site Unsafe Behavior Recognition Model |
title_fullStr | Faster R-CNN-LSTM Construction Site Unsafe Behavior Recognition Model |
title_full_unstemmed | Faster R-CNN-LSTM Construction Site Unsafe Behavior Recognition Model |
title_short | Faster R-CNN-LSTM Construction Site Unsafe Behavior Recognition Model |
title_sort | faster r cnn lstm construction site unsafe behavior recognition model |
topic | behavior security recognition Convolutional Neural Networks long short-term memory networks Faster R-CNN |
url | https://www.mdpi.com/2076-3417/13/19/10700 |
work_keys_str_mv | AT xuli fasterrcnnlstmconstructionsiteunsafebehaviorrecognitionmodel AT tianxuanhao fasterrcnnlstmconstructionsiteunsafebehaviorrecognitionmodel AT fanli fasterrcnnlstmconstructionsiteunsafebehaviorrecognitionmodel AT lizhenzhao fasterrcnnlstmconstructionsiteunsafebehaviorrecognitionmodel AT zehuawang fasterrcnnlstmconstructionsiteunsafebehaviorrecognitionmodel |