Vision-Based Automated Recognition and 3D Localization Framework for Tower Cranes Using Far-Field Cameras

Tower cranes can cover most of the area of a construction site, which brings significant safety risks, including potential collisions with other entities. To address these issues, it is necessary to obtain accurate and real-time information on the orientation and location of tower cranes and hooks....

Full description

Bibliographic Details
Main Authors: Jiyao Wang, Qilin Zhang, Bin Yang, Binghan Zhang
Format: Article
Language:English
Published: MDPI AG 2023-05-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/23/10/4851
_version_ 1797598413218381824
author Jiyao Wang
Qilin Zhang
Bin Yang
Binghan Zhang
author_facet Jiyao Wang
Qilin Zhang
Bin Yang
Binghan Zhang
author_sort Jiyao Wang
collection DOAJ
description Tower cranes can cover most of the area of a construction site, which brings significant safety risks, including potential collisions with other entities. To address these issues, it is necessary to obtain accurate and real-time information on the orientation and location of tower cranes and hooks. As a non-invasive sensing method, computer vision-based (CVB) technology is widely applied on construction sites for object detection and three-dimensional (3D) localization. However, most existing methods mainly address the localization on the construction ground plane or rely on specific viewpoints and positions. To address these issues, this study proposes a framework for the real-time recognition and localization of tower cranes and hooks using monocular far-field cameras. The framework consists of four steps: far-field camera autocalibration using feature matching and horizon-line detection, deep learning-based segmentation of tower cranes, geometric feature reconstruction of tower cranes, and 3D localization estimation. The pose estimation of tower cranes using monocular far-field cameras with arbitrary views is the main contribution of this paper. To evaluate the proposed framework, a series of comprehensive experiments were conducted on construction sites in different scenarios and compared with ground-truth data obtained by sensors. The experimental results show that the proposed framework achieves high precision in both crane jib orientation estimation and hook position estimation, thereby contributing to the development of safety management and productivity analysis.
first_indexed 2024-03-11T03:20:52Z
format Article
id doaj.art-ddfb00b6a3a14817a10fcfc3b987ec57
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-03-11T03:20:52Z
publishDate 2023-05-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-ddfb00b6a3a14817a10fcfc3b987ec572023-11-18T03:13:34ZengMDPI AGSensors1424-82202023-05-012310485110.3390/s23104851Vision-Based Automated Recognition and 3D Localization Framework for Tower Cranes Using Far-Field CamerasJiyao Wang0Qilin Zhang1Bin Yang2Binghan Zhang3Department of Structural Engineering, Tongji University, 1239 Siping Road, Shanghai 200092, ChinaDepartment of Structural Engineering, Tongji University, 1239 Siping Road, Shanghai 200092, ChinaDepartment of Structural Engineering, Tongji University, 1239 Siping Road, Shanghai 200092, ChinaDepartment of Structural Engineering, Tongji University, 1239 Siping Road, Shanghai 200092, ChinaTower cranes can cover most of the area of a construction site, which brings significant safety risks, including potential collisions with other entities. To address these issues, it is necessary to obtain accurate and real-time information on the orientation and location of tower cranes and hooks. As a non-invasive sensing method, computer vision-based (CVB) technology is widely applied on construction sites for object detection and three-dimensional (3D) localization. However, most existing methods mainly address the localization on the construction ground plane or rely on specific viewpoints and positions. To address these issues, this study proposes a framework for the real-time recognition and localization of tower cranes and hooks using monocular far-field cameras. The framework consists of four steps: far-field camera autocalibration using feature matching and horizon-line detection, deep learning-based segmentation of tower cranes, geometric feature reconstruction of tower cranes, and 3D localization estimation. The pose estimation of tower cranes using monocular far-field cameras with arbitrary views is the main contribution of this paper. To evaluate the proposed framework, a series of comprehensive experiments were conducted on construction sites in different scenarios and compared with ground-truth data obtained by sensors. The experimental results show that the proposed framework achieves high precision in both crane jib orientation estimation and hook position estimation, thereby contributing to the development of safety management and productivity analysis.https://www.mdpi.com/1424-8220/23/10/4851tower cranecomputer visionsensing systemthree-dimensional localizationfar-field camera
spellingShingle Jiyao Wang
Qilin Zhang
Bin Yang
Binghan Zhang
Vision-Based Automated Recognition and 3D Localization Framework for Tower Cranes Using Far-Field Cameras
Sensors
tower crane
computer vision
sensing system
three-dimensional localization
far-field camera
title Vision-Based Automated Recognition and 3D Localization Framework for Tower Cranes Using Far-Field Cameras
title_full Vision-Based Automated Recognition and 3D Localization Framework for Tower Cranes Using Far-Field Cameras
title_fullStr Vision-Based Automated Recognition and 3D Localization Framework for Tower Cranes Using Far-Field Cameras
title_full_unstemmed Vision-Based Automated Recognition and 3D Localization Framework for Tower Cranes Using Far-Field Cameras
title_short Vision-Based Automated Recognition and 3D Localization Framework for Tower Cranes Using Far-Field Cameras
title_sort vision based automated recognition and 3d localization framework for tower cranes using far field cameras
topic tower crane
computer vision
sensing system
three-dimensional localization
far-field camera
url https://www.mdpi.com/1424-8220/23/10/4851
work_keys_str_mv AT jiyaowang visionbasedautomatedrecognitionand3dlocalizationframeworkfortowercranesusingfarfieldcameras
AT qilinzhang visionbasedautomatedrecognitionand3dlocalizationframeworkfortowercranesusingfarfieldcameras
AT binyang visionbasedautomatedrecognitionand3dlocalizationframeworkfortowercranesusingfarfieldcameras
AT binghanzhang visionbasedautomatedrecognitionand3dlocalizationframeworkfortowercranesusingfarfieldcameras