Vision-Based Automated Recognition and 3D Localization Framework for Tower Cranes Using Far-Field Cameras
Tower cranes can cover most of the area of a construction site, which brings significant safety risks, including potential collisions with other entities. To address these issues, it is necessary to obtain accurate and real-time information on the orientation and location of tower cranes and hooks....
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-05-01
|
Series: | Sensors |
Subjects: | |
Online Access: | https://www.mdpi.com/1424-8220/23/10/4851 |
_version_ | 1797598413218381824 |
---|---|
author | Jiyao Wang Qilin Zhang Bin Yang Binghan Zhang |
author_facet | Jiyao Wang Qilin Zhang Bin Yang Binghan Zhang |
author_sort | Jiyao Wang |
collection | DOAJ |
description | Tower cranes can cover most of the area of a construction site, which brings significant safety risks, including potential collisions with other entities. To address these issues, it is necessary to obtain accurate and real-time information on the orientation and location of tower cranes and hooks. As a non-invasive sensing method, computer vision-based (CVB) technology is widely applied on construction sites for object detection and three-dimensional (3D) localization. However, most existing methods mainly address the localization on the construction ground plane or rely on specific viewpoints and positions. To address these issues, this study proposes a framework for the real-time recognition and localization of tower cranes and hooks using monocular far-field cameras. The framework consists of four steps: far-field camera autocalibration using feature matching and horizon-line detection, deep learning-based segmentation of tower cranes, geometric feature reconstruction of tower cranes, and 3D localization estimation. The pose estimation of tower cranes using monocular far-field cameras with arbitrary views is the main contribution of this paper. To evaluate the proposed framework, a series of comprehensive experiments were conducted on construction sites in different scenarios and compared with ground-truth data obtained by sensors. The experimental results show that the proposed framework achieves high precision in both crane jib orientation estimation and hook position estimation, thereby contributing to the development of safety management and productivity analysis. |
first_indexed | 2024-03-11T03:20:52Z |
format | Article |
id | doaj.art-ddfb00b6a3a14817a10fcfc3b987ec57 |
institution | Directory Open Access Journal |
issn | 1424-8220 |
language | English |
last_indexed | 2024-03-11T03:20:52Z |
publishDate | 2023-05-01 |
publisher | MDPI AG |
record_format | Article |
series | Sensors |
spelling | doaj.art-ddfb00b6a3a14817a10fcfc3b987ec572023-11-18T03:13:34ZengMDPI AGSensors1424-82202023-05-012310485110.3390/s23104851Vision-Based Automated Recognition and 3D Localization Framework for Tower Cranes Using Far-Field CamerasJiyao Wang0Qilin Zhang1Bin Yang2Binghan Zhang3Department of Structural Engineering, Tongji University, 1239 Siping Road, Shanghai 200092, ChinaDepartment of Structural Engineering, Tongji University, 1239 Siping Road, Shanghai 200092, ChinaDepartment of Structural Engineering, Tongji University, 1239 Siping Road, Shanghai 200092, ChinaDepartment of Structural Engineering, Tongji University, 1239 Siping Road, Shanghai 200092, ChinaTower cranes can cover most of the area of a construction site, which brings significant safety risks, including potential collisions with other entities. To address these issues, it is necessary to obtain accurate and real-time information on the orientation and location of tower cranes and hooks. As a non-invasive sensing method, computer vision-based (CVB) technology is widely applied on construction sites for object detection and three-dimensional (3D) localization. However, most existing methods mainly address the localization on the construction ground plane or rely on specific viewpoints and positions. To address these issues, this study proposes a framework for the real-time recognition and localization of tower cranes and hooks using monocular far-field cameras. The framework consists of four steps: far-field camera autocalibration using feature matching and horizon-line detection, deep learning-based segmentation of tower cranes, geometric feature reconstruction of tower cranes, and 3D localization estimation. The pose estimation of tower cranes using monocular far-field cameras with arbitrary views is the main contribution of this paper. To evaluate the proposed framework, a series of comprehensive experiments were conducted on construction sites in different scenarios and compared with ground-truth data obtained by sensors. The experimental results show that the proposed framework achieves high precision in both crane jib orientation estimation and hook position estimation, thereby contributing to the development of safety management and productivity analysis.https://www.mdpi.com/1424-8220/23/10/4851tower cranecomputer visionsensing systemthree-dimensional localizationfar-field camera |
spellingShingle | Jiyao Wang Qilin Zhang Bin Yang Binghan Zhang Vision-Based Automated Recognition and 3D Localization Framework for Tower Cranes Using Far-Field Cameras Sensors tower crane computer vision sensing system three-dimensional localization far-field camera |
title | Vision-Based Automated Recognition and 3D Localization Framework for Tower Cranes Using Far-Field Cameras |
title_full | Vision-Based Automated Recognition and 3D Localization Framework for Tower Cranes Using Far-Field Cameras |
title_fullStr | Vision-Based Automated Recognition and 3D Localization Framework for Tower Cranes Using Far-Field Cameras |
title_full_unstemmed | Vision-Based Automated Recognition and 3D Localization Framework for Tower Cranes Using Far-Field Cameras |
title_short | Vision-Based Automated Recognition and 3D Localization Framework for Tower Cranes Using Far-Field Cameras |
title_sort | vision based automated recognition and 3d localization framework for tower cranes using far field cameras |
topic | tower crane computer vision sensing system three-dimensional localization far-field camera |
url | https://www.mdpi.com/1424-8220/23/10/4851 |
work_keys_str_mv | AT jiyaowang visionbasedautomatedrecognitionand3dlocalizationframeworkfortowercranesusingfarfieldcameras AT qilinzhang visionbasedautomatedrecognitionand3dlocalizationframeworkfortowercranesusingfarfieldcameras AT binyang visionbasedautomatedrecognitionand3dlocalizationframeworkfortowercranesusingfarfieldcameras AT binghanzhang visionbasedautomatedrecognitionand3dlocalizationframeworkfortowercranesusingfarfieldcameras |