Enhancing action recognition of construction workers using data-driven scene parsing
Vision-based action recognition of construction workers has attracted increasing attention for its diverse applications. Though state-of-the-art performances have been achieved using spatial-temporal features in previous studies, considerable challenges remain in the context of cluttered and dynamic...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Vilnius Gediminas Technical University
2018-11-01
|
Series: | Journal of Civil Engineering and Management |
Subjects: | |
Online Access: | https://journals.vgtu.lt/index.php/JCEM/article/view/6133 |
_version_ | 1818787573990948864 |
---|---|
author | Jun Yang |
author_facet | Jun Yang |
author_sort | Jun Yang |
collection | DOAJ |
description | Vision-based action recognition of construction workers has attracted increasing attention for its diverse applications. Though state-of-the-art performances have been achieved using spatial-temporal features in previous studies, considerable challenges remain in the context of cluttered and dynamic construction sites. Considering that workers actions are closely related to various construction entities, this paper proposes a novel system on enhancing action recognition using semantic information. A data-driven scene parsing method, named label transfer, is adopted to recognize construction entities in the entire scene. A probabilistic model of actions with context is established. Worker actions are first classified using dense trajectories, and then improved by construction object recognition. The experimental results on a comprehensive dataset show that the proposed system outperforms the baseline algorithm by 10.5%. The paper provides a new solution to integrate semantic information globally, other than conventional object detection, which can only depict local context. The proposed system is especially suitable for construction sites, where semantic information is rich from local objects to global surroundings. As compared to other methods using object detection to integrate context information, it is easy to implement, requiring no tedious training or parameter tuning, and is scalable to the number of recognizable objects. |
first_indexed | 2024-12-18T14:09:56Z |
format | Article |
id | doaj.art-6bb376b17ee544fc99e699d69a216f3a |
institution | Directory Open Access Journal |
issn | 1392-3730 1822-3605 |
language | English |
last_indexed | 2024-12-18T14:09:56Z |
publishDate | 2018-11-01 |
publisher | Vilnius Gediminas Technical University |
record_format | Article |
series | Journal of Civil Engineering and Management |
spelling | doaj.art-6bb376b17ee544fc99e699d69a216f3a2022-12-21T21:05:09ZengVilnius Gediminas Technical UniversityJournal of Civil Engineering and Management1392-37301822-36052018-11-0124710.3846/jcem.2018.6133Enhancing action recognition of construction workers using data-driven scene parsingJun Yang0School of Automation, Northwestern Polytechnical University, Xi’an, ChinaVision-based action recognition of construction workers has attracted increasing attention for its diverse applications. Though state-of-the-art performances have been achieved using spatial-temporal features in previous studies, considerable challenges remain in the context of cluttered and dynamic construction sites. Considering that workers actions are closely related to various construction entities, this paper proposes a novel system on enhancing action recognition using semantic information. A data-driven scene parsing method, named label transfer, is adopted to recognize construction entities in the entire scene. A probabilistic model of actions with context is established. Worker actions are first classified using dense trajectories, and then improved by construction object recognition. The experimental results on a comprehensive dataset show that the proposed system outperforms the baseline algorithm by 10.5%. The paper provides a new solution to integrate semantic information globally, other than conventional object detection, which can only depict local context. The proposed system is especially suitable for construction sites, where semantic information is rich from local objects to global surroundings. As compared to other methods using object detection to integrate context information, it is easy to implement, requiring no tedious training or parameter tuning, and is scalable to the number of recognizable objects.https://journals.vgtu.lt/index.php/JCEM/article/view/6133workeraction recognitionscene parsingcomputer visioncontext |
spellingShingle | Jun Yang Enhancing action recognition of construction workers using data-driven scene parsing Journal of Civil Engineering and Management worker action recognition scene parsing computer vision context |
title | Enhancing action recognition of construction workers using data-driven scene parsing |
title_full | Enhancing action recognition of construction workers using data-driven scene parsing |
title_fullStr | Enhancing action recognition of construction workers using data-driven scene parsing |
title_full_unstemmed | Enhancing action recognition of construction workers using data-driven scene parsing |
title_short | Enhancing action recognition of construction workers using data-driven scene parsing |
title_sort | enhancing action recognition of construction workers using data driven scene parsing |
topic | worker action recognition scene parsing computer vision context |
url | https://journals.vgtu.lt/index.php/JCEM/article/view/6133 |
work_keys_str_mv | AT junyang enhancingactionrecognitionofconstructionworkersusingdatadrivensceneparsing |