Enhancing action recognition of construction workers using data-driven scene parsing

Vision-based action recognition of construction workers has attracted increasing attention for its diverse applications. Though state-of-the-art performances have been achieved using spatial-temporal features in previous studies, considerable challenges remain in the context of cluttered and dynamic...

Full description

Bibliographic Details
Main Author: Jun Yang
Format: Article
Language:English
Published: Vilnius Gediminas Technical University 2018-11-01
Series:Journal of Civil Engineering and Management
Subjects:
Online Access:https://journals.vgtu.lt/index.php/JCEM/article/view/6133
_version_ 1818787573990948864
author Jun Yang
author_facet Jun Yang
author_sort Jun Yang
collection DOAJ
description Vision-based action recognition of construction workers has attracted increasing attention for its diverse applications. Though state-of-the-art performances have been achieved using spatial-temporal features in previous studies, considerable challenges remain in the context of cluttered and dynamic construction sites. Considering that workers actions are closely related to various construction entities, this paper proposes a novel system on enhancing action recognition using semantic information. A data-driven scene parsing method, named label transfer, is adopted to recognize construction entities in the entire scene. A probabilistic model of actions with context is established. Worker actions are first classified using dense trajectories, and then improved by construction object recognition. The experimental results on a comprehensive dataset show that the proposed system outperforms the baseline algorithm by 10.5%. The paper provides a new solution to integrate semantic information globally, other than conventional object detection, which can only depict local context. The proposed system is especially suitable for construction sites, where semantic information is rich from local objects to global surroundings. As compared to other methods using object detection to integrate context information, it is easy to implement, requiring no tedious training or parameter tuning, and is scalable to the number of recognizable objects.
first_indexed 2024-12-18T14:09:56Z
format Article
id doaj.art-6bb376b17ee544fc99e699d69a216f3a
institution Directory Open Access Journal
issn 1392-3730
1822-3605
language English
last_indexed 2024-12-18T14:09:56Z
publishDate 2018-11-01
publisher Vilnius Gediminas Technical University
record_format Article
series Journal of Civil Engineering and Management
spelling doaj.art-6bb376b17ee544fc99e699d69a216f3a2022-12-21T21:05:09ZengVilnius Gediminas Technical UniversityJournal of Civil Engineering and Management1392-37301822-36052018-11-0124710.3846/jcem.2018.6133Enhancing action recognition of construction workers using data-driven scene parsingJun Yang0School of Automation, Northwestern Polytechnical University, Xi’an, ChinaVision-based action recognition of construction workers has attracted increasing attention for its diverse applications. Though state-of-the-art performances have been achieved using spatial-temporal features in previous studies, considerable challenges remain in the context of cluttered and dynamic construction sites. Considering that workers actions are closely related to various construction entities, this paper proposes a novel system on enhancing action recognition using semantic information. A data-driven scene parsing method, named label transfer, is adopted to recognize construction entities in the entire scene. A probabilistic model of actions with context is established. Worker actions are first classified using dense trajectories, and then improved by construction object recognition. The experimental results on a comprehensive dataset show that the proposed system outperforms the baseline algorithm by 10.5%. The paper provides a new solution to integrate semantic information globally, other than conventional object detection, which can only depict local context. The proposed system is especially suitable for construction sites, where semantic information is rich from local objects to global surroundings. As compared to other methods using object detection to integrate context information, it is easy to implement, requiring no tedious training or parameter tuning, and is scalable to the number of recognizable objects.https://journals.vgtu.lt/index.php/JCEM/article/view/6133workeraction recognitionscene parsingcomputer visioncontext
spellingShingle Jun Yang
Enhancing action recognition of construction workers using data-driven scene parsing
Journal of Civil Engineering and Management
worker
action recognition
scene parsing
computer vision
context
title Enhancing action recognition of construction workers using data-driven scene parsing
title_full Enhancing action recognition of construction workers using data-driven scene parsing
title_fullStr Enhancing action recognition of construction workers using data-driven scene parsing
title_full_unstemmed Enhancing action recognition of construction workers using data-driven scene parsing
title_short Enhancing action recognition of construction workers using data-driven scene parsing
title_sort enhancing action recognition of construction workers using data driven scene parsing
topic worker
action recognition
scene parsing
computer vision
context
url https://journals.vgtu.lt/index.php/JCEM/article/view/6133
work_keys_str_mv AT junyang enhancingactionrecognitionofconstructionworkersusingdatadrivensceneparsing