Enhancing action recognition of construction workers using data-driven scene parsing

Vision-based action recognition of construction workers has attracted increasing attention for its diverse applications. Though state-of-the-art performance has been achieved using spatial-temporal features in previous studies, considerable challenges remain in the context of cluttered and dynamic...

Bibliographic Details
Main Author:	Jun Yang
Format:	Article
Language:	English
Published:	Vilnius Gediminas Technical University 2018-11-01
Series:	Journal of Civil Engineering and Management
Subjects:	worker; action recognition; scene parsing; computer vision; context
Online Access:	https://journals.vgtu.lt/index.php/JCEM/article/view/6133

_version_	1831598695578075136
author	Jun Yang
author_facet	Jun Yang
author_sort	Jun Yang
collection	DOAJ
description	Vision-based action recognition of construction workers has attracted increasing attention for its diverse applications. Though state-of-the-art performance has been achieved using spatial-temporal features in previous studies, considerable challenges remain in the context of cluttered and dynamic construction sites. Considering that workers' actions are closely related to various construction entities, this paper proposes a novel system for enhancing action recognition using semantic information. A data-driven scene parsing method, named label transfer, is adopted to recognize construction entities in the entire scene. A probabilistic model of actions with context is established. Worker actions are first classified using dense trajectories, and the classification is then refined using construction object recognition. The experimental results on a comprehensive dataset show that the proposed system outperforms the baseline algorithm by 10.5%. The paper provides a new solution that integrates semantic information globally, rather than through conventional object detection, which can only depict local context. The proposed system is especially suitable for construction sites, where semantic information is rich, from local objects to global surroundings. Compared with other methods that use object detection to integrate context information, it is easy to implement, requires no tedious training or parameter tuning, and scales with the number of recognizable objects.
first_indexed	2024-12-18T14:09:56Z
format	Article
id	doaj.art-6bb376b17ee544fc99e699d69a216f3a
institution	Directory of Open Access Journals
issn	1392-3730 1822-3605
language	English
last_indexed	2024-12-18T14:09:56Z
publishDate	2018-11-01
publisher	Vilnius Gediminas Technical University
record_format	Article
series	Journal of Civil Engineering and Management
spelling	doaj.art-6bb376b17ee544fc99e699d69a216f3a; 2022-12-21T21:05:09Z; eng; Vilnius Gediminas Technical University; Journal of Civil Engineering and Management; ISSN 1392-3730, 1822-3605; 2018-11-01; vol. 24, no. 7; DOI 10.3846/jcem.2018.6133; Enhancing action recognition of construction workers using data-driven scene parsing; Jun Yang (School of Automation, Northwestern Polytechnical University, Xi’an, China); https://journals.vgtu.lt/index.php/JCEM/article/view/6133; worker; action recognition; scene parsing; computer vision; context
spellingShingle	Jun Yang; Enhancing action recognition of construction workers using data-driven scene parsing; Journal of Civil Engineering and Management; worker; action recognition; scene parsing; computer vision; context
title	Enhancing action recognition of construction workers using data-driven scene parsing
title_full	Enhancing action recognition of construction workers using data-driven scene parsing
title_fullStr	Enhancing action recognition of construction workers using data-driven scene parsing
title_full_unstemmed	Enhancing action recognition of construction workers using data-driven scene parsing
title_short	Enhancing action recognition of construction workers using data-driven scene parsing
title_sort	enhancing action recognition of construction workers using data driven scene parsing
topic	worker; action recognition; scene parsing; computer vision; context
url	https://journals.vgtu.lt/index.php/JCEM/article/view/6133
work_keys_str_mv	AT junyang enhancingactionrecognitionofconstructionworkersusingdatadrivensceneparsing

Similar Items