Polygonized Silhouettes and Polygon Coding Based Feature Representation for Human Action Recognition

The characteristics of human silhouette shape can be used for action recognition and classification. In this paper, a novel feature extraction method for the silhouette-based classification of human actions in videos is proposed. The proposed method is based on polygonization of silhouette images an...

Full description

Bibliographic Details
Main Authors: Ogul Gocmen, Murat Emin Akata
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10144743/
_version_ 1797804045741588480
author Ogul Gocmen
Murat Emin Akata
author_facet Ogul Gocmen
Murat Emin Akata
author_sort Ogul Gocmen
collection DOAJ
description The characteristics of human silhouette shape can be used for action recognition and classification. In this paper, a novel feature extraction method for the silhouette-based classification of human actions in videos is proposed. The proposed method is based on polygonization of silhouette images and coding. Since conventional silhouette generation methods do not satisfy the integrity of silhouettes, Yolact++ is modified as a silhouette generator. Our innovative approach Yolact++ masks are used as silhouettes to overcome this problem. For this purpose, a new image form called Poly Silhouette (PoS), a new Polygonization (PoG) algorithm and a new Polygon Coding (PoC) algorithm have been developed. The polygonization step is based on, but is not similar to curve and image polygonization. It is fast, adaptable, and accurate on the contour coordinates of the PoS images. PoCs were generated by projecting each edge vector generated from the corner coordinates of the PoS onto the angular areas and codes for the PoS were formed. These codes are grouped into k-mers similar to genetic algorithms and are used as features. The proposed innovative feature extraction method guarantees that feature vectors of equal length are generated from any action video. Thus, no additional action is required to overcome the dimensionality problem. By using different k-mer lengths, the classification accuracy of the method versus computation time was analyzed and depicted in figures. The method developed was tested on HMDB51 & UCF101 datasets: for SVM 20.98%, 1.63% for k-NN 4.96%, 6.83%, respectively, and significant improvements were achieved.
first_indexed 2024-03-13T05:31:14Z
format Article
id doaj.art-31c6026099a042818a2ff20f9da6ec58
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-03-13T05:31:14Z
publishDate 2023-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-31c6026099a042818a2ff20f9da6ec582023-06-14T23:00:18ZengIEEEIEEE Access2169-35362023-01-0111570215703610.1109/ACCESS.2023.328345810144743Polygonized Silhouettes and Polygon Coding Based Feature Representation for Human Action RecognitionOgul Gocmen0https://orcid.org/0000-0001-9059-9800Murat Emin Akata1Department of Computer Engineering, Engineering Faculty, Baskent University, Ankara, TurkeyDepartment of Computer Engineering, Engineering Faculty, Baskent University, Ankara, TurkeyThe characteristics of human silhouette shape can be used for action recognition and classification. In this paper, a novel feature extraction method for the silhouette-based classification of human actions in videos is proposed. The proposed method is based on polygonization of silhouette images and coding. Since conventional silhouette generation methods do not satisfy the integrity of silhouettes, Yolact++ is modified as a silhouette generator. Our innovative approach Yolact++ masks are used as silhouettes to overcome this problem. For this purpose, a new image form called Poly Silhouette (PoS), a new Polygonization (PoG) algorithm and a new Polygon Coding (PoC) algorithm have been developed. The polygonization step is based on, but is not similar to curve and image polygonization. It is fast, adaptable, and accurate on the contour coordinates of the PoS images. PoCs were generated by projecting each edge vector generated from the corner coordinates of the PoS onto the angular areas and codes for the PoS were formed. These codes are grouped into k-mers similar to genetic algorithms and are used as features. The proposed innovative feature extraction method guarantees that feature vectors of equal length are generated from any action video. Thus, no additional action is required to overcome the dimensionality problem. By using different k-mer lengths, the classification accuracy of the method versus computation time was analyzed and depicted in figures. The method developed was tested on HMDB51 & UCF101 datasets: for SVM 20.98%, 1.63% for k-NN 4.96%, 6.83%, respectively, and significant improvements were achieved.https://ieeexplore.ieee.org/document/10144743/Polygonized silhouettespolygon codinghuman action recognition
spellingShingle Ogul Gocmen
Murat Emin Akata
Polygonized Silhouettes and Polygon Coding Based Feature Representation for Human Action Recognition
IEEE Access
Polygonized silhouettes
polygon coding
human action recognition
title Polygonized Silhouettes and Polygon Coding Based Feature Representation for Human Action Recognition
title_full Polygonized Silhouettes and Polygon Coding Based Feature Representation for Human Action Recognition
title_fullStr Polygonized Silhouettes and Polygon Coding Based Feature Representation for Human Action Recognition
title_full_unstemmed Polygonized Silhouettes and Polygon Coding Based Feature Representation for Human Action Recognition
title_short Polygonized Silhouettes and Polygon Coding Based Feature Representation for Human Action Recognition
title_sort polygonized silhouettes and polygon coding based feature representation for human action recognition
topic Polygonized silhouettes
polygon coding
human action recognition
url https://ieeexplore.ieee.org/document/10144743/
work_keys_str_mv AT ogulgocmen polygonizedsilhouettesandpolygoncodingbasedfeaturerepresentationforhumanactionrecognition
AT murateminakata polygonizedsilhouettesandpolygoncodingbasedfeaturerepresentationforhumanactionrecognition