BADANet: Boundary Aware Dilated Attention Network for Face Parsing

Over the past few years, deep learning techniques have revolutionized the field of face parsing by utilizing massive datasets to generate high-level features and achieve outstanding performance. Usually, these techniques involve Convolutional Neural Networks (CNNs) to derive features from the input...

Full description

Bibliographic Details
Main Authors: S. Raghavendra, S. K. Abhilash, Venu Madhav Nookala, N. N. Srinidhi, N. D. Adesh
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10264074/
_version_ 1797663087004745728
author S. Raghavendra
S. K. Abhilash
Venu Madhav Nookala
N. N. Srinidhi
N. D. Adesh
author_facet S. Raghavendra
S. K. Abhilash
Venu Madhav Nookala
N. N. Srinidhi
N. D. Adesh
author_sort S. Raghavendra
collection DOAJ
description Over the past few years, deep learning techniques have revolutionized the field of face parsing by utilizing massive datasets to generate high-level features and achieve outstanding performance. Usually, these techniques involve Convolutional Neural Networks (CNNs) to derive features from the input image and then a decoder network to forecast the semantic labels for every pixel. Even with complex deep convolutional neural networks (DCNN), incorrect parsing leads to a semantic gap between identical features, especially at boundary levels. Face parsing has encountered additional challenges due to the inclusion of pose, illumination and facial expressions. In order to address these issues, a Boundary Aware Dilated Attention Network (BADANet) is introduced which explores the use of multi-scale techniques to improve the accuracy and robustness of the per-pixel frames. BADANet&#x2019;s dilated attention module has been combined with a lightweight backbone to achieve exceptional results on LaPa, CelebAMask-HQ, and iBugMask. Extensive evaluations of the proposed method demonstrate its performance to be on par with various state-of-the-art techniques. The pipeline for the BADANet is available at <uri>https://github.com/abhigoku10/BADANet</uri>
first_indexed 2024-03-11T19:09:27Z
format Article
id doaj.art-bb9d28ace5194bf9a79c0fea8c08e935
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-03-11T19:09:27Z
publishDate 2023-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-bb9d28ace5194bf9a79c0fea8c08e9352023-10-09T23:01:18ZengIEEEIEEE Access2169-35362023-01-011110674910675910.1109/ACCESS.2023.331956110264074BADANet: Boundary Aware Dilated Attention Network for Face ParsingS. Raghavendra0https://orcid.org/0000-0003-2733-3916S. K. Abhilash1https://orcid.org/0000-0002-1119-4782Venu Madhav Nookala2https://orcid.org/0000-0002-0078-5050N. N. Srinidhi3https://orcid.org/0000-0002-2865-1185N. D. Adesh4https://orcid.org/0000-0002-9750-113XDepartment of Information and Communication Technology, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, IndiaPathPartner Technology Pvt. Ltd., Bengaluru, IndiaPathPartner Technology Pvt. Ltd., Bengaluru, IndiaDepartment of Computer Science and Engineering, Manipal Institute of Technology Bengaluru, Manipal Academy of Higher Education, Manipal, IndiaDepartment of Information and Communication Technology, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, IndiaOver the past few years, deep learning techniques have revolutionized the field of face parsing by utilizing massive datasets to generate high-level features and achieve outstanding performance. Usually, these techniques involve Convolutional Neural Networks (CNNs) to derive features from the input image and then a decoder network to forecast the semantic labels for every pixel. Even with complex deep convolutional neural networks (DCNN), incorrect parsing leads to a semantic gap between identical features, especially at boundary levels. Face parsing has encountered additional challenges due to the inclusion of pose, illumination and facial expressions. In order to address these issues, a Boundary Aware Dilated Attention Network (BADANet) is introduced which explores the use of multi-scale techniques to improve the accuracy and robustness of the per-pixel frames. BADANet&#x2019;s dilated attention module has been combined with a lightweight backbone to achieve exceptional results on LaPa, CelebAMask-HQ, and iBugMask. Extensive evaluations of the proposed method demonstrate its performance to be on par with various state-of-the-art techniques. The pipeline for the BADANet is available at <uri>https://github.com/abhigoku10/BADANet</uri>https://ieeexplore.ieee.org/document/10264074/BADANetbinary cross entropyCNNface parsingindustry innovation
spellingShingle S. Raghavendra
S. K. Abhilash
Venu Madhav Nookala
N. N. Srinidhi
N. D. Adesh
BADANet: Boundary Aware Dilated Attention Network for Face Parsing
IEEE Access
BADANet
binary cross entropy
CNN
face parsing
industry innovation
title BADANet: Boundary Aware Dilated Attention Network for Face Parsing
title_full BADANet: Boundary Aware Dilated Attention Network for Face Parsing
title_fullStr BADANet: Boundary Aware Dilated Attention Network for Face Parsing
title_full_unstemmed BADANet: Boundary Aware Dilated Attention Network for Face Parsing
title_short BADANet: Boundary Aware Dilated Attention Network for Face Parsing
title_sort badanet boundary aware dilated attention network for face parsing
topic BADANet
binary cross entropy
CNN
face parsing
industry innovation
url https://ieeexplore.ieee.org/document/10264074/
work_keys_str_mv AT sraghavendra badanetboundaryawaredilatedattentionnetworkforfaceparsing
AT skabhilash badanetboundaryawaredilatedattentionnetworkforfaceparsing
AT venumadhavnookala badanetboundaryawaredilatedattentionnetworkforfaceparsing
AT nnsrinidhi badanetboundaryawaredilatedattentionnetworkforfaceparsing
AT ndadesh badanetboundaryawaredilatedattentionnetworkforfaceparsing