SemAttNet: Toward Attention-Based Semantic Aware Guided Depth Completion

Depth completion involves recovering a dense depth map from a sparse map and an RGB image. Recent approaches focus on utilizing color images as guidance images to recover depth at invalid pixels. However, color images alone are not enough to provide the necessary semantic understanding of the scene....

Full description

Bibliographic Details
Main Authors: Danish Nazir, Alain Pagani, Marcus Liwicki, Didier Stricker, Muhammad Zeshan Afzal
Format: Article
Language:English
Published: IEEE 2022-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9918022/
_version_ 1828096951725326336
author Danish Nazir
Alain Pagani
Marcus Liwicki
Didier Stricker
Muhammad Zeshan Afzal
author_facet Danish Nazir
Alain Pagani
Marcus Liwicki
Didier Stricker
Muhammad Zeshan Afzal
author_sort Danish Nazir
collection DOAJ
description Depth completion involves recovering a dense depth map from a sparse map and an RGB image. Recent approaches focus on utilizing color images as guidance images to recover depth at invalid pixels. However, color images alone are not enough to provide the necessary semantic understanding of the scene. Consequently, the depth completion task suffers from sudden illumination changes in RGB images (e.g., shadows). In this paper, we propose a novel three-branch backbone comprising color-guided, semantic-guided, and depth-guided branches. Specifically, the color-guided branch takes a sparse depth map and RGB image as an input and generates color depth which includes color cues (e.g., object boundaries) of the scene. The predicted dense depth map of color-guided branch along-with semantic image and sparse depth map is passed as input to semantic-guided branch for estimating semantic depth. The depth-guided branch takes sparse, color, and semantic depths to generate the dense depth map. The color depth, semantic depth, and guided depth are adaptively fused to produce the output of our proposed three-branch backbone. In addition, we also propose to apply semantic-aware multi-modal attention-based fusion block (SAMMAFB) to fuse features between all three branches. We further use CSPN++ with Atrous convolutions to refine the dense depth map produced by our three-branch backbone. Extensive experiments show that our model achieves state-of-the-art performance in the KITTI depth completion benchmark at the time of submission.
first_indexed 2024-04-11T07:38:25Z
format Article
id doaj.art-4cdc388f284b4efb88c90b9fb2cd5870
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-11T07:38:25Z
publishDate 2022-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-4cdc388f284b4efb88c90b9fb2cd58702022-12-22T04:36:39ZengIEEEIEEE Access2169-35362022-01-011012078112079110.1109/ACCESS.2022.32143169918022SemAttNet: Toward Attention-Based Semantic Aware Guided Depth CompletionDanish Nazir0https://orcid.org/0000-0001-6364-8427Alain Pagani1Marcus Liwicki2https://orcid.org/0000-0003-4029-6574Didier Stricker3Muhammad Zeshan Afzal4https://orcid.org/0000-0002-0536-6867Deutsches Forschungszentrum für Künstliche Intelligenz (DFKI), Kaiserslautern, GermanyDeutsches Forschungszentrum für Künstliche Intelligenz (DFKI), Kaiserslautern, GermanyDepartment of Computer Science, Luleå University of Technology, Luleå, SwedenDeutsches Forschungszentrum für Künstliche Intelligenz (DFKI), Kaiserslautern, GermanyDeutsches Forschungszentrum für Künstliche Intelligenz (DFKI), Kaiserslautern, GermanyDepth completion involves recovering a dense depth map from a sparse map and an RGB image. Recent approaches focus on utilizing color images as guidance images to recover depth at invalid pixels. However, color images alone are not enough to provide the necessary semantic understanding of the scene. Consequently, the depth completion task suffers from sudden illumination changes in RGB images (e.g., shadows). In this paper, we propose a novel three-branch backbone comprising color-guided, semantic-guided, and depth-guided branches. Specifically, the color-guided branch takes a sparse depth map and RGB image as an input and generates color depth which includes color cues (e.g., object boundaries) of the scene. The predicted dense depth map of color-guided branch along-with semantic image and sparse depth map is passed as input to semantic-guided branch for estimating semantic depth. The depth-guided branch takes sparse, color, and semantic depths to generate the dense depth map. The color depth, semantic depth, and guided depth are adaptively fused to produce the output of our proposed three-branch backbone. In addition, we also propose to apply semantic-aware multi-modal attention-based fusion block (SAMMAFB) to fuse features between all three branches. We further use CSPN++ with Atrous convolutions to refine the dense depth map produced by our three-branch backbone. Extensive experiments show that our model achieves state-of-the-art performance in the KITTI depth completion benchmark at the time of submission.https://ieeexplore.ieee.org/document/9918022/State-of-the-art depth completion approach on KITTI depth completion benchmarkattention-based fusion for depth completionsemantic-guided depth completion
spellingShingle Danish Nazir
Alain Pagani
Marcus Liwicki
Didier Stricker
Muhammad Zeshan Afzal
SemAttNet: Toward Attention-Based Semantic Aware Guided Depth Completion
IEEE Access
State-of-the-art depth completion approach on KITTI depth completion benchmark
attention-based fusion for depth completion
semantic-guided depth completion
title SemAttNet: Toward Attention-Based Semantic Aware Guided Depth Completion
title_full SemAttNet: Toward Attention-Based Semantic Aware Guided Depth Completion
title_fullStr SemAttNet: Toward Attention-Based Semantic Aware Guided Depth Completion
title_full_unstemmed SemAttNet: Toward Attention-Based Semantic Aware Guided Depth Completion
title_short SemAttNet: Toward Attention-Based Semantic Aware Guided Depth Completion
title_sort semattnet toward attention based semantic aware guided depth completion
topic State-of-the-art depth completion approach on KITTI depth completion benchmark
attention-based fusion for depth completion
semantic-guided depth completion
url https://ieeexplore.ieee.org/document/9918022/
work_keys_str_mv AT danishnazir semattnettowardattentionbasedsemanticawareguideddepthcompletion
AT alainpagani semattnettowardattentionbasedsemanticawareguideddepthcompletion
AT marcusliwicki semattnettowardattentionbasedsemanticawareguideddepthcompletion
AT didierstricker semattnettowardattentionbasedsemanticawareguideddepthcompletion
AT muhammadzeshanafzal semattnettowardattentionbasedsemanticawareguideddepthcompletion