Cross-Attention-Based Reflection-Aware 6D Pose Estimation Network for Non-Lambertian Objects from RGB Images

Six-dimensional pose estimation for non-Lambertian objects, such as metal parts, is essential in intelligent manufacturing. Current methods pay much less attention to the influence of the surface reflection problem in 6D pose estimation. In this paper, we propose a cross-attention-based reflection-a...

Full description

Bibliographic Details
Main Authors: Chenrui Wu, Long Chen, Shiqing Wu
Format: Article
Language:English
Published: MDPI AG 2022-11-01
Series:Machines
Subjects:
Online Access:https://www.mdpi.com/2075-1702/10/12/1107
_version_ 1797456656830824448
author Chenrui Wu
Long Chen
Shiqing Wu
author_facet Chenrui Wu
Long Chen
Shiqing Wu
author_sort Chenrui Wu
collection DOAJ
description Six-dimensional pose estimation for non-Lambertian objects, such as metal parts, is essential in intelligent manufacturing. Current methods pay much less attention to the influence of the surface reflection problem in 6D pose estimation. In this paper, we propose a cross-attention-based reflection-aware 6D pose estimation network (CAR6D) for solving the surface reflection problem in 6D pose estimation. We use a pseudo-Siamese network structure to extract features from both an RGB image and a 3D model. The cross-attention layers are designed as a bi-directional filter for each of the inputs (the RGB image and 3D model) to focus on calculating the correspondences of the objects. The network is trained to segment the reflection area from the object area. Training images with ground-truth labels of the reflection area are generated with a physical-based rendering method. The experimental results on a 6D dataset of metal parts demonstrate the superiority of CAR6D in comparison with other state-of-the-art models.
first_indexed 2024-03-09T16:10:58Z
format Article
id doaj.art-65c6b00ce23b4de78def2f599a385a8c
institution Directory Open Access Journal
issn 2075-1702
language English
last_indexed 2024-03-09T16:10:58Z
publishDate 2022-11-01
publisher MDPI AG
record_format Article
series Machines
spelling doaj.art-65c6b00ce23b4de78def2f599a385a8c2023-11-24T16:15:44ZengMDPI AGMachines2075-17022022-11-011012110710.3390/machines10121107Cross-Attention-Based Reflection-Aware 6D Pose Estimation Network for Non-Lambertian Objects from RGB ImagesChenrui Wu0Long Chen1Shiqing Wu2College of Mechanical Engineering, University of Shanghai for Science and Technology, Shanghai 200093, ChinaCollege of Mechanical Engineering, University of Shanghai for Science and Technology, Shanghai 200093, ChinaCollege of Mechanical Engineering, University of Shanghai for Science and Technology, Shanghai 200093, ChinaSix-dimensional pose estimation for non-Lambertian objects, such as metal parts, is essential in intelligent manufacturing. Current methods pay much less attention to the influence of the surface reflection problem in 6D pose estimation. In this paper, we propose a cross-attention-based reflection-aware 6D pose estimation network (CAR6D) for solving the surface reflection problem in 6D pose estimation. We use a pseudo-Siamese network structure to extract features from both an RGB image and a 3D model. The cross-attention layers are designed as a bi-directional filter for each of the inputs (the RGB image and 3D model) to focus on calculating the correspondences of the objects. The network is trained to segment the reflection area from the object area. Training images with ground-truth labels of the reflection area are generated with a physical-based rendering method. The experimental results on a 6D dataset of metal parts demonstrate the superiority of CAR6D in comparison with other state-of-the-art models.https://www.mdpi.com/2075-1702/10/12/11076D pose estimationnon-Lambertian objectsphysical-based renderingdense matchingPnP
spellingShingle Chenrui Wu
Long Chen
Shiqing Wu
Cross-Attention-Based Reflection-Aware 6D Pose Estimation Network for Non-Lambertian Objects from RGB Images
Machines
6D pose estimation
non-Lambertian objects
physical-based rendering
dense matching
PnP
title Cross-Attention-Based Reflection-Aware 6D Pose Estimation Network for Non-Lambertian Objects from RGB Images
title_full Cross-Attention-Based Reflection-Aware 6D Pose Estimation Network for Non-Lambertian Objects from RGB Images
title_fullStr Cross-Attention-Based Reflection-Aware 6D Pose Estimation Network for Non-Lambertian Objects from RGB Images
title_full_unstemmed Cross-Attention-Based Reflection-Aware 6D Pose Estimation Network for Non-Lambertian Objects from RGB Images
title_short Cross-Attention-Based Reflection-Aware 6D Pose Estimation Network for Non-Lambertian Objects from RGB Images
title_sort cross attention based reflection aware 6d pose estimation network for non lambertian objects from rgb images
topic 6D pose estimation
non-Lambertian objects
physical-based rendering
dense matching
PnP
url https://www.mdpi.com/2075-1702/10/12/1107
work_keys_str_mv AT chenruiwu crossattentionbasedreflectionaware6dposeestimationnetworkfornonlambertianobjectsfromrgbimages
AT longchen crossattentionbasedreflectionaware6dposeestimationnetworkfornonlambertianobjectsfromrgbimages
AT shiqingwu crossattentionbasedreflectionaware6dposeestimationnetworkfornonlambertianobjectsfromrgbimages