RFCT: Multimodal Sensing Enhances Grasping State Detection for Weak-Stiffness Targets
Accurate grasping state detection is critical to the dexterous operation of robots. Robots must use multiple modalities to perceive external information, similar to humans. The direct fusion method of visual and tactile sensing may not provide effective visual–tactile features for the grasping state detection network of the target. To address this issue, we present a novel visual–tactile fusion model (i.e., RFCT) and provide an incremental dimensional tensor product method for detecting grasping states of weak-stiffness targets. We investigate whether convolutional block attention mechanisms (CBAM) can enhance feature representations by selectively attending to salient visual and tactile cues while suppressing less important information and eliminating redundant information for the initial fusion. We conducted 2250 grasping experiments using 15 weak-stiffness targets. We used 12 targets for training and three for testing. When evaluated on untrained targets, our RFCT model achieved a precision of 82.89%, a recall rate of 82.07%, and an F1 score of 81.65%. We compared RFCT model performance with various combinations of Resnet50 + LSTM and C3D models commonly used in grasping state detection. The experimental results show that our RFCT model significantly outperforms these models. Our proposed method provides accurate grasping state detection and has the potential to provide robust support for robot grasping operations in real-world applications.
Main Authors: | Wenjun Ruan, Wenbo Zhu, Zhijia Zhao, Kai Wang, Qinghua Lu, Lufeng Luo, Wei-Chang Yeh |
---|---|
Affiliations: | School of Mechatronic Engineering and Automation, Foshan University, Foshan 528225, China (Ruan, Zhu, Wang, Lu, Luo); School of Mechanical and Electrical Engineering, Guangzhou University, Guangzhou 510006, China (Zhao); Department of Industrial Engineering and Engineering Management, National Tsing Hua University, Hsinchu 30013, Taiwan (Yeh) |
Format: | Article |
Language: | English |
Published: | MDPI AG, 2023-09-01 |
Series: | Mathematics, vol. 11, no. 18, article 3969 |
ISSN: | 2227-7390 |
DOI: | 10.3390/math11183969 |
Subjects: | visual–tactile fusion perception; target grasping state detection; grasping; multimodal perception |
Online Access: | https://www.mdpi.com/2227-7390/11/18/3969 |
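The abstract above describes fusing visual and tactile features through a tensor product and refining the fused representation with a convolutional block attention module (CBAM). The sketch below is a minimal illustration of that idea, not the authors' RFCT implementation: the embedding sizes, the outer-product fusion layout, the grasping-state labels, and the `VisualTactileFusion` module are assumptions made for demonstration, since the paper's "incremental dimensional tensor product" details are not given in this record.

```python
# Minimal sketch (assumed design, not the authors' RFCT code): outer-product
# fusion of visual and tactile embeddings, refined by a CBAM-style block.
import torch
import torch.nn as nn


class CBAM(nn.Module):
    """Convolutional Block Attention Module: channel attention, then spatial attention."""

    def __init__(self, channels: int, reduction: int = 16, kernel_size: int = 7):
        super().__init__()
        self.mlp = nn.Sequential(                          # shared MLP for channel attention
            nn.Linear(channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),
        )
        self.spatial = nn.Conv2d(2, 1, kernel_size, padding=kernel_size // 2)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, _, _ = x.shape
        avg = self.mlp(x.mean(dim=(2, 3)))                 # (b, c) from global average pooling
        mx = self.mlp(x.amax(dim=(2, 3)))                  # (b, c) from global max pooling
        x = x * torch.sigmoid(avg + mx).view(b, c, 1, 1)   # channel attention
        s = torch.cat([x.mean(dim=1, keepdim=True),
                       x.amax(dim=1, keepdim=True)], dim=1)  # (b, 2, h, w)
        return x * torch.sigmoid(self.spatial(s))          # spatial attention


class VisualTactileFusion(nn.Module):
    """Fuse visual and tactile feature vectors via an outer (tensor) product,
    then classify the grasping state. Dimensions and labels are hypothetical."""

    def __init__(self, vis_dim: int = 128, tac_dim: int = 128, channels: int = 32):
        super().__init__()
        self.lift = nn.Conv2d(1, channels, kernel_size=3, padding=1)
        self.cbam = CBAM(channels)
        self.head = nn.Linear(channels, 3)                 # e.g. slip / stable / damaged (assumed classes)

    def forward(self, vis: torch.Tensor, tac: torch.Tensor) -> torch.Tensor:
        # vis: (b, vis_dim), tac: (b, tac_dim) -> outer product (b, 1, vis_dim, tac_dim)
        fused = torch.einsum("bi,bj->bij", vis, tac).unsqueeze(1)
        fused = self.cbam(self.lift(fused))
        return self.head(fused.mean(dim=(2, 3)))           # global pooling + classifier


if __name__ == "__main__":
    vis = torch.randn(4, 128)                              # dummy visual embeddings
    tac = torch.randn(4, 128)                              # dummy tactile embeddings
    print(VisualTactileFusion()(vis, tac).shape)           # torch.Size([4, 3])
```

The outer product keeps every pairwise visual–tactile interaction as a 2D map, which is what makes channel and spatial attention applicable to the fused representation in this sketch.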