DeepMetaForge: A Deep Vision-Transformer Metadata-Fusion Network for Automatic Skin Lesion Classification

Skin cancer is a dangerous form of cancer that develops slowly in skin cells. Delays in diagnosing and treating these malignant skin conditions may have serious repercussions. Likewise, early skin cancer detection has been shown to improve treatment outcomes. This paper proposes DeepMetaForge, a dee...

Full description

Bibliographic Details
Main Authors: Sirawich Vachmanus, Thanapon Noraset, Waritsara Piyanonpong, Teerapong Rattananukrom, Suppawong Tuarob
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10366268/
_version_ 1827396116185874432
author Sirawich Vachmanus
Thanapon Noraset
Waritsara Piyanonpong
Teerapong Rattananukrom
Suppawong Tuarob
author_facet Sirawich Vachmanus
Thanapon Noraset
Waritsara Piyanonpong
Teerapong Rattananukrom
Suppawong Tuarob
author_sort Sirawich Vachmanus
collection DOAJ
description Skin cancer is a dangerous form of cancer that develops slowly in skin cells. Delays in diagnosing and treating these malignant skin conditions may have serious repercussions. Likewise, early skin cancer detection has been shown to improve treatment outcomes. This paper proposes DeepMetaForge, a deep-learning framework for skin cancer detection from metadata-accompanied images. The proposed framework utilizes BEiT, a vision transformer pre-trained as a masked image modeling task, as the image-encoding backbone. We further propose merging the encoded metadata with the derived visual characteristics while compressing the aggregate information simultaneously, simulating how photos with metadata are interpreted. The experiment results on four public datasets of dermoscopic and smartphone skin lesion images reveal that the best configuration of our proposed framework yields 87.1% macro-average F1 on average. The empirical scalability analysis further shows that the proposed framework can be implemented in a variety of machine-learning paradigms, including applications on low-resource devices and as services. The findings shed light on not only the possibility of implementing telemedicine solutions for skin cancer on a nationwide scale that could benefit those in need of quality healthcare but also open doors to many intelligent applications in medicine where images and metadata are collected together, such as disease detection from CT-scan images and patients’ expression recognition from facial images.
first_indexed 2024-03-08T18:46:07Z
format Article
id doaj.art-735aef0d6fd346b994569d35d850588f
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-03-08T18:46:07Z
publishDate 2023-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-735aef0d6fd346b994569d35d850588f2023-12-29T00:03:49ZengIEEEIEEE Access2169-35362023-01-011114546714548410.1109/ACCESS.2023.334522510366268DeepMetaForge: A Deep Vision-Transformer Metadata-Fusion Network for Automatic Skin Lesion ClassificationSirawich Vachmanus0https://orcid.org/0000-0002-6472-8426Thanapon Noraset1Waritsara Piyanonpong2Teerapong Rattananukrom3Suppawong Tuarob4https://orcid.org/0000-0002-5201-5699Faculty of Information and Communication Technology, Mahidol University, Nakhon Pathom, ThailandFaculty of Information and Communication Technology, Mahidol University, Nakhon Pathom, ThailandDivision of Dermatology, Faculty of Medicine, Ramathibodi Hospital, Mahidol University, Bangkok, ThailandDivision of Dermatology, Faculty of Medicine, Ramathibodi Hospital, Mahidol University, Bangkok, ThailandFaculty of Information and Communication Technology, Mahidol University, Nakhon Pathom, ThailandSkin cancer is a dangerous form of cancer that develops slowly in skin cells. Delays in diagnosing and treating these malignant skin conditions may have serious repercussions. Likewise, early skin cancer detection has been shown to improve treatment outcomes. This paper proposes DeepMetaForge, a deep-learning framework for skin cancer detection from metadata-accompanied images. The proposed framework utilizes BEiT, a vision transformer pre-trained as a masked image modeling task, as the image-encoding backbone. We further propose merging the encoded metadata with the derived visual characteristics while compressing the aggregate information simultaneously, simulating how photos with metadata are interpreted. The experiment results on four public datasets of dermoscopic and smartphone skin lesion images reveal that the best configuration of our proposed framework yields 87.1% macro-average F1 on average. The empirical scalability analysis further shows that the proposed framework can be implemented in a variety of machine-learning paradigms, including applications on low-resource devices and as services. The findings shed light on not only the possibility of implementing telemedicine solutions for skin cancer on a nationwide scale that could benefit those in need of quality healthcare but also open doors to many intelligent applications in medicine where images and metadata are collected together, such as disease detection from CT-scan images and patients’ expression recognition from facial images.https://ieeexplore.ieee.org/document/10366268/Image-metadata fusiondeep learningskin lesion classification
spellingShingle Sirawich Vachmanus
Thanapon Noraset
Waritsara Piyanonpong
Teerapong Rattananukrom
Suppawong Tuarob
DeepMetaForge: A Deep Vision-Transformer Metadata-Fusion Network for Automatic Skin Lesion Classification
IEEE Access
Image-metadata fusion
deep learning
skin lesion classification
title DeepMetaForge: A Deep Vision-Transformer Metadata-Fusion Network for Automatic Skin Lesion Classification
title_full DeepMetaForge: A Deep Vision-Transformer Metadata-Fusion Network for Automatic Skin Lesion Classification
title_fullStr DeepMetaForge: A Deep Vision-Transformer Metadata-Fusion Network for Automatic Skin Lesion Classification
title_full_unstemmed DeepMetaForge: A Deep Vision-Transformer Metadata-Fusion Network for Automatic Skin Lesion Classification
title_short DeepMetaForge: A Deep Vision-Transformer Metadata-Fusion Network for Automatic Skin Lesion Classification
title_sort deepmetaforge a deep vision transformer metadata fusion network for automatic skin lesion classification
topic Image-metadata fusion
deep learning
skin lesion classification
url https://ieeexplore.ieee.org/document/10366268/
work_keys_str_mv AT sirawichvachmanus deepmetaforgeadeepvisiontransformermetadatafusionnetworkforautomaticskinlesionclassification
AT thanaponnoraset deepmetaforgeadeepvisiontransformermetadatafusionnetworkforautomaticskinlesionclassification
AT waritsarapiyanonpong deepmetaforgeadeepvisiontransformermetadatafusionnetworkforautomaticskinlesionclassification
AT teerapongrattananukrom deepmetaforgeadeepvisiontransformermetadatafusionnetworkforautomaticskinlesionclassification
AT suppawongtuarob deepmetaforgeadeepvisiontransformermetadatafusionnetworkforautomaticskinlesionclassification