Unsupervised detection of contrast enhanced highlight landmarks

Abstract In the field of landmark detection based on deep learning, most of the research utilise convolutional neural networks to represent landmarks, and rarely adopt Transformer to represent and encode landmarks. Meanwhile, many works focus on modifying the network structure to improve network per...

Full description

Bibliographic Details
Main Authors: Tao Wu, Wenzhuo Fan, Shuxian Li, Qingqing Li, Jianlin Zhang, Meihui Li
Format: Article
Language:English
Published: Wiley 2023-10-01
Series:IET Computer Vision
Subjects:
Online Access:https://doi.org/10.1049/cvi2.12197
_version_ 1797663021031489536
author Tao Wu
Wenzhuo Fan
Shuxian Li
Qingqing Li
Jianlin Zhang
Meihui Li
author_facet Tao Wu
Wenzhuo Fan
Shuxian Li
Qingqing Li
Jianlin Zhang
Meihui Li
author_sort Tao Wu
collection DOAJ
description Abstract In the field of landmark detection based on deep learning, most of the research utilise convolutional neural networks to represent landmarks, and rarely adopt Transformer to represent and encode landmarks. Meanwhile, many works focus on modifying the network structure to improve network performance, and there is little research on the distribution of landmarks. In this article,the authors propose an unsupervised model to extract landmarks of objects in images. First, Transformer structure is combined with the convolutional neural network structure to represent and encode the landmarks; next, positive and negative sample pairs between landmarks are constructed, so that the semantically consistent landmarks on the image are pulled closer in the feature space and the semantically inconsistent landmarks are pushed farther in the feature space; then the authors concentrate their attention on the most active points to distinguish the landmarks of an object from the background; finally, based on the new contrastive loss, the network reconstructs the image by the landmarks of the object that are continuously learnt during training. Experiments show that the proposed model achieves better performance than other unsupervised methods on the CelebA, Annotated Facial Landmarks in the Wild, 300W datasets.
first_indexed 2024-03-11T19:08:32Z
format Article
id doaj.art-bf87f964cea94dc08ae3ac211e91296e
institution Directory Open Access Journal
issn 1751-9632
1751-9640
language English
last_indexed 2024-03-11T19:08:32Z
publishDate 2023-10-01
publisher Wiley
record_format Article
series IET Computer Vision
spelling doaj.art-bf87f964cea94dc08ae3ac211e91296e2023-10-10T04:15:41ZengWileyIET Computer Vision1751-96321751-96402023-10-0117780481310.1049/cvi2.12197Unsupervised detection of contrast enhanced highlight landmarksTao Wu0Wenzhuo Fan1Shuxian Li2Qingqing Li3Jianlin Zhang4Meihui Li5Key Laboratory of Optical Engineering Chinese Academy of Sciences Chengdu ChinaSchool of Electronic, Electrical and Communication Engineering University of Chinese Academy of Sciences Beijing ChinaKey Laboratory of Optical Engineering Chinese Academy of Sciences Chengdu ChinaInstitute of Optics and Electronics Chinese Academy of Sciences Chengdu ChinaInstitute of Optics and Electronics Chinese Academy of Sciences Chengdu ChinaInstitute of Optics and Electronics Chinese Academy of Sciences Chengdu ChinaAbstract In the field of landmark detection based on deep learning, most of the research utilise convolutional neural networks to represent landmarks, and rarely adopt Transformer to represent and encode landmarks. Meanwhile, many works focus on modifying the network structure to improve network performance, and there is little research on the distribution of landmarks. In this article,the authors propose an unsupervised model to extract landmarks of objects in images. First, Transformer structure is combined with the convolutional neural network structure to represent and encode the landmarks; next, positive and negative sample pairs between landmarks are constructed, so that the semantically consistent landmarks on the image are pulled closer in the feature space and the semantically inconsistent landmarks are pushed farther in the feature space; then the authors concentrate their attention on the most active points to distinguish the landmarks of an object from the background; finally, based on the new contrastive loss, the network reconstructs the image by the landmarks of the object that are continuously learnt during training. Experiments show that the proposed model achieves better performance than other unsupervised methods on the CelebA, Annotated Facial Landmarks in the Wild, 300W datasets.https://doi.org/10.1049/cvi2.12197computer visionimage processingunsupervised learning
spellingShingle Tao Wu
Wenzhuo Fan
Shuxian Li
Qingqing Li
Jianlin Zhang
Meihui Li
Unsupervised detection of contrast enhanced highlight landmarks
IET Computer Vision
computer vision
image processing
unsupervised learning
title Unsupervised detection of contrast enhanced highlight landmarks
title_full Unsupervised detection of contrast enhanced highlight landmarks
title_fullStr Unsupervised detection of contrast enhanced highlight landmarks
title_full_unstemmed Unsupervised detection of contrast enhanced highlight landmarks
title_short Unsupervised detection of contrast enhanced highlight landmarks
title_sort unsupervised detection of contrast enhanced highlight landmarks
topic computer vision
image processing
unsupervised learning
url https://doi.org/10.1049/cvi2.12197
work_keys_str_mv AT taowu unsuperviseddetectionofcontrastenhancedhighlightlandmarks
AT wenzhuofan unsuperviseddetectionofcontrastenhancedhighlightlandmarks
AT shuxianli unsuperviseddetectionofcontrastenhancedhighlightlandmarks
AT qingqingli unsuperviseddetectionofcontrastenhancedhighlightlandmarks
AT jianlinzhang unsuperviseddetectionofcontrastenhancedhighlightlandmarks
AT meihuili unsuperviseddetectionofcontrastenhancedhighlightlandmarks