Reunion helper: an edge matcher for sibling fragment identification of the Dunhuang manuscript

Abstract The Dunhuang ancient manuscripts are an excellent and precious cultural heritage of humanity. However, due to their age, the vast majority of these treasures are damaged and fragmented. Faced with a wide range of sources and numerous fragments, the process of restoration generally involves...

Full description

Bibliographic Details
Main Authors: Yutong Zheng, Xuelong Li, Yu Weng
Format: Article
Language:English
Published: SpringerOpen 2024-02-01
Series:Heritage Science
Subjects:
Online Access:https://doi.org/10.1186/s40494-024-01150-3
_version_ 1827326506072801280
author Yutong Zheng
Xuelong Li
Yu Weng
author_facet Yutong Zheng
Xuelong Li
Yu Weng
author_sort Yutong Zheng
collection DOAJ
description Abstract The Dunhuang ancient manuscripts are an excellent and precious cultural heritage of humanity. However, due to their age, the vast majority of these treasures are damaged and fragmented. Faced with a wide range of sources and numerous fragments, the process of restoration generally involves two core elements: sibling fragments identification and fragment assembly. Currently, fragment restoration still heavily relies on manual labor. During the long practice, a consensus has been reached on the importance of edge features for not only assembly but also for identification. However, accurate extraction of edge features and their use for efficient identification requires extensive knowledge and strong memory. This is a challenge for the human brain. So that in previous studies, fragment edge features have been used for assembly validation but rarely for identification. Therefore, an edge matcher is proposed, working like a bloodhound, capable of “sniffing out” specific “flavors” in edge features and performing efficient sibling fragment identification accordingly, providing guidance when experts perform entity assembly subsequently. Firstly, the fragmented images are standardized. Secondly, traditional methods are used to compress the representation of fragment edges and obtain paired local edge images. Finally, these images are fed into the edge matcher for classification discrimination, which is a CNN-based pairwise similarity metric model proposed in this paper, introducing residual blocks and depthwise separable convolutions, and adding multi-scale convolutional layers. With the edge matcher, a complex matching problem is successfully transformed into a simple classification problem. In the absence of a standard public dataset, a Dunhuang manuscript fragment edge dataset is constructed. Experiments are conducted on that dataset, and the accuracy, precision, recall, and F1 scores of the edge matcher all exceeded 97%. The effectiveness of the edge matcher is demonstrated by comparative experiments, and the rationality of the method design is verified by ablation experiments. The method combines traditional methods and deep learning methods to creatively use the edge geometric features of fragments for sibling fragment identification in a natural rather than coded way, making full use of the computer’s computational and memory capabilities. The edge matcher can significantly reduce the time and scope of searching, matching, and inferring fragments, and assist in the reconstruction of Dunhuang ancient manuscript fragments.
first_indexed 2024-03-07T14:46:34Z
format Article
id doaj.art-49b0cd56be004cb192d507ca21513a8f
institution Directory Open Access Journal
issn 2050-7445
language English
last_indexed 2024-03-07T14:46:34Z
publishDate 2024-02-01
publisher SpringerOpen
record_format Article
series Heritage Science
spelling doaj.art-49b0cd56be004cb192d507ca21513a8f2024-03-05T19:55:31ZengSpringerOpenHeritage Science2050-74452024-02-0112111510.1186/s40494-024-01150-3Reunion helper: an edge matcher for sibling fragment identification of the Dunhuang manuscriptYutong Zheng0Xuelong Li1Yu Weng2Key Laboratory of Ethnic Language Intelligent Analysis and Security Governance of MOE, Minzu University of ChinaKey Laboratory of Ethnic Language Intelligent Analysis and Security Governance of MOE, Minzu University of ChinaKey Laboratory of Ethnic Language Intelligent Analysis and Security Governance of MOE, Minzu University of ChinaAbstract The Dunhuang ancient manuscripts are an excellent and precious cultural heritage of humanity. However, due to their age, the vast majority of these treasures are damaged and fragmented. Faced with a wide range of sources and numerous fragments, the process of restoration generally involves two core elements: sibling fragments identification and fragment assembly. Currently, fragment restoration still heavily relies on manual labor. During the long practice, a consensus has been reached on the importance of edge features for not only assembly but also for identification. However, accurate extraction of edge features and their use for efficient identification requires extensive knowledge and strong memory. This is a challenge for the human brain. So that in previous studies, fragment edge features have been used for assembly validation but rarely for identification. Therefore, an edge matcher is proposed, working like a bloodhound, capable of “sniffing out” specific “flavors” in edge features and performing efficient sibling fragment identification accordingly, providing guidance when experts perform entity assembly subsequently. Firstly, the fragmented images are standardized. Secondly, traditional methods are used to compress the representation of fragment edges and obtain paired local edge images. Finally, these images are fed into the edge matcher for classification discrimination, which is a CNN-based pairwise similarity metric model proposed in this paper, introducing residual blocks and depthwise separable convolutions, and adding multi-scale convolutional layers. With the edge matcher, a complex matching problem is successfully transformed into a simple classification problem. In the absence of a standard public dataset, a Dunhuang manuscript fragment edge dataset is constructed. Experiments are conducted on that dataset, and the accuracy, precision, recall, and F1 scores of the edge matcher all exceeded 97%. The effectiveness of the edge matcher is demonstrated by comparative experiments, and the rationality of the method design is verified by ablation experiments. The method combines traditional methods and deep learning methods to creatively use the edge geometric features of fragments for sibling fragment identification in a natural rather than coded way, making full use of the computer’s computational and memory capabilities. The edge matcher can significantly reduce the time and scope of searching, matching, and inferring fragments, and assist in the reconstruction of Dunhuang ancient manuscript fragments.https://doi.org/10.1186/s40494-024-01150-3Dunhuang manuscriptsEdge featuresPairwise similarity metricConvolutional neural networks
spellingShingle Yutong Zheng
Xuelong Li
Yu Weng
Reunion helper: an edge matcher for sibling fragment identification of the Dunhuang manuscript
Heritage Science
Dunhuang manuscripts
Edge features
Pairwise similarity metric
Convolutional neural networks
title Reunion helper: an edge matcher for sibling fragment identification of the Dunhuang manuscript
title_full Reunion helper: an edge matcher for sibling fragment identification of the Dunhuang manuscript
title_fullStr Reunion helper: an edge matcher for sibling fragment identification of the Dunhuang manuscript
title_full_unstemmed Reunion helper: an edge matcher for sibling fragment identification of the Dunhuang manuscript
title_short Reunion helper: an edge matcher for sibling fragment identification of the Dunhuang manuscript
title_sort reunion helper an edge matcher for sibling fragment identification of the dunhuang manuscript
topic Dunhuang manuscripts
Edge features
Pairwise similarity metric
Convolutional neural networks
url https://doi.org/10.1186/s40494-024-01150-3
work_keys_str_mv AT yutongzheng reunionhelperanedgematcherforsiblingfragmentidentificationofthedunhuangmanuscript
AT xuelongli reunionhelperanedgematcherforsiblingfragmentidentificationofthedunhuangmanuscript
AT yuweng reunionhelperanedgematcherforsiblingfragmentidentificationofthedunhuangmanuscript