A Comprehensive Tensor Framework for the Clustering of Hyperspectral Paper Data With an Application to Forensic Document Analysis

In forensic document analysis, the authenticity of a document must be properly checked in the context of suspected forgery. Hyperspectral Imaging (HSI) is a non-invasive way of detecting fraudulent papers in a multipage document. The occurrence of a forged paper in a multi-page document may have a s...

Full description

Bibliographic Details
Main Authors: Jobin Francis, Baburaj Madathil, Sudhish N. George, Sony George
Format: Article
Language:English
Published: IEEE 2022-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9661320/
_version_ 1818744467500302336
author Jobin Francis
Baburaj Madathil
Sudhish N. George
Sony George
author_facet Jobin Francis
Baburaj Madathil
Sudhish N. George
Sony George
author_sort Jobin Francis
collection DOAJ
description In forensic document analysis, the authenticity of a document must be properly checked in the context of suspected forgery. Hyperspectral Imaging (HSI) is a non-invasive way of detecting fraudulent papers in a multipage document. The occurrence of a forged paper in a multi-page document may have a substantial difference from rest of the papers in its age, type, color, texture, and so on. Each pixel in an HSI data can be used as the material fingerprint for the spatial point it corresponds to. Hence, hyperspectral data of paper samples made of the same substance have similar characteristics and can be grouped into a single cluster. Similarly, paper samples made of different substances have different spectral properties. This paper relies on this heuristic and proposes a tensor based clustering framework for hyperspectral paper data, with an application to detect the forged papers in multi-page documents. Information embedded in the hyperspectral patches of the papers to be clustered is arranged into individual lateral slices of a third-order tensor in this framework. Further, this work employs the self-expressiveness property of submodules and an objective function is formulated to extract self-expressive representation tensor with low multirank and f-diagonal structure. Objective function of the proposed method incorporates <inline-formula> <tex-math notation="LaTeX">$l_{\frac {1}{2}}$ </tex-math></inline-formula>-induced Tensor Nuclear Norm (TNN) and <inline-formula> <tex-math notation="LaTeX">$l_{\frac {1}{2}}$ </tex-math></inline-formula> regularization to impart better low rankness and f-diagonal structure to the representation tensor. Experimental results of the proposed method were compared to the state-of-the-art subspace clustering approaches. The results demonstrate improved performance of the proposed method over the existing clustering algorithms.
first_indexed 2024-12-18T02:44:46Z
format Article
id doaj.art-9bed8a4b773d46f2a96c8d661826d93a
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-18T02:44:46Z
publishDate 2022-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-9bed8a4b773d46f2a96c8d661826d93a2022-12-21T21:23:34ZengIEEEIEEE Access2169-35362022-01-01106194620710.1109/ACCESS.2021.31378699661320A Comprehensive Tensor Framework for the Clustering of Hyperspectral Paper Data With an Application to Forensic Document AnalysisJobin Francis0Baburaj Madathil1https://orcid.org/0000-0003-3151-9270Sudhish N. George2https://orcid.org/0000-0002-0886-9478Sony George3https://orcid.org/0000-0001-8436-3164Department of Electronics and Communication Engineering, National Institute of Technology Calicut, Calicut, IndiaDepartment of Electronics and Instrumentation Engineering, Government Engineering College Kozhikode, Calicut, IndiaDepartment of Electronics and Communication Engineering, National Institute of Technology Calicut, Calicut, IndiaDepartment of Computer Science, Norwegian University of Science and Technology, Gj&#x00F8;vik, NorwayIn forensic document analysis, the authenticity of a document must be properly checked in the context of suspected forgery. Hyperspectral Imaging (HSI) is a non-invasive way of detecting fraudulent papers in a multipage document. The occurrence of a forged paper in a multi-page document may have a substantial difference from rest of the papers in its age, type, color, texture, and so on. Each pixel in an HSI data can be used as the material fingerprint for the spatial point it corresponds to. Hence, hyperspectral data of paper samples made of the same substance have similar characteristics and can be grouped into a single cluster. Similarly, paper samples made of different substances have different spectral properties. This paper relies on this heuristic and proposes a tensor based clustering framework for hyperspectral paper data, with an application to detect the forged papers in multi-page documents. Information embedded in the hyperspectral patches of the papers to be clustered is arranged into individual lateral slices of a third-order tensor in this framework. Further, this work employs the self-expressiveness property of submodules and an objective function is formulated to extract self-expressive representation tensor with low multirank and f-diagonal structure. Objective function of the proposed method incorporates <inline-formula> <tex-math notation="LaTeX">$l_{\frac {1}{2}}$ </tex-math></inline-formula>-induced Tensor Nuclear Norm (TNN) and <inline-formula> <tex-math notation="LaTeX">$l_{\frac {1}{2}}$ </tex-math></inline-formula> regularization to impart better low rankness and f-diagonal structure to the representation tensor. Experimental results of the proposed method were compared to the state-of-the-art subspace clustering approaches. The results demonstrate improved performance of the proposed method over the existing clustering algorithms.https://ieeexplore.ieee.org/document/9661320/Forensic document analysishyperspectral imaging (HSI)clusteringself-expressiveness property
spellingShingle Jobin Francis
Baburaj Madathil
Sudhish N. George
Sony George
A Comprehensive Tensor Framework for the Clustering of Hyperspectral Paper Data With an Application to Forensic Document Analysis
IEEE Access
Forensic document analysis
hyperspectral imaging (HSI)
clustering
self-expressiveness property
title A Comprehensive Tensor Framework for the Clustering of Hyperspectral Paper Data With an Application to Forensic Document Analysis
title_full A Comprehensive Tensor Framework for the Clustering of Hyperspectral Paper Data With an Application to Forensic Document Analysis
title_fullStr A Comprehensive Tensor Framework for the Clustering of Hyperspectral Paper Data With an Application to Forensic Document Analysis
title_full_unstemmed A Comprehensive Tensor Framework for the Clustering of Hyperspectral Paper Data With an Application to Forensic Document Analysis
title_short A Comprehensive Tensor Framework for the Clustering of Hyperspectral Paper Data With an Application to Forensic Document Analysis
title_sort comprehensive tensor framework for the clustering of hyperspectral paper data with an application to forensic document analysis
topic Forensic document analysis
hyperspectral imaging (HSI)
clustering
self-expressiveness property
url https://ieeexplore.ieee.org/document/9661320/
work_keys_str_mv AT jobinfrancis acomprehensivetensorframeworkfortheclusteringofhyperspectralpaperdatawithanapplicationtoforensicdocumentanalysis
AT baburajmadathil acomprehensivetensorframeworkfortheclusteringofhyperspectralpaperdatawithanapplicationtoforensicdocumentanalysis
AT sudhishngeorge acomprehensivetensorframeworkfortheclusteringofhyperspectralpaperdatawithanapplicationtoforensicdocumentanalysis
AT sonygeorge acomprehensivetensorframeworkfortheclusteringofhyperspectralpaperdatawithanapplicationtoforensicdocumentanalysis
AT jobinfrancis comprehensivetensorframeworkfortheclusteringofhyperspectralpaperdatawithanapplicationtoforensicdocumentanalysis
AT baburajmadathil comprehensivetensorframeworkfortheclusteringofhyperspectralpaperdatawithanapplicationtoforensicdocumentanalysis
AT sudhishngeorge comprehensivetensorframeworkfortheclusteringofhyperspectralpaperdatawithanapplicationtoforensicdocumentanalysis
AT sonygeorge comprehensivetensorframeworkfortheclusteringofhyperspectralpaperdatawithanapplicationtoforensicdocumentanalysis