Global and Local Structure Network for Image Classification

Principal component analysis network (PCANet) is a feature learning algorithm that is widely used in face recognition and object classification. However, original PCANet still has some shortages. One is that the principal component analysis (PCA) algorithm only extracts features by considering the g...

Full description

Bibliographic Details
Main Authors: Jinping Wang, Ruisheng Ran, Bin Fang
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10076452/
Description
Summary:Principal component analysis network (PCANet) is a feature learning algorithm that is widely used in face recognition and object classification. However, original PCANet still has some shortages. One is that the principal component analysis (PCA) algorithm only extracts features by considering the global structure. The other lies in that the original PCANet only employs one particular single layer convolutional results, which loses the information of other convolutional layers. In this paper, we propose a new simple and efficient convolutional neural network called global and local structure network (GLSNet) to address the problems. The network extracts the features both from the global structure and the local structure of the original data space. Specifically, a principal component analysis (PCA) convolutional layer which learns the filters by PCA algorithm is used to remove the noises and redundant information at the first stage. Then at the second stage, another PCA convolution is added to extract features by considering the global structure. As for the local structure, we use the neighborhood preserving embedding (NPE) algorithm to learn the convolutional filters. At the output stage, the global structure feature extracted by PCA convolution and the local structure feature extracted by NPE convolution is concatenated as a united feature. Furthermore, the first layer convolutional feature is also taken into consideration to obtain shallow-level information. Finally, these features are concatenated as a united feature, and a spatial pyramid pooling layer is followed to pool above the united features. To test the effectiveness of the proposed algorithm, the experiments on some image datasets, including three types: human face dataset, object dataset, and handprinted dataset, proceeded. And it performs better than the original PCANet and some improvement algorithms of PCANet, such as PLDANet, and MMPCANet.
ISSN:2169-3536