REMOTE SENSING SCENE CLASSIFICATION USING MULTIPLE PYRAMID POOLING

Remote sensing image scene classification has gained remarkable attention, due to its versatile use in different applications like geospatial object detection, ground object information extraction, environment monitoring and etc. The scene not only contains the information of the ground objects, but...

Full description

Bibliographic Details
Main Authors: Y. Yao, H. Zhao, D. Huang, Q. Tan
Format: Article
Language:English
Published: Copernicus Publications 2019-09-01
Series:The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Online Access:https://www.int-arch-photogramm-remote-sens-spatial-inf-sci.net/XLII-2-W16/279/2019/isprs-archives-XLII-2-W16-279-2019.pdf
_version_ 1818681722098679808
author Y. Yao
Y. Yao
H. Zhao
H. Zhao
D. Huang
D. Huang
Q. Tan
Q. Tan
author_facet Y. Yao
Y. Yao
H. Zhao
H. Zhao
D. Huang
D. Huang
Q. Tan
Q. Tan
author_sort Y. Yao
collection DOAJ
description Remote sensing image scene classification has gained remarkable attention, due to its versatile use in different applications like geospatial object detection, ground object information extraction, environment monitoring and etc. The scene not only contains the information of the ground objects, but also includes the spatial relationship between the ground objects and the environment. With rapid growth of the amount of remote sensing image data, the need for automatic annotation methods for image scenes is more urgent. This paper proposes a new framework for high resolution remote sensing images scene classification based on convolutional neural network. To eliminate the requirement of fixed-size input image, multiple pyramid pooling strategy is equipped between convolutional layers and fully connected layers. Then, the fixed-size features generated by multiple pyramid pooling layer was extended to one-dimension fixed-length vector and fed into fully connected layers. Our method could generate a fixed-length representation regardless of image size, at the same time get higher classification accuracy. On UC-Merced and NWPU-RESISC45 datasets, our framework achieved satisfying accuracies, which is 93.24% and 88.62% respectively.
first_indexed 2024-12-17T10:07:28Z
format Article
id doaj.art-c5dd65f32ae14ba896ce83c654ac7af9
institution Directory Open Access Journal
issn 1682-1750
2194-9034
language English
last_indexed 2024-12-17T10:07:28Z
publishDate 2019-09-01
publisher Copernicus Publications
record_format Article
series The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
spelling doaj.art-c5dd65f32ae14ba896ce83c654ac7af92022-12-21T21:53:08ZengCopernicus PublicationsThe International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences1682-17502194-90342019-09-01XLII-2-W1627928410.5194/isprs-archives-XLII-2-W16-279-2019REMOTE SENSING SCENE CLASSIFICATION USING MULTIPLE PYRAMID POOLINGY. Yao0Y. Yao1H. Zhao2H. Zhao3D. Huang4D. Huang5Q. Tan6Q. Tan7Department of Civil Engineering, Tsinghua University, Beijing 10084, China3S Center, Tsinghua University, Beijing 10084, ChinaDepartment of Civil Engineering, Tsinghua University, Beijing 10084, China3S Center, Tsinghua University, Beijing 10084, ChinaDepartment of Civil Engineering, Tsinghua University, Beijing 10084, China3S Center, Tsinghua University, Beijing 10084, ChinaDepartment of Civil Engineering, Tsinghua University, Beijing 10084, China3S Center, Tsinghua University, Beijing 10084, ChinaRemote sensing image scene classification has gained remarkable attention, due to its versatile use in different applications like geospatial object detection, ground object information extraction, environment monitoring and etc. The scene not only contains the information of the ground objects, but also includes the spatial relationship between the ground objects and the environment. With rapid growth of the amount of remote sensing image data, the need for automatic annotation methods for image scenes is more urgent. This paper proposes a new framework for high resolution remote sensing images scene classification based on convolutional neural network. To eliminate the requirement of fixed-size input image, multiple pyramid pooling strategy is equipped between convolutional layers and fully connected layers. Then, the fixed-size features generated by multiple pyramid pooling layer was extended to one-dimension fixed-length vector and fed into fully connected layers. Our method could generate a fixed-length representation regardless of image size, at the same time get higher classification accuracy. On UC-Merced and NWPU-RESISC45 datasets, our framework achieved satisfying accuracies, which is 93.24% and 88.62% respectively.https://www.int-arch-photogramm-remote-sens-spatial-inf-sci.net/XLII-2-W16/279/2019/isprs-archives-XLII-2-W16-279-2019.pdf
spellingShingle Y. Yao
Y. Yao
H. Zhao
H. Zhao
D. Huang
D. Huang
Q. Tan
Q. Tan
REMOTE SENSING SCENE CLASSIFICATION USING MULTIPLE PYRAMID POOLING
The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
title REMOTE SENSING SCENE CLASSIFICATION USING MULTIPLE PYRAMID POOLING
title_full REMOTE SENSING SCENE CLASSIFICATION USING MULTIPLE PYRAMID POOLING
title_fullStr REMOTE SENSING SCENE CLASSIFICATION USING MULTIPLE PYRAMID POOLING
title_full_unstemmed REMOTE SENSING SCENE CLASSIFICATION USING MULTIPLE PYRAMID POOLING
title_short REMOTE SENSING SCENE CLASSIFICATION USING MULTIPLE PYRAMID POOLING
title_sort remote sensing scene classification using multiple pyramid pooling
url https://www.int-arch-photogramm-remote-sens-spatial-inf-sci.net/XLII-2-W16/279/2019/isprs-archives-XLII-2-W16-279-2019.pdf
work_keys_str_mv AT yyao remotesensingsceneclassificationusingmultiplepyramidpooling
AT yyao remotesensingsceneclassificationusingmultiplepyramidpooling
AT hzhao remotesensingsceneclassificationusingmultiplepyramidpooling
AT hzhao remotesensingsceneclassificationusingmultiplepyramidpooling
AT dhuang remotesensingsceneclassificationusingmultiplepyramidpooling
AT dhuang remotesensingsceneclassificationusingmultiplepyramidpooling
AT qtan remotesensingsceneclassificationusingmultiplepyramidpooling
AT qtan remotesensingsceneclassificationusingmultiplepyramidpooling