Structure-aware multimodal feature fusion for RGB-D scene classification and beyond
While convolutional neural networks (CNNs) have been excellent for object recognition, the greater spatial variability in scene images typically means that the standard full-image CNN features are suboptimal for scene classification. In this article, we investigate a framework allowing greater spati...
Main Authors: | Wang, Anran, Cai, Jianfei, Lu, Jiwen, Cham, Tat-Jen |
---|---|
Other Authors: | School of Computer Science and Engineering |
Format: | Journal Article |
Language: | English |
Published: | 2020 |
Subjects: | |
Online Access: | https://hdl.handle.net/10356/138263 |
Similar Items
- Towards robust and efficient multimodal representation learning and fusion
  by: Guo, Xiaobao
  Published: (2025)
- Fusing pairwise modalities for emotion recognition in conversations
  by: Fan, Chunxiao, et al.
  Published: (2024)
- Multimodal sentiment analysis using hierarchical fusion with context modeling
  by: Majumder, Navonil, et al.
  Published: (2020)
- Feature learning for RGB-D scene understanding
  by: Wang, Anran
  Published: (2016)
- Feature fusion with covariance matrix regularization in face recognition
  by: Lu, Ze, et al.
  Published: (2018)