Summary: | As smartphones and mobile data become universal in modern society, the opportunities to interact with the real world would grow tremendously. Latest Technologies such as Oculus Rift and Google Glass attempt to bridge the gap between the virtual and the material. With advancements in computing speed and image recognition, the idea of augmented reality (AR) becomes more tangible. However, the sheer complexity of image processing and feature recognition is an area of concern for AR. A successful AR system must distinguish among many landmarks and identify or classify the existence of new landmarks. AR algorithms naturally lend themselves to using deep learning because of the adaptability required to various factors. This paper aims to develop and refine a deep learning algorithm that can distinguish landmarks from images using a google landmark database of known landmarks. Instance-level recognition is universally used in areas of Landmark recognition and is also the upcoming research area. Instance-level recognition is the brain behind Landmark recognition. As in Landmarks, the goal is to seek an instance of a common group instead of a group, requiring new deep learning techniques. In this paper, three different VGG16, Inceptionv3, and ResNet50 models are trained using the transfer learning technique and a Pure Convolutional Neural Network (CNN) model is also trained from scratch. This paper proposes a modified version of the ResNet50 model to increase the accuracy and performance of the models used. The revised version of Resnet50 contains an additional Deep Local Features (DeLF) processing layer before generating the final output.
|