Large scale tattoo localization

This paper examines the use of several popular object detection frameworks, namely Fast-RCNN, Faster-RCNN, and the more recent real-time object detection system, YOLO. The data utilized in this paper was collected from Flickr to more accurately represent images that could be found in the electronic...

Full description

Bibliographic Details
Main Author: Ng, Jing Nee
Other Authors: Kong Wai-Kin Adams
Format: Final Year Project (FYP)
Language:English
Published: 2017
Subjects:
Online Access:http://hdl.handle.net/10356/70231
Description
Summary:This paper examines the use of several popular object detection frameworks, namely Fast-RCNN, Faster-RCNN, and the more recent real-time object detection system, YOLO. The data utilized in this paper was collected from Flickr to more accurately represent images that could be found in the electronic devices of potential suspects. A total of 90,000 images were used, and split into 4 experiments of 10,000, 20,000, 40,000, and 90,000 images. The VGG_CNN_M_1024 model achieved average precisions (AP)1 of 51.02% and 61.03% for both Fast-RCNN and Faster-RCNN respectively. The PVANet model achieved an AP of 69.15% on Faster-RCNN. Lastly, the YOLO model achieved an AP of 60.60%. All the best APs for each model were attained on the largest dataset, Flickr90k. The trained models were then tested on the NIST database of 2,212 images from the tattoo similarity use case (original, uncropped version), achieving an AP of 97.34% using the PVANet model trained on Flickr90k. Another set of 3,847 images were acquired from NIST’s background tattoo images (original, uncropped version). This set of images achieved an AP of 85.07%.