Places: A 10 Million Image Database for Scene Recognition

The rise of multi-million-item dataset initiatives has enabled data-hungry machine learning algorithms to reach near-human semantic classification performance at tasks such as visual object and scene recognition. Here we describe the Places Database, a repository of 10 million scene photographs, lab...

全面介绍

书目详细资料
Main Authors: Zhou, Bolei, Lapedriza Garcia, Agata, Khosla, Aditya, Oliva, Aude, Torralba, Antonio
其他作者: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
格式: 文件
语言:English
出版: Institute of Electrical and Electronics Engineers 2019
在线阅读:https://hdl.handle.net/1721.1/122983
实物特征
总结:The rise of multi-million-item dataset initiatives has enabled data-hungry machine learning algorithms to reach near-human semantic classification performance at tasks such as visual object and scene recognition. Here we describe the Places Database, a repository of 10 million scene photographs, labeled with scene semantic categories, comprising a large and diverse list of the types of environments encountered in the world. Using the state-of-the-art Convolutional Neural Networks (CNNs), we provide scene classification CNNs (Places-CNNs) as baselines, that significantly outperform the previous approaches. Visualization of the CNNs trained on Places shows that object detectors emerge as an intermediate representation of scene classification. With its high-coverage and high-diversity of exemplars, the Places Database along with the Places-CNNs offer a novel resource to guide future progress on scene recognition problems. Keywords: Scene classification; visual recognition; deep learning; deep feature; image dataset