Pix3D: Dataset and Methods for Single-Image 3D Shape Modeling

© 2018 IEEE. We study 3D shape modeling from a single image and make contributions to it in three aspects. First, we present Pix3D, a large-scale benchmark of diverse image-shape pairs with pixel-level 2D-3D alignment. Pix3D has wide applications in shape-related tasks including reconstruction, retr...

Full description

Bibliographic Details
Main Authors: Sun, Xingyuan, Wu, Jiajun, Zhang, Xiuming, Zhang, Zhoutong, Zhang, Chengkai, Xue, Tianfan, Tenenbaum, Joshua B, Freeman, William T
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Format: Article
Language:English
Published: 2022
Online Access:https://hdl.handle.net/1721.1/132170.2
_version_ 1811082015453216768
author Sun, Xingyuan
Wu, Jiajun
Zhang, Xiuming
Zhang, Zhoutong
Zhang, Chengkai
Xue, Tianfan
Tenenbaum, Joshua B
Freeman, William T
author2 Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
author_facet Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Sun, Xingyuan
Wu, Jiajun
Zhang, Xiuming
Zhang, Zhoutong
Zhang, Chengkai
Xue, Tianfan
Tenenbaum, Joshua B
Freeman, William T
author_sort Sun, Xingyuan
collection MIT
description © 2018 IEEE. We study 3D shape modeling from a single image and make contributions to it in three aspects. First, we present Pix3D, a large-scale benchmark of diverse image-shape pairs with pixel-level 2D-3D alignment. Pix3D has wide applications in shape-related tasks including reconstruction, retrieval, viewpoint estimation, etc. Building such a large-scale dataset, however, is highly challenging; existing datasets either contain only synthetic data, or lack precise alignment between 2D images and 3D shapes, or only have a small number of images. Second, we calibrate the evaluation criteria for 3D shape reconstruction through behavioral studies, and use them to objectively and systematically benchmark cutting-edge reconstruction algorithms on Pix3D. Third, we design a novel model that simultaneously performs 3D reconstruction and pose estimation; our multi-task learning approach achieves state-of-the-art performance on both tasks.
first_indexed 2024-09-23T11:56:02Z
format Article
id mit-1721.1/132170.2
institution Massachusetts Institute of Technology
language English
last_indexed 2024-09-23T11:56:02Z
publishDate 2022
record_format dspace
spelling mit-1721.1/132170.22022-08-04T18:56:32Z Pix3D: Dataset and Methods for Single-Image 3D Shape Modeling Sun, Xingyuan Wu, Jiajun Zhang, Xiuming Zhang, Zhoutong Zhang, Chengkai Xue, Tianfan Tenenbaum, Joshua B Freeman, William T Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences © 2018 IEEE. We study 3D shape modeling from a single image and make contributions to it in three aspects. First, we present Pix3D, a large-scale benchmark of diverse image-shape pairs with pixel-level 2D-3D alignment. Pix3D has wide applications in shape-related tasks including reconstruction, retrieval, viewpoint estimation, etc. Building such a large-scale dataset, however, is highly challenging; existing datasets either contain only synthetic data, or lack precise alignment between 2D images and 3D shapes, or only have a small number of images. Second, we calibrate the evaluation criteria for 3D shape reconstruction through behavioral studies, and use them to objectively and systematically benchmark cutting-edge reconstruction algorithms on Pix3D. Third, we design a novel model that simultaneously performs 3D reconstruction and pose estimation; our multi-task learning approach achieves state-of-the-art performance on both tasks. 2022-08-04T18:56:31Z 2021-09-20T18:21:14Z 2022-08-04T18:56:31Z 2018 2019-05-23T15:58:03Z Article http://purl.org/eprint/type/ConferencePaper https://hdl.handle.net/1721.1/132170.2 en 10.1109/CVPR.2018.00314 Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/ application/octet-stream MIT web domain
spellingShingle Sun, Xingyuan
Wu, Jiajun
Zhang, Xiuming
Zhang, Zhoutong
Zhang, Chengkai
Xue, Tianfan
Tenenbaum, Joshua B
Freeman, William T
Pix3D: Dataset and Methods for Single-Image 3D Shape Modeling
title Pix3D: Dataset and Methods for Single-Image 3D Shape Modeling
title_full Pix3D: Dataset and Methods for Single-Image 3D Shape Modeling
title_fullStr Pix3D: Dataset and Methods for Single-Image 3D Shape Modeling
title_full_unstemmed Pix3D: Dataset and Methods for Single-Image 3D Shape Modeling
title_short Pix3D: Dataset and Methods for Single-Image 3D Shape Modeling
title_sort pix3d dataset and methods for single image 3d shape modeling
url https://hdl.handle.net/1721.1/132170.2
work_keys_str_mv AT sunxingyuan pix3ddatasetandmethodsforsingleimage3dshapemodeling
AT wujiajun pix3ddatasetandmethodsforsingleimage3dshapemodeling
AT zhangxiuming pix3ddatasetandmethodsforsingleimage3dshapemodeling
AT zhangzhoutong pix3ddatasetandmethodsforsingleimage3dshapemodeling
AT zhangchengkai pix3ddatasetandmethodsforsingleimage3dshapemodeling
AT xuetianfan pix3ddatasetandmethodsforsingleimage3dshapemodeling
AT tenenbaumjoshuab pix3ddatasetandmethodsforsingleimage3dshapemodeling
AT freemanwilliamt pix3ddatasetandmethodsforsingleimage3dshapemodeling