Gradient-based 2D-to-3D Conversion for Soccer Videos

A wide spread adoption of 3D videos and technologies is hindered by the lack of high-quality 3D content. One promising solution to address this problem is to use automated 2D-to-3D conversion. However, current conversion methods, while general, produce low-quality results with artifacts that are not...

Full description

Bibliographic Details
Main Authors:	Calagari, Kiana, Elgharib, Mohamed, Didyk, Piotr, Kaspar, Alexandre, Matusik, Wojciech, Hefeeda, Mohamed
Other Authors:	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Format:	Article
Language:	en_US
Published:	Association for Computing Machinery (ACM) 2015
Online Access:	http://hdl.handle.net/1721.1/99743 https://orcid.org/0000-0002-6090-5392 https://orcid.org/0000-0003-0212-5643

_version_	1811074468393517056
author	Calagari, Kiana Elgharib, Mohamed Didyk, Piotr Kaspar, Alexandre Matusik, Wojciech Hefeeda, Mohamed
author2	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
author_facet	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory Calagari, Kiana Elgharib, Mohamed Didyk, Piotr Kaspar, Alexandre Matusik, Wojciech Hefeeda, Mohamed
author_sort	Calagari, Kiana
collection	MIT
description	A wide spread adoption of 3D videos and technologies is hindered by the lack of high-quality 3D content. One promising solution to address this problem is to use automated 2D-to-3D conversion. However, current conversion methods, while general, produce low-quality results with artifacts that are not acceptable to many viewers. We address this problem by showing how to construct a high-quality, domain-specific conversion method for soccer videos. We propose a novel, data-driven method that generates stereoscopic frames by transferring depth information from similar frames in a database of 3D stereoscopic videos. Creating a database of 3D stereoscopic videos with accurate depth is, however, very difficult. One of the key findings in this paper is showing that computer generated content in current sports computer games can be used to generate high-quality 3D video reference database for 2D-to-3D conversion methods. Once we retrieve similar 3D video frames, our technique transfers depth gradients to the target frame while respecting object boundaries. It then computes depth maps from the gradients, and generates the output stereoscopic video. We implement our method and validate it by conducting user-studies that evaluate depth perception and visual comfort of the converted 3D videos. We show that our method produces high-quality 3D videos that are almost indistinguishable from videos shot by stereo cameras. In addition, our method significantly outperforms the current state-of-the-art method. For example, up to 20% improvement in the perceived depth is achieved by our method, which translates to improving the mean opinion score from Good to Excellent.
first_indexed	2024-09-23T09:49:55Z
format	Article
id	mit-1721.1/99743
institution	Massachusetts Institute of Technology
language	en_US
last_indexed	2024-09-23T09:49:55Z
publishDate	2015
publisher	Association for Computing Machinery (ACM)
record_format	dspace
spelling	mit-1721.1/997432022-09-30T17:05:49Z Gradient-based 2D-to-3D Conversion for Soccer Videos Calagari, Kiana Elgharib, Mohamed Didyk, Piotr Kaspar, Alexandre Matusik, Wojciech Hefeeda, Mohamed Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Kaspar, Alexandre Matusik, Wojciech A wide spread adoption of 3D videos and technologies is hindered by the lack of high-quality 3D content. One promising solution to address this problem is to use automated 2D-to-3D conversion. However, current conversion methods, while general, produce low-quality results with artifacts that are not acceptable to many viewers. We address this problem by showing how to construct a high-quality, domain-specific conversion method for soccer videos. We propose a novel, data-driven method that generates stereoscopic frames by transferring depth information from similar frames in a database of 3D stereoscopic videos. Creating a database of 3D stereoscopic videos with accurate depth is, however, very difficult. One of the key findings in this paper is showing that computer generated content in current sports computer games can be used to generate high-quality 3D video reference database for 2D-to-3D conversion methods. Once we retrieve similar 3D video frames, our technique transfers depth gradients to the target frame while respecting object boundaries. It then computes depth maps from the gradients, and generates the output stereoscopic video. We implement our method and validate it by conducting user-studies that evaluate depth perception and visual comfort of the converted 3D videos. We show that our method produces high-quality 3D videos that are almost indistinguishable from videos shot by stereo cameras. In addition, our method significantly outperforms the current state-of-the-art method. For example, up to 20% improvement in the perceived depth is achieved by our method, which translates to improving the mean opinion score from Good to Excellent. Qatar Computing Research Institute-CSAIL Partnership National Science Foundation (U.S.) (Grant IIS-1111415) 2015-11-09T13:09:54Z 2015-11-09T13:09:54Z 2015-10 Article http://purl.org/eprint/type/ConferencePaper 978-1-4503-3459-4 http://hdl.handle.net/1721.1/99743 Calagari, Kiana, Mohamed Elgharib, Piotr Didyk, Alexandre Kaspar, Wojciech Matusik, and Mohamed Hefeeda. "Gradient-based 2D-to-3D Conversion for Soccer Videos." 23rd ACM International Conference on Multimedia (October 2015). https://orcid.org/0000-0002-6090-5392 https://orcid.org/0000-0003-0212-5643 en_US http://acmmm.hosting.acm.org/2015/wp-content/uploads/102617-ACM-MM15-d5web.pdf Proceedings of the 23rd ACM International Conference on Multimedia Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/ application/pdf Association for Computing Machinery (ACM) Kaspar
spellingShingle	Calagari, Kiana Elgharib, Mohamed Didyk, Piotr Kaspar, Alexandre Matusik, Wojciech Hefeeda, Mohamed Gradient-based 2D-to-3D Conversion for Soccer Videos
title	Gradient-based 2D-to-3D Conversion for Soccer Videos
title_full	Gradient-based 2D-to-3D Conversion for Soccer Videos
title_fullStr	Gradient-based 2D-to-3D Conversion for Soccer Videos
title_full_unstemmed	Gradient-based 2D-to-3D Conversion for Soccer Videos
title_short	Gradient-based 2D-to-3D Conversion for Soccer Videos
title_sort	gradient based 2d to 3d conversion for soccer videos
url	http://hdl.handle.net/1721.1/99743 https://orcid.org/0000-0002-6090-5392 https://orcid.org/0000-0003-0212-5643
work_keys_str_mv	AT calagarikiana gradientbased2dto3dconversionforsoccervideos AT elgharibmohamed gradientbased2dto3dconversionforsoccervideos AT didykpiotr gradientbased2dto3dconversionforsoccervideos AT kasparalexandre gradientbased2dto3dconversionforsoccervideos AT matusikwojciech gradientbased2dto3dconversionforsoccervideos AT hefeedamohamed gradientbased2dto3dconversionforsoccervideos

Gradient-based 2D-to-3D Conversion for Soccer Videos

Similar Items