Gradient-based 2D-to-3D Conversion for Soccer Videos

A wide spread adoption of 3D videos and technologies is hindered by the lack of high-quality 3D content. One promising solution to address this problem is to use automated 2D-to-3D conversion. However, current conversion methods, while general, produce low-quality results with artifacts that are not...

Full description

Bibliographic Details
Main Authors: Calagari, Kiana, Elgharib, Mohamed, Didyk, Piotr, Kaspar, Alexandre, Matusik, Wojciech, Hefeeda, Mohamed
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Format: Article
Language:en_US
Published: Association for Computing Machinery (ACM) 2015
Online Access:http://hdl.handle.net/1721.1/99743
https://orcid.org/0000-0002-6090-5392
https://orcid.org/0000-0003-0212-5643
_version_ 1811074468393517056
author Calagari, Kiana
Elgharib, Mohamed
Didyk, Piotr
Kaspar, Alexandre
Matusik, Wojciech
Hefeeda, Mohamed
author2 Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
author_facet Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Calagari, Kiana
Elgharib, Mohamed
Didyk, Piotr
Kaspar, Alexandre
Matusik, Wojciech
Hefeeda, Mohamed
author_sort Calagari, Kiana
collection MIT
description A wide spread adoption of 3D videos and technologies is hindered by the lack of high-quality 3D content. One promising solution to address this problem is to use automated 2D-to-3D conversion. However, current conversion methods, while general, produce low-quality results with artifacts that are not acceptable to many viewers. We address this problem by showing how to construct a high-quality, domain-specific conversion method for soccer videos. We propose a novel, data-driven method that generates stereoscopic frames by transferring depth information from similar frames in a database of 3D stereoscopic videos. Creating a database of 3D stereoscopic videos with accurate depth is, however, very difficult. One of the key findings in this paper is showing that computer generated content in current sports computer games can be used to generate high-quality 3D video reference database for 2D-to-3D conversion methods. Once we retrieve similar 3D video frames, our technique transfers depth gradients to the target frame while respecting object boundaries. It then computes depth maps from the gradients, and generates the output stereoscopic video. We implement our method and validate it by conducting user-studies that evaluate depth perception and visual comfort of the converted 3D videos. We show that our method produces high-quality 3D videos that are almost indistinguishable from videos shot by stereo cameras. In addition, our method significantly outperforms the current state-of-the-art method. For example, up to 20% improvement in the perceived depth is achieved by our method, which translates to improving the mean opinion score from Good to Excellent.
first_indexed 2024-09-23T09:49:55Z
format Article
id mit-1721.1/99743
institution Massachusetts Institute of Technology
language en_US
last_indexed 2024-09-23T09:49:55Z
publishDate 2015
publisher Association for Computing Machinery (ACM)
record_format dspace
spelling mit-1721.1/997432022-09-30T17:05:49Z Gradient-based 2D-to-3D Conversion for Soccer Videos Calagari, Kiana Elgharib, Mohamed Didyk, Piotr Kaspar, Alexandre Matusik, Wojciech Hefeeda, Mohamed Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Kaspar, Alexandre Matusik, Wojciech A wide spread adoption of 3D videos and technologies is hindered by the lack of high-quality 3D content. One promising solution to address this problem is to use automated 2D-to-3D conversion. However, current conversion methods, while general, produce low-quality results with artifacts that are not acceptable to many viewers. We address this problem by showing how to construct a high-quality, domain-specific conversion method for soccer videos. We propose a novel, data-driven method that generates stereoscopic frames by transferring depth information from similar frames in a database of 3D stereoscopic videos. Creating a database of 3D stereoscopic videos with accurate depth is, however, very difficult. One of the key findings in this paper is showing that computer generated content in current sports computer games can be used to generate high-quality 3D video reference database for 2D-to-3D conversion methods. Once we retrieve similar 3D video frames, our technique transfers depth gradients to the target frame while respecting object boundaries. It then computes depth maps from the gradients, and generates the output stereoscopic video. We implement our method and validate it by conducting user-studies that evaluate depth perception and visual comfort of the converted 3D videos. We show that our method produces high-quality 3D videos that are almost indistinguishable from videos shot by stereo cameras. In addition, our method significantly outperforms the current state-of-the-art method. For example, up to 20% improvement in the perceived depth is achieved by our method, which translates to improving the mean opinion score from Good to Excellent. Qatar Computing Research Institute-CSAIL Partnership National Science Foundation (U.S.) (Grant IIS-1111415) 2015-11-09T13:09:54Z 2015-11-09T13:09:54Z 2015-10 Article http://purl.org/eprint/type/ConferencePaper 978-1-4503-3459-4 http://hdl.handle.net/1721.1/99743 Calagari, Kiana, Mohamed Elgharib, Piotr Didyk, Alexandre Kaspar, Wojciech Matusik, and Mohamed Hefeeda. "Gradient-based 2D-to-3D Conversion for Soccer Videos." 23rd ACM International Conference on Multimedia (October 2015). https://orcid.org/0000-0002-6090-5392 https://orcid.org/0000-0003-0212-5643 en_US http://acmmm.hosting.acm.org/2015/wp-content/uploads/102617-ACM-MM15-d5web.pdf Proceedings of the 23rd ACM International Conference on Multimedia Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/ application/pdf Association for Computing Machinery (ACM) Kaspar
spellingShingle Calagari, Kiana
Elgharib, Mohamed
Didyk, Piotr
Kaspar, Alexandre
Matusik, Wojciech
Hefeeda, Mohamed
Gradient-based 2D-to-3D Conversion for Soccer Videos
title Gradient-based 2D-to-3D Conversion for Soccer Videos
title_full Gradient-based 2D-to-3D Conversion for Soccer Videos
title_fullStr Gradient-based 2D-to-3D Conversion for Soccer Videos
title_full_unstemmed Gradient-based 2D-to-3D Conversion for Soccer Videos
title_short Gradient-based 2D-to-3D Conversion for Soccer Videos
title_sort gradient based 2d to 3d conversion for soccer videos
url http://hdl.handle.net/1721.1/99743
https://orcid.org/0000-0002-6090-5392
https://orcid.org/0000-0003-0212-5643
work_keys_str_mv AT calagarikiana gradientbased2dto3dconversionforsoccervideos
AT elgharibmohamed gradientbased2dto3dconversionforsoccervideos
AT didykpiotr gradientbased2dto3dconversionforsoccervideos
AT kasparalexandre gradientbased2dto3dconversionforsoccervideos
AT matusikwojciech gradientbased2dto3dconversionforsoccervideos
AT hefeedamohamed gradientbased2dto3dconversionforsoccervideos