Normalized Metadata Generation for Human Retrieval Using Multiple Video Surveillance Cameras

Since it is impossible for surveillance personnel to keep monitoring videos from a multiple camera-based surveillance system, an efficient technique is needed to help recognize important situations by retrieving the metadata of an object-of-interest. In a multiple camera-based surveillance system, a...

Full description

Bibliographic Details
Main Authors: Jaehoon Jung, Inhye Yoon, Seungwon Lee, Joonki Paik
Format: Article
Language:English
Published: MDPI AG 2016-06-01
Series:Sensors
Subjects:
Online Access:http://www.mdpi.com/1424-8220/16/7/963
_version_ 1828137314794078208
author Jaehoon Jung
Inhye Yoon
Seungwon Lee
Joonki Paik
author_facet Jaehoon Jung
Inhye Yoon
Seungwon Lee
Joonki Paik
author_sort Jaehoon Jung
collection DOAJ
description Since it is impossible for surveillance personnel to keep monitoring videos from a multiple camera-based surveillance system, an efficient technique is needed to help recognize important situations by retrieving the metadata of an object-of-interest. In a multiple camera-based surveillance system, an object detected in a camera has a different shape in another camera, which is a critical issue of wide-range, real-time surveillance systems. In order to address the problem, this paper presents an object retrieval method by extracting the normalized metadata of an object-of-interest from multiple, heterogeneous cameras. The proposed metadata generation algorithm consists of three steps: (i) generation of a three-dimensional (3D) human model; (ii) human object-based automatic scene calibration; and (iii) metadata generation. More specifically, an appropriately-generated 3D human model provides the foot-to-head direction information that is used as the input of the automatic calibration of each camera. The normalized object information is used to retrieve an object-of-interest in a wide-range, multiple-camera surveillance system in the form of metadata. Experimental results show that the 3D human model matches the ground truth, and automatic calibration-based normalization of metadata enables a successful retrieval and tracking of a human object in the multiple-camera video surveillance system.
first_indexed 2024-04-11T18:16:31Z
format Article
id doaj.art-99b0cd159496483fb5c51a9369389889
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-04-11T18:16:31Z
publishDate 2016-06-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-99b0cd159496483fb5c51a93693898892022-12-22T04:09:52ZengMDPI AGSensors1424-82202016-06-0116796310.3390/s16070963s16070963Normalized Metadata Generation for Human Retrieval Using Multiple Video Surveillance CamerasJaehoon Jung0Inhye Yoon1Seungwon Lee2Joonki Paik3Department of Image, Chung-Ang University, 84 Heukseok-ro, Dongjak-gu, Seoul 06974, KoreaDepartment of Image, Chung-Ang University, 84 Heukseok-ro, Dongjak-gu, Seoul 06974, KoreaDepartment of Image, Chung-Ang University, 84 Heukseok-ro, Dongjak-gu, Seoul 06974, KoreaDepartment of Image, Chung-Ang University, 84 Heukseok-ro, Dongjak-gu, Seoul 06974, KoreaSince it is impossible for surveillance personnel to keep monitoring videos from a multiple camera-based surveillance system, an efficient technique is needed to help recognize important situations by retrieving the metadata of an object-of-interest. In a multiple camera-based surveillance system, an object detected in a camera has a different shape in another camera, which is a critical issue of wide-range, real-time surveillance systems. In order to address the problem, this paper presents an object retrieval method by extracting the normalized metadata of an object-of-interest from multiple, heterogeneous cameras. The proposed metadata generation algorithm consists of three steps: (i) generation of a three-dimensional (3D) human model; (ii) human object-based automatic scene calibration; and (iii) metadata generation. More specifically, an appropriately-generated 3D human model provides the foot-to-head direction information that is used as the input of the automatic calibration of each camera. The normalized object information is used to retrieve an object-of-interest in a wide-range, multiple-camera surveillance system in the form of metadata. Experimental results show that the 3D human model matches the ground truth, and automatic calibration-based normalization of metadata enables a successful retrieval and tracking of a human object in the multiple-camera video surveillance system.http://www.mdpi.com/1424-8220/16/7/963video surveillancevideo retrievalautomatic calibrationmetadata descriptorhomologycolor clusteringobject tracking
spellingShingle Jaehoon Jung
Inhye Yoon
Seungwon Lee
Joonki Paik
Normalized Metadata Generation for Human Retrieval Using Multiple Video Surveillance Cameras
Sensors
video surveillance
video retrieval
automatic calibration
metadata descriptor
homology
color clustering
object tracking
title Normalized Metadata Generation for Human Retrieval Using Multiple Video Surveillance Cameras
title_full Normalized Metadata Generation for Human Retrieval Using Multiple Video Surveillance Cameras
title_fullStr Normalized Metadata Generation for Human Retrieval Using Multiple Video Surveillance Cameras
title_full_unstemmed Normalized Metadata Generation for Human Retrieval Using Multiple Video Surveillance Cameras
title_short Normalized Metadata Generation for Human Retrieval Using Multiple Video Surveillance Cameras
title_sort normalized metadata generation for human retrieval using multiple video surveillance cameras
topic video surveillance
video retrieval
automatic calibration
metadata descriptor
homology
color clustering
object tracking
url http://www.mdpi.com/1424-8220/16/7/963
work_keys_str_mv AT jaehoonjung normalizedmetadatagenerationforhumanretrievalusingmultiplevideosurveillancecameras
AT inhyeyoon normalizedmetadatagenerationforhumanretrievalusingmultiplevideosurveillancecameras
AT seungwonlee normalizedmetadatagenerationforhumanretrievalusingmultiplevideosurveillancecameras
AT joonkipaik normalizedmetadatagenerationforhumanretrievalusingmultiplevideosurveillancecameras