Fusing target information from multiple views for robust visual tracking

Bibliographic Details
Main Authors: Keli Hu, Xing Zhang, Yuzhang Gu, Yingguan Wang
Format: Article
Language: English
Published: Wiley, 2014-04-01
Series: IET Computer Vision
Online Access: https://doi.org/10.1049/iet-cvi.2013.0026
Description
Summary: In this study, the authors address the problem of tracking a single target in a calibrated multi-camera surveillance system, given the target's location in the first frame of each view. Tracking with online multiple instance learning (OMIL) has recently been shown to give promising results, but it may fail in a real surveillance system because of changes in target orientation, scale, or illumination. The authors show that fusing target information from multiple views avoids these problems and yields a more robust tracker. At each camera node, an efficient OMIL algorithm models target appearance. To update the OMIL-based classifier in one view, a co-training strategy generates a representative set of training bags from all views; the bags extracted from each view carry a weight that reflects how similar the target's appearance in that view is to its appearance in the view whose classifier is being updated. In addition, target motion on each camera's image plane is modelled by a modified particle filter guided by the object's two-dimensional (2D) location in that view and by the fused three-dimensional (3D) location. Experimental results demonstrate that the proposed algorithm is robust for human tracking in challenging scenes.
ISSN: 1751-9632 (print), 1751-9640 (online)
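
Illustrative Sketches (not from the paper)

The abstract describes two mechanisms without implementation detail. As a rough Python illustration of the first, the sketch below shows one plausible reading of the view-weighted bag pooling used in the co-training update; the similarity measure (a Bhattacharyya coefficient over appearance histograms), all names, and the data layout are assumptions, not the authors' code.

import numpy as np

def appearance_similarity(hist_a, hist_b):
    # Assumed measure: Bhattacharyya coefficient between two normalised
    # appearance histograms; 1.0 means identical appearance, 0.0 disjoint.
    return float(np.sum(np.sqrt(hist_a * hist_b)))

def build_cotraining_bags(target_view, bags_by_view, hist_by_view):
    # Pool training bags from every camera for the classifier of
    # `target_view`, weighting each view's bags by how closely the
    # target's appearance there matches its appearance in `target_view`.
    ref_hist = hist_by_view[target_view]
    weighted = []
    for view, bags in bags_by_view.items():
        w = appearance_similarity(ref_hist, hist_by_view[view])
        for instances, label in bags:  # a bag = (instances, +1/-1 label)
            weighted.append((instances, label, w))
    return weighted

For the second mechanism, a similarly hedged sketch of one particle propagation step: a fraction of the particle set is re-seeded around the fused 3D target position projected into the current view (fused_2d), so the filter can recover when the single-view estimate drifts. The guide fraction and noise scale are invented parameters, not values from the paper.

import numpy as np

def propagate_particles(particles, fused_2d, motion_std=8.0,
                        guide_frac=0.3, rng=None):
    # Random-walk propagation of (n, 2) image-plane particle positions,
    # with the first `guide_frac` of them re-seeded around `fused_2d`,
    # the fused 3D location projected into this camera's image plane.
    if rng is None:
        rng = np.random.default_rng()
    n = len(particles)
    moved = particles + rng.normal(0.0, motion_std, size=(n, 2))
    k = int(guide_frac * n)
    moved[:k] = fused_2d + rng.normal(0.0, motion_std, size=(k, 2))
    return moved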