Combining image descriptors to effectively retrieve events from visual lifelogs

The SenseCam is a wearable camera that passively captures approximately 3,000 images per day, which equates to almost one million images per year. It is used to create a personal visual recording of the wearer's life and generates information which can be helpful as a human memory aid. For such...

Full description

Bibliographic Details
Main Authors:	Doherty, A, Conaire, C, Blighe, M, Smeaton, A, O'connor, N
Format:	Journal article
Language:	English
Published:	2008

_version_	1826302497997717504
author	Doherty, A Conaire, C Blighe, M Smeaton, A O'connor, N
author_facet	Doherty, A Conaire, C Blighe, M Smeaton, A O'connor, N
author_sort	Doherty, A
collection	OXFORD
description	The SenseCam is a wearable camera that passively captures approximately 3,000 images per day, which equates to almost one million images per year. It is used to create a personal visual recording of the wearer's life and generates information which can be helpful as a human memory aid. For such a large amount of visual information to be of any use, it is accepted that it should be structured into "events", of which there are about 8,000 in a wearer's average year. In automatically segmenting SenseCam images into events, it will then be useful for users to locate other events similar to a given event e.g. "what other times was I walking in the park?", "show me other events when I was in a restaurant". On two datasets of 240k and 1.8M images containing topics with a variety of information needs, we evaluate the fusion of MPEG-7, SIFT, and SURF content-based retrieval techniques to address the event search issue. We have found that our proposed fusion approach of MPEG-7 and SURF offers an improvement on using either of those sources or SIFT individually, and we have also shown how a lifelog event is modeled has a large effect on the retrieval performance. Copyright 2008 ACM.
first_indexed	2024-03-07T05:48:27Z
format	Journal article
id	oxford-uuid:e80238f5-610a-4d50-9fdf-bc27154b7724
institution	University of Oxford
language	English
last_indexed	2024-03-07T05:48:27Z
publishDate	2008
record_format	dspace
spelling	oxford-uuid:e80238f5-610a-4d50-9fdf-bc27154b77242022-03-27T10:43:26ZCombining image descriptors to effectively retrieve events from visual lifelogsJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:e80238f5-610a-4d50-9fdf-bc27154b7724EnglishSymplectic Elements at Oxford2008Doherty, AConaire, CBlighe, MSmeaton, AO'connor, NThe SenseCam is a wearable camera that passively captures approximately 3,000 images per day, which equates to almost one million images per year. It is used to create a personal visual recording of the wearer's life and generates information which can be helpful as a human memory aid. For such a large amount of visual information to be of any use, it is accepted that it should be structured into "events", of which there are about 8,000 in a wearer's average year. In automatically segmenting SenseCam images into events, it will then be useful for users to locate other events similar to a given event e.g. "what other times was I walking in the park?", "show me other events when I was in a restaurant". On two datasets of 240k and 1.8M images containing topics with a variety of information needs, we evaluate the fusion of MPEG-7, SIFT, and SURF content-based retrieval techniques to address the event search issue. We have found that our proposed fusion approach of MPEG-7 and SURF offers an improvement on using either of those sources or SIFT individually, and we have also shown how a lifelog event is modeled has a large effect on the retrieval performance. Copyright 2008 ACM.
spellingShingle	Doherty, A Conaire, C Blighe, M Smeaton, A O'connor, N Combining image descriptors to effectively retrieve events from visual lifelogs
title	Combining image descriptors to effectively retrieve events from visual lifelogs
title_full	Combining image descriptors to effectively retrieve events from visual lifelogs
title_fullStr	Combining image descriptors to effectively retrieve events from visual lifelogs
title_full_unstemmed	Combining image descriptors to effectively retrieve events from visual lifelogs
title_short	Combining image descriptors to effectively retrieve events from visual lifelogs
title_sort	combining image descriptors to effectively retrieve events from visual lifelogs
work_keys_str_mv	AT dohertya combiningimagedescriptorstoeffectivelyretrieveeventsfromvisuallifelogs AT conairec combiningimagedescriptorstoeffectivelyretrieveeventsfromvisuallifelogs AT blighem combiningimagedescriptorstoeffectivelyretrieveeventsfromvisuallifelogs AT smeatona combiningimagedescriptorstoeffectivelyretrieveeventsfromvisuallifelogs AT oconnorn combiningimagedescriptorstoeffectivelyretrieveeventsfromvisuallifelogs

Combining image descriptors to effectively retrieve events from visual lifelogs

Similar Items