What am I searching for?

Can we infer intentions and goals from a person's actions? As an example of this family of problems, we consider here whether it is possible to decipher what a person is searching for by decoding their eye movement behavior. We conducted two human psychophysics experiments on object arrays and...

Full description

Bibliographic Details
Main Authors:	Zhang, Mengmi, Feng, Jiashi, Lim, Joo Hwee, Zhao, Qi, Kreiman, Gabriel
Format:	Technical Report
Language:	en_US
Published:	Center for Brains, Minds and Machines (CBMM), arXiv.org 2018
Online Access:	http://hdl.handle.net/1721.1/119576

_version_	1811080062186815488
author	Zhang, Mengmi Feng, Jiashi Lim, Joo Hwee Zhao, Qi Kreiman, Gabriel
author_facet	Zhang, Mengmi Feng, Jiashi Lim, Joo Hwee Zhao, Qi Kreiman, Gabriel
author_sort	Zhang, Mengmi
collection	MIT
description	Can we infer intentions and goals from a person's actions? As an example of this family of problems, we consider here whether it is possible to decipher what a person is searching for by decoding their eye movement behavior. We conducted two human psychophysics experiments on object arrays and natural images where we monitored subjects' eye movements while they were looking for a target object. Using as input the pattern of "error" fixations on non-target objects before the target was found, we developed a model (InferNet) whose goal was to infer what the target was. "Error" fixations share similar features with the sought target. The Infernet model uses a pre-trained 2D convolutional architecture to extract features from the error fixations and computes a 2D similarity map between the error fixation and all locations across the search image by modulating the search image via convolution across layers. InferNet consolidates the modulated response maps across layers via max pooling to keep track of the sub-patterns highly similar to features at error fixations and integrates these maps across all error fixations. InferNet successfully identifies the subject's goal and outperforms all the competitive null models, even without any object-specific training on the inference task.
first_indexed	2024-09-23T11:25:09Z
format	Technical Report
id	mit-1721.1/119576
institution	Massachusetts Institute of Technology
language	en_US
last_indexed	2024-09-23T11:25:09Z
publishDate	2018
publisher	Center for Brains, Minds and Machines (CBMM), arXiv.org
record_format	dspace
spelling	mit-1721.1/1195762019-04-11T02:26:01Z What am I searching for? Zhang, Mengmi Feng, Jiashi Lim, Joo Hwee Zhao, Qi Kreiman, Gabriel Can we infer intentions and goals from a person's actions? As an example of this family of problems, we consider here whether it is possible to decipher what a person is searching for by decoding their eye movement behavior. We conducted two human psychophysics experiments on object arrays and natural images where we monitored subjects' eye movements while they were looking for a target object. Using as input the pattern of "error" fixations on non-target objects before the target was found, we developed a model (InferNet) whose goal was to infer what the target was. "Error" fixations share similar features with the sought target. The Infernet model uses a pre-trained 2D convolutional architecture to extract features from the error fixations and computes a 2D similarity map between the error fixation and all locations across the search image by modulating the search image via convolution across layers. InferNet consolidates the modulated response maps across layers via max pooling to keep track of the sub-patterns highly similar to features at error fixations and integrates these maps across all error fixations. InferNet successfully identifies the subject's goal and outperforms all the competitive null models, even without any object-specific training on the inference task. This work was supported by the Center for Brains, Minds and Machines (CBMM), funded by NSF STC award CCF – 1231216. 2018-12-11T20:40:45Z 2018-12-11T20:40:45Z 2018-07-31 Technical Report Working Paper Other http://hdl.handle.net/1721.1/119576 en_US CBMM Memo Series;096 application/pdf Center for Brains, Minds and Machines (CBMM), arXiv.org
spellingShingle	Zhang, Mengmi Feng, Jiashi Lim, Joo Hwee Zhao, Qi Kreiman, Gabriel What am I searching for?
title	What am I searching for?
title_full	What am I searching for?
title_fullStr	What am I searching for?
title_full_unstemmed	What am I searching for?
title_short	What am I searching for?
title_sort	what am i searching for
url	http://hdl.handle.net/1721.1/119576
work_keys_str_mv	AT zhangmengmi whatamisearchingfor AT fengjiashi whatamisearchingfor AT limjoohwee whatamisearchingfor AT zhaoqi whatamisearchingfor AT kreimangabriel whatamisearchingfor

What am I searching for?

Similar Items