DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents

We introduce a Deep Stochastic IOC RNN Encoder-decoder framework, DESIRE, for the task of future predictions of multiple interacting agents in dynamic scenes. DESIRE effectively predicts future locations of objects in multiple scenes by 1) accounting for the multi-modal nature of the future predicti...

Bibliographic Details
Main Authors:	Lee, N, Choi, W, Vernaza, P, Choy, C, Torr, P, Chandraker, M
Format:	Conference item
Published:	CVPR 2017

collection	OXFORD
description	We introduce a Deep Stochastic IOC RNN Encoder-decoder framework, DESIRE, for the task of future predictions of multiple interacting agents in dynamic scenes. DESIRE effectively predicts future locations of objects in multiple scenes by 1) accounting for the multi-modal nature of the future prediction (i.e., given the same context, the future may vary), 2) foreseeing the potential future outcomes and making a strategic prediction based on them, and 3) reasoning not only from the past motion history, but also from the scene context and the interactions among the agents. DESIRE achieves these in a single end-to-end trainable neural network model, while being computationally efficient. The model first obtains a diverse set of hypothetical future prediction samples using a conditional variational autoencoder; these samples are then ranked and refined by an RNN scoring-regression module. Samples are scored by accounting for accumulated future rewards, which enables better long-term strategic decisions, similar to IOC frameworks. An RNN scene context fusion module jointly captures past motion histories, the semantic scene context, and interactions among multiple agents. A feedback mechanism iterates over the ranking and refinement to further boost the prediction accuracy. We evaluate our model on two publicly available datasets: KITTI and the Stanford Drone Dataset. Our experiments show that the proposed model significantly improves the prediction accuracy compared to other baseline methods.
id	oxford-uuid:715fc1b7-e023-4eb8-8f63-aa869e2c35c7
institution	University of Oxford

Similar Items