Learning to rank for synthesizing planning heuristics

We investigate learning heuristics for domainspecific planning. Prior work framed learning a heuristic as an ordinary regression problem. However, in a greedy best-first search, the ordering of states induced by a heuristic is more indicative of the resulting planner’s performance than mean squared...

Full description

Bibliographic Details
Main Authors:	Garrett, Caelan Reed, Kaelbling, Leslie P, Lozano-Perez, Tomas
Other Authors:	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Format:	Article
Language:	en_US
Published:	AAAI Press 2018
Online Access:	http://hdl.handle.net/1721.1/115313 https://orcid.org/0000-0002-6474-1276 https://orcid.org/0000-0001-6054-7145 https://orcid.org/0000-0002-8657-2450

_version_	1826205052554969088
author	Garrett, Caelan Reed Kaelbling, Leslie P Lozano-Perez, Tomas
author2	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
author_facet	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Garrett, Caelan Reed Kaelbling, Leslie P Lozano-Perez, Tomas
author_sort	Garrett, Caelan Reed
collection	MIT
description	We investigate learning heuristics for domainspecific planning. Prior work framed learning a heuristic as an ordinary regression problem. However, in a greedy best-first search, the ordering of states induced by a heuristic is more indicative of the resulting planner’s performance than mean squared error. Thus, we instead frame learning a heuristic as a learning to rank problem which we solve using a RankSVM formulation. Additionally, we introduce new methods for computing features that capture temporal interactions in an approximate plan. Our experiments on recent International Planning Competition problems show that the RankSVM learned heuristics outperform both the original heuristics and heuristics learned through ordinary regression.
first_indexed	2024-09-23T13:06:00Z
format	Article
id	mit-1721.1/115313
institution	Massachusetts Institute of Technology
language	en_US
last_indexed	2024-09-23T13:06:00Z
publishDate	2018
publisher	AAAI Press
record_format	dspace
spelling	mit-1721.1/1153132022-10-01T13:03:24Z Learning to rank for synthesizing planning heuristics Garrett, Caelan Reed Kaelbling, Leslie P Lozano-Perez, Tomas Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Garrett, Caelan Reed Kaelbling, Leslie P Lozano-Perez, Tomas We investigate learning heuristics for domainspecific planning. Prior work framed learning a heuristic as an ordinary regression problem. However, in a greedy best-first search, the ordering of states induced by a heuristic is more indicative of the resulting planner’s performance than mean squared error. Thus, we instead frame learning a heuristic as a learning to rank problem which we solve using a RankSVM formulation. Additionally, we introduce new methods for computing features that capture temporal interactions in an approximate plan. Our experiments on recent International Planning Competition problems show that the RankSVM learned heuristics outperform both the original heuristics and heuristics learned through ordinary regression. National Science Foundation (U.S.) (Grant 1420927) National Science Foundation (U.S.) (Grant 1420927) National Science Foundation (U.S.) (Grant 1523767) United States. Office of Naval Research (Grant N00014-14-1-0486) United States. Army Research Office (Grant W911NF1410433) 2018-05-11T14:19:54Z 2018-05-11T14:19:54Z 2016-07 Article http://purl.org/eprint/type/ConferencePaper http://hdl.handle.net/1721.1/115313 Garrett, Caelan Reed, Leslie Pack Kaelbling, Tomás Lozano-Pérez. "Learning to Rank for Synthesizing Planning Heuristics." Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI-16), 9-15 July, 2016, New York, New York, AAAI Press. https://orcid.org/0000-0002-6474-1276 https://orcid.org/0000-0001-6054-7145 https://orcid.org/0000-0002-8657-2450 en_US https://www.ijcai.org/proceedings/2016 Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI-16) Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by/4.0/ application/pdf AAAI Press MIT Web Domain
spellingShingle	Garrett, Caelan Reed Kaelbling, Leslie P Lozano-Perez, Tomas Learning to rank for synthesizing planning heuristics
title	Learning to rank for synthesizing planning heuristics
title_full	Learning to rank for synthesizing planning heuristics
title_fullStr	Learning to rank for synthesizing planning heuristics
title_full_unstemmed	Learning to rank for synthesizing planning heuristics
title_short	Learning to rank for synthesizing planning heuristics
title_sort	learning to rank for synthesizing planning heuristics
url	http://hdl.handle.net/1721.1/115313 https://orcid.org/0000-0002-6474-1276 https://orcid.org/0000-0001-6054-7145 https://orcid.org/0000-0002-8657-2450
work_keys_str_mv	AT garrettcaelanreed learningtorankforsynthesizingplanningheuristics AT kaelblinglesliep learningtorankforsynthesizingplanningheuristics AT lozanopereztomas learningtorankforsynthesizingplanningheuristics

Learning to rank for synthesizing planning heuristics

Similar Items