Learning to rank for synthesizing planning heuristics

We investigate learning heuristics for domainspecific planning. Prior work framed learning a heuristic as an ordinary regression problem. However, in a greedy best-first search, the ordering of states induced by a heuristic is more indicative of the resulting planner’s performance than mean squared...

Full description

Bibliographic Details
Main Authors: Garrett, Caelan Reed, Kaelbling, Leslie P, Lozano-Perez, Tomas
Other Authors: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Format: Article
Language:en_US
Published: AAAI Press 2018
Online Access:http://hdl.handle.net/1721.1/115313
https://orcid.org/0000-0002-6474-1276
https://orcid.org/0000-0001-6054-7145
https://orcid.org/0000-0002-8657-2450
_version_ 1826205052554969088
author Garrett, Caelan Reed
Kaelbling, Leslie P
Lozano-Perez, Tomas
author2 Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
author_facet Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Garrett, Caelan Reed
Kaelbling, Leslie P
Lozano-Perez, Tomas
author_sort Garrett, Caelan Reed
collection MIT
description We investigate learning heuristics for domainspecific planning. Prior work framed learning a heuristic as an ordinary regression problem. However, in a greedy best-first search, the ordering of states induced by a heuristic is more indicative of the resulting planner’s performance than mean squared error. Thus, we instead frame learning a heuristic as a learning to rank problem which we solve using a RankSVM formulation. Additionally, we introduce new methods for computing features that capture temporal interactions in an approximate plan. Our experiments on recent International Planning Competition problems show that the RankSVM learned heuristics outperform both the original heuristics and heuristics learned through ordinary regression.
first_indexed 2024-09-23T13:06:00Z
format Article
id mit-1721.1/115313
institution Massachusetts Institute of Technology
language en_US
last_indexed 2024-09-23T13:06:00Z
publishDate 2018
publisher AAAI Press
record_format dspace
spelling mit-1721.1/1153132022-10-01T13:03:24Z Learning to rank for synthesizing planning heuristics Garrett, Caelan Reed Kaelbling, Leslie P Lozano-Perez, Tomas Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Garrett, Caelan Reed Kaelbling, Leslie P Lozano-Perez, Tomas We investigate learning heuristics for domainspecific planning. Prior work framed learning a heuristic as an ordinary regression problem. However, in a greedy best-first search, the ordering of states induced by a heuristic is more indicative of the resulting planner’s performance than mean squared error. Thus, we instead frame learning a heuristic as a learning to rank problem which we solve using a RankSVM formulation. Additionally, we introduce new methods for computing features that capture temporal interactions in an approximate plan. Our experiments on recent International Planning Competition problems show that the RankSVM learned heuristics outperform both the original heuristics and heuristics learned through ordinary regression. National Science Foundation (U.S.) (Grant 1420927) National Science Foundation (U.S.) (Grant 1420927) National Science Foundation (U.S.) (Grant 1523767) United States. Office of Naval Research (Grant N00014-14-1-0486) United States. Army Research Office (Grant W911NF1410433) 2018-05-11T14:19:54Z 2018-05-11T14:19:54Z 2016-07 Article http://purl.org/eprint/type/ConferencePaper http://hdl.handle.net/1721.1/115313 Garrett, Caelan Reed, Leslie Pack Kaelbling, Tomás Lozano-Pérez. "Learning to Rank for Synthesizing Planning Heuristics." Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI-16), 9-15 July, 2016, New York, New York, AAAI Press. https://orcid.org/0000-0002-6474-1276 https://orcid.org/0000-0001-6054-7145 https://orcid.org/0000-0002-8657-2450 en_US https://www.ijcai.org/proceedings/2016 Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI-16) Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by/4.0/ application/pdf AAAI Press MIT Web Domain
spellingShingle Garrett, Caelan Reed
Kaelbling, Leslie P
Lozano-Perez, Tomas
Learning to rank for synthesizing planning heuristics
title Learning to rank for synthesizing planning heuristics
title_full Learning to rank for synthesizing planning heuristics
title_fullStr Learning to rank for synthesizing planning heuristics
title_full_unstemmed Learning to rank for synthesizing planning heuristics
title_short Learning to rank for synthesizing planning heuristics
title_sort learning to rank for synthesizing planning heuristics
url http://hdl.handle.net/1721.1/115313
https://orcid.org/0000-0002-6474-1276
https://orcid.org/0000-0001-6054-7145
https://orcid.org/0000-0002-8657-2450
work_keys_str_mv AT garrettcaelanreed learningtorankforsynthesizingplanningheuristics
AT kaelblinglesliep learningtorankforsynthesizingplanningheuristics
AT lozanopereztomas learningtorankforsynthesizingplanningheuristics