An Analysis of the Search Spaces for Generate and Validate Patch Generation Systems

We present the first systematic analysis of the characteristics of patch search spaces for automatic patch generation systems. We analyze the search spaces of two current state-of- the-art systems, SPR and Prophet, with 16 different search space configurations. Our results are derived from an analys...

Full description

Bibliographic Details
Main Authors:	Long, Fan, Rinard, Martin
Other Authors:	Martin Rinard
Published:	2016
Subjects:	Program repair
Online Access:	http://hdl.handle.net/1721.1/101211

_version_	1826214359024533504
author	Long, Fan Rinard, Martin
author2	Martin Rinard
author_facet	Martin Rinard Long, Fan Rinard, Martin
author_sort	Long, Fan
collection	MIT
description	We present the first systematic analysis of the characteristics of patch search spaces for automatic patch generation systems. We analyze the search spaces of two current state-of- the-art systems, SPR and Prophet, with 16 different search space configurations. Our results are derived from an analysis of 1104 different search spaces and 768 patch generation executions. Together these experiments consumed over 9000 hours of CPU time on Amazon EC2.The analysis shows that 1) correct patches are sparse in the search spaces (typically at most one correct patch per search space per defect), 2) incorrect patches that nevertheless pass all of the test cases in the validation test suite are typically orders of magnitude more abundant, and 3) leveraging information other than the test suite is therefore critical for enabling the system to successfully isolate correct patches.We also characterize a key tradeoff in the structure of the search spaces. Larger and richer search spaces that contain correct patches for more defects can actually cause systems to find fewer, not more, correct patches. We identify two reasons for this phenomenon: 1) increased validation times because of the presence of more candidate patches and 2) more incorrect patches that pass the test suite and block the discovery of correct patches. These fundamental properties, which are all characterized for the first time in this paper, help explain why past systems often fail to generate correct patches and help identify challenges, opportunities, and productive future directions for the field.
first_indexed	2024-09-23T16:04:08Z
id	mit-1721.1/101211
institution	Massachusetts Institute of Technology
last_indexed	2024-09-23T16:04:08Z
publishDate	2016
record_format	dspace
spelling	mit-1721.1/1012112019-04-12T15:56:50Z An Analysis of the Search Spaces for Generate and Validate Patch Generation Systems Long, Fan Rinard, Martin Martin Rinard Program Analysis and Compilation Program repair We present the first systematic analysis of the characteristics of patch search spaces for automatic patch generation systems. We analyze the search spaces of two current state-of- the-art systems, SPR and Prophet, with 16 different search space configurations. Our results are derived from an analysis of 1104 different search spaces and 768 patch generation executions. Together these experiments consumed over 9000 hours of CPU time on Amazon EC2.The analysis shows that 1) correct patches are sparse in the search spaces (typically at most one correct patch per search space per defect), 2) incorrect patches that nevertheless pass all of the test cases in the validation test suite are typically orders of magnitude more abundant, and 3) leveraging information other than the test suite is therefore critical for enabling the system to successfully isolate correct patches.We also characterize a key tradeoff in the structure of the search spaces. Larger and richer search spaces that contain correct patches for more defects can actually cause systems to find fewer, not more, correct patches. We identify two reasons for this phenomenon: 1) increased validation times because of the presence of more candidate patches and 2) more incorrect patches that pass the test suite and block the discovery of correct patches. These fundamental properties, which are all characterized for the first time in this paper, help explain why past systems often fail to generate correct patches and help identify challenges, opportunities, and productive future directions for the field. 2016-02-18T20:45:03Z 2016-02-18T20:45:03Z 2016-02-18 2016-02-18T20:45:03Z http://hdl.handle.net/1721.1/101211 MIT-CSAIL-TR-2016-003 45 p. application/pdf
spellingShingle	Program repair Long, Fan Rinard, Martin An Analysis of the Search Spaces for Generate and Validate Patch Generation Systems
title	An Analysis of the Search Spaces for Generate and Validate Patch Generation Systems
title_full	An Analysis of the Search Spaces for Generate and Validate Patch Generation Systems
title_fullStr	An Analysis of the Search Spaces for Generate and Validate Patch Generation Systems
title_full_unstemmed	An Analysis of the Search Spaces for Generate and Validate Patch Generation Systems
title_short	An Analysis of the Search Spaces for Generate and Validate Patch Generation Systems
title_sort	analysis of the search spaces for generate and validate patch generation systems
topic	Program repair
url	http://hdl.handle.net/1721.1/101211
work_keys_str_mv	AT longfan ananalysisofthesearchspacesforgenerateandvalidatepatchgenerationsystems AT rinardmartin ananalysisofthesearchspacesforgenerateandvalidatepatchgenerationsystems AT longfan analysisofthesearchspacesforgenerateandvalidatepatchgenerationsystems AT rinardmartin analysisofthesearchspacesforgenerateandvalidatepatchgenerationsystems

An Analysis of the Search Spaces for Generate and Validate Patch Generation Systems

Similar Items