The computational complexity of finding stationary points in non-convex optimization

<p>Finding approximate stationary points, i.e., points where the gradient is approximately zero, of non-convex but smooth objective functions f over unrestricted d-dimensional domains is one of the most fundamental problems in classical non-convex optimization. Nevertheless, the computational...

Full description

Bibliographic Details
Main Authors: Hollender, A, Zampetakis, M
Format: Journal article
Language:English
Published: Springer 2024
_version_ 1811141258714808320
author Hollender, A
Zampetakis, M
author_facet Hollender, A
Zampetakis, M
author_sort Hollender, A
collection OXFORD
description <p>Finding approximate stationary points, i.e., points where the gradient is approximately zero, of non-convex but smooth objective functions f over unrestricted d-dimensional domains is one of the most fundamental problems in classical non-convex optimization. Nevertheless, the computational and query complexity of this problem are still not well understood when the dimension d of the problem is independent of the approximation error. In this paper, we show the following computational and query complexity results:</p> <ol> <li>The problem of finding approximate stationary points over unrestricted domains is PLScomplete.</li> <li>For d = 2, we provide a zero-order algorithm for finding &epsilon;-approximate stationary points that requires at most O(1/&epsilon;) value queries to the objective function.</li> <li>We show that any algorithm needs at least Ω(1/&epsilon;) queries to the objective function and/or its gradient to find &epsilon;-approximate stationary points when d = 2. Combined with the above, this characterizes the query complexity of this problem to be &Theta;(1/&epsilon;).</li> <li>For d = 2, we provide a zero-order algorithm for finding &epsilon;-KKT points in constrained optimization problems that requires at most O(1/ &radic; &epsilon;) value queries to the objective function. This closes the gap between the works of Bubeck and Mikulincer [2020] and Vavasis [1993] and characterizes the query complexity of this problem to be &Theta;(1/ &radic; &epsilon;).</li> <li>Combining our results with the recent result of Fearnley et al. [2022], we show that finding approximate KKT points in constrained optimization is reducible to finding approximate stationary points in unconstrained optimization but the converse is impossible.</li> </ol>
first_indexed 2024-09-25T04:35:01Z
format Journal article
id oxford-uuid:6b2097c3-63c4-436d-a027-8878c31566b1
institution University of Oxford
language English
last_indexed 2024-09-25T04:35:01Z
publishDate 2024
publisher Springer
record_format dspace
spelling oxford-uuid:6b2097c3-63c4-436d-a027-8878c31566b12024-09-13T12:04:02ZThe computational complexity of finding stationary points in non-convex optimizationJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:6b2097c3-63c4-436d-a027-8878c31566b1EnglishSymplectic ElementsSpringer2024Hollender, AZampetakis, M<p>Finding approximate stationary points, i.e., points where the gradient is approximately zero, of non-convex but smooth objective functions f over unrestricted d-dimensional domains is one of the most fundamental problems in classical non-convex optimization. Nevertheless, the computational and query complexity of this problem are still not well understood when the dimension d of the problem is independent of the approximation error. In this paper, we show the following computational and query complexity results:</p> <ol> <li>The problem of finding approximate stationary points over unrestricted domains is PLScomplete.</li> <li>For d = 2, we provide a zero-order algorithm for finding &epsilon;-approximate stationary points that requires at most O(1/&epsilon;) value queries to the objective function.</li> <li>We show that any algorithm needs at least Ω(1/&epsilon;) queries to the objective function and/or its gradient to find &epsilon;-approximate stationary points when d = 2. Combined with the above, this characterizes the query complexity of this problem to be &Theta;(1/&epsilon;).</li> <li>For d = 2, we provide a zero-order algorithm for finding &epsilon;-KKT points in constrained optimization problems that requires at most O(1/ &radic; &epsilon;) value queries to the objective function. This closes the gap between the works of Bubeck and Mikulincer [2020] and Vavasis [1993] and characterizes the query complexity of this problem to be &Theta;(1/ &radic; &epsilon;).</li> <li>Combining our results with the recent result of Fearnley et al. [2022], we show that finding approximate KKT points in constrained optimization is reducible to finding approximate stationary points in unconstrained optimization but the converse is impossible.</li> </ol>
spellingShingle Hollender, A
Zampetakis, M
The computational complexity of finding stationary points in non-convex optimization
title The computational complexity of finding stationary points in non-convex optimization
title_full The computational complexity of finding stationary points in non-convex optimization
title_fullStr The computational complexity of finding stationary points in non-convex optimization
title_full_unstemmed The computational complexity of finding stationary points in non-convex optimization
title_short The computational complexity of finding stationary points in non-convex optimization
title_sort computational complexity of finding stationary points in non convex optimization
work_keys_str_mv AT hollendera thecomputationalcomplexityoffindingstationarypointsinnonconvexoptimization
AT zampetakism thecomputationalcomplexityoffindingstationarypointsinnonconvexoptimization
AT hollendera computationalcomplexityoffindingstationarypointsinnonconvexoptimization
AT zampetakism computationalcomplexityoffindingstationarypointsinnonconvexoptimization