The computational complexity of finding stationary points in non-convex optimization

<p>Finding approximate stationary points, i.e., points where the gradient is approximately zero, of non-convex but smooth objective functions f over unrestricted d-dimensional domains is one of the most fundamental problems in classical non-convex optimization. Nevertheless, the computational...

Full description

Bibliographic Details
Main Authors:	Hollender, A, Zampetakis, M
Format:	Journal article
Language:	English
Published:	Springer 2024

_version_	1811141258714808320
author	Hollender, A Zampetakis, M
author_facet	Hollender, A Zampetakis, M
author_sort	Hollender, A
collection	OXFORD
description	<p>Finding approximate stationary points, i.e., points where the gradient is approximately zero, of non-convex but smooth objective functions f over unrestricted d-dimensional domains is one of the most fundamental problems in classical non-convex optimization. Nevertheless, the computational and query complexity of this problem are still not well understood when the dimension d of the problem is independent of the approximation error. In this paper, we show the following computational and query complexity results:</p> <ol> <li>The problem of finding approximate stationary points over unrestricted domains is PLScomplete.</li> <li>For d = 2, we provide a zero-order algorithm for finding ε-approximate stationary points that requires at most O(1/ε) value queries to the objective function.</li> <li>We show that any algorithm needs at least Ω(1/ε) queries to the objective function and/or its gradient to find ε-approximate stationary points when d = 2. Combined with the above, this characterizes the query complexity of this problem to be Θ(1/ε).</li> <li>For d = 2, we provide a zero-order algorithm for finding ε-KKT points in constrained optimization problems that requires at most O(1/ √ ε) value queries to the objective function. This closes the gap between the works of Bubeck and Mikulincer [2020] and Vavasis [1993] and characterizes the query complexity of this problem to be Θ(1/ √ ε).</li> <li>Combining our results with the recent result of Fearnley et al. [2022], we show that finding approximate KKT points in constrained optimization is reducible to finding approximate stationary points in unconstrained optimization but the converse is impossible.</li> </ol>
first_indexed	2024-09-25T04:35:01Z
format	Journal article
id	oxford-uuid:6b2097c3-63c4-436d-a027-8878c31566b1
institution	University of Oxford
language	English
last_indexed	2024-09-25T04:35:01Z
publishDate	2024
publisher	Springer
record_format	dspace
spelling	oxford-uuid:6b2097c3-63c4-436d-a027-8878c31566b12024-09-13T12:04:02ZThe computational complexity of finding stationary points in non-convex optimizationJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:6b2097c3-63c4-436d-a027-8878c31566b1EnglishSymplectic ElementsSpringer2024Hollender, AZampetakis, M<p>Finding approximate stationary points, i.e., points where the gradient is approximately zero, of non-convex but smooth objective functions f over unrestricted d-dimensional domains is one of the most fundamental problems in classical non-convex optimization. Nevertheless, the computational and query complexity of this problem are still not well understood when the dimension d of the problem is independent of the approximation error. In this paper, we show the following computational and query complexity results:</p> <ol> <li>The problem of finding approximate stationary points over unrestricted domains is PLScomplete.</li> <li>For d = 2, we provide a zero-order algorithm for finding ε-approximate stationary points that requires at most O(1/ε) value queries to the objective function.</li> <li>We show that any algorithm needs at least Ω(1/ε) queries to the objective function and/or its gradient to find ε-approximate stationary points when d = 2. Combined with the above, this characterizes the query complexity of this problem to be Θ(1/ε).</li> <li>For d = 2, we provide a zero-order algorithm for finding ε-KKT points in constrained optimization problems that requires at most O(1/ √ ε) value queries to the objective function. This closes the gap between the works of Bubeck and Mikulincer [2020] and Vavasis [1993] and characterizes the query complexity of this problem to be Θ(1/ √ ε).</li> <li>Combining our results with the recent result of Fearnley et al. [2022], we show that finding approximate KKT points in constrained optimization is reducible to finding approximate stationary points in unconstrained optimization but the converse is impossible.</li> </ol>
spellingShingle	Hollender, A Zampetakis, M The computational complexity of finding stationary points in non-convex optimization
title	The computational complexity of finding stationary points in non-convex optimization
title_full	The computational complexity of finding stationary points in non-convex optimization
title_fullStr	The computational complexity of finding stationary points in non-convex optimization
title_full_unstemmed	The computational complexity of finding stationary points in non-convex optimization
title_short	The computational complexity of finding stationary points in non-convex optimization
title_sort	computational complexity of finding stationary points in non convex optimization
work_keys_str_mv	AT hollendera thecomputationalcomplexityoffindingstationarypointsinnonconvexoptimization AT zampetakism thecomputationalcomplexityoffindingstationarypointsinnonconvexoptimization AT hollendera computationalcomplexityoffindingstationarypointsinnonconvexoptimization AT zampetakism computationalcomplexityoffindingstationarypointsinnonconvexoptimization

The computational complexity of finding stationary points in non-convex optimization

Similar Items