Computation of the maximum likelihood estimator in low-rank factor analysis

Abstract Factor analysis is a classical multivariate dimensionality reduction technique popularly used in statistics, econometrics and data science. Estimation for factor analysis is often carried out via the maximum likelihood principle, which seeks to maximize the Gaussian likelihoo...

Full description

Bibliographic Details
Main Authors: Khamaru, Koulik, Mazumder, Rahul
Other Authors: Sloan School of Management
Format: Article
Language:English
Published: Springer Berlin Heidelberg 2021
Online Access:https://hdl.handle.net/1721.1/131369
_version_ 1811078057852665856
author Khamaru, Koulik
Mazumder, Rahul
author2 Sloan School of Management
author_facet Sloan School of Management
Khamaru, Koulik
Mazumder, Rahul
author_sort Khamaru, Koulik
collection MIT
description Abstract Factor analysis is a classical multivariate dimensionality reduction technique popularly used in statistics, econometrics and data science. Estimation for factor analysis is often carried out via the maximum likelihood principle, which seeks to maximize the Gaussian likelihood under the assumption that the positive definite covariance matrix can be decomposed as the sum of a low-rank positive semidefinite matrix and a diagonal matrix with nonnegative entries. This leads to a challenging rank constrained nonconvex optimization problem, for which very few reliable computational algorithms are available. We reformulate the low-rank maximum likelihood factor analysis task as a nonlinear nonsmooth semidefinite optimization problem, study various structural properties of this reformulation; and propose fast and scalable algorithms based on difference of convex optimization. Our approach has computational guarantees, gracefully scales to large problems, is applicable to situations where the sample covariance matrix is rank deficient and adapts to variants of the maximum likelihood problem with additional constraints on the model parameters. Our numerical experiments validate the usefulness of our approach over existing state-of-the-art approaches for maximum likelihood factor analysis.
first_indexed 2024-09-23T10:52:42Z
format Article
id mit-1721.1/131369
institution Massachusetts Institute of Technology
language English
last_indexed 2024-09-23T10:52:42Z
publishDate 2021
publisher Springer Berlin Heidelberg
record_format dspace
spelling mit-1721.1/1313692023-09-27T18:00:17Z Computation of the maximum likelihood estimator in low-rank factor analysis Khamaru, Koulik Mazumder, Rahul Sloan School of Management Massachusetts Institute of Technology. Operations Research Center Abstract Factor analysis is a classical multivariate dimensionality reduction technique popularly used in statistics, econometrics and data science. Estimation for factor analysis is often carried out via the maximum likelihood principle, which seeks to maximize the Gaussian likelihood under the assumption that the positive definite covariance matrix can be decomposed as the sum of a low-rank positive semidefinite matrix and a diagonal matrix with nonnegative entries. This leads to a challenging rank constrained nonconvex optimization problem, for which very few reliable computational algorithms are available. We reformulate the low-rank maximum likelihood factor analysis task as a nonlinear nonsmooth semidefinite optimization problem, study various structural properties of this reformulation; and propose fast and scalable algorithms based on difference of convex optimization. Our approach has computational guarantees, gracefully scales to large problems, is applicable to situations where the sample covariance matrix is rank deficient and adapts to variants of the maximum likelihood problem with additional constraints on the model parameters. Our numerical experiments validate the usefulness of our approach over existing state-of-the-art approaches for maximum likelihood factor analysis. 2021-09-20T17:16:45Z 2021-09-20T17:16:45Z 2019-03-02 2020-09-24T21:02:06Z Article http://purl.org/eprint/type/JournalArticle https://hdl.handle.net/1721.1/131369 en https://doi.org/10.1007/s10107-019-01370-7 Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/ Springer-Verlag GmbH Germany, part of Springer Nature and Mathematical Optimization Society application/pdf Springer Berlin Heidelberg Springer Berlin Heidelberg
spellingShingle Khamaru, Koulik
Mazumder, Rahul
Computation of the maximum likelihood estimator in low-rank factor analysis
title Computation of the maximum likelihood estimator in low-rank factor analysis
title_full Computation of the maximum likelihood estimator in low-rank factor analysis
title_fullStr Computation of the maximum likelihood estimator in low-rank factor analysis
title_full_unstemmed Computation of the maximum likelihood estimator in low-rank factor analysis
title_short Computation of the maximum likelihood estimator in low-rank factor analysis
title_sort computation of the maximum likelihood estimator in low rank factor analysis
url https://hdl.handle.net/1721.1/131369
work_keys_str_mv AT khamarukoulik computationofthemaximumlikelihoodestimatorinlowrankfactoranalysis
AT mazumderrahul computationofthemaximumlikelihoodestimatorinlowrankfactoranalysis