Sparsity in Machine Learning: Theory and Applications

Sparsity plays a key role in machine learning for several reasons including interpretability. Interpretability is sought either by practitioners or by scientists. Indeed, on one hand interpretability can be key in a practice such as in healthcare, in which black box models cannot be used for the pre...

Full description

Bibliographic Details
Main Author:	Lahlou Kitane, Driss
Other Authors:	Bertsimas, Dimitris
Format:	Thesis
Published:	Massachusetts Institute of Technology 2022
Online Access:	https://hdl.handle.net/1721.1/143157

_version_	1826205096898199552
author	Lahlou Kitane, Driss
author2	Bertsimas, Dimitris
author_facet	Bertsimas, Dimitris Lahlou Kitane, Driss
author_sort	Lahlou Kitane, Driss
collection	MIT
description	Sparsity plays a key role in machine learning for several reasons including interpretability. Interpretability is sought either by practitioners or by scientists. Indeed, on one hand interpretability can be key in a practice such as in healthcare, in which black box models cannot be used for the prescription of a treatment for a patient. On the other hand, interpretability is essential in understanding of phenomena that are modelled using machine learning such as plasma electromagnetic emissions. Besides interpretability, sparsity has several other important applications such as improvement of the predictive power of models and reduction of operational and investment costs. Integer optimization is a highly effective tool in the conception of methods to tackle sparsity. It offers a rigorous framework to build sparse models and has proved to provide more accurate and sparse models than other approaches including the ones using sparsity-inducing regularization norms. This thesis focuses on the application of integer optimization to address sparsity problems. We provide two applications of sparse modeling. The first one is related to the application of Mixed Integer Optimization (MIO) sparse regression to Laser Induced Breakdown Spectroscopy (LIBS), a modern and important chemical analysis technique. We build a methodology for sparse and robust models in chemometrics and test it on various types of mineral ore. The MIO approach beats experts’ predictions while offering remarkably sparser models compared to 𝐿𝐴𝑆𝑆𝑂. As the 𝑅2 achieved is higher than .99 in some cases, this application is, to the best of our knowledge, the first application that brings empirical proof that a true support exists in nature as the optimization community has been questioning the existence of such a concept in real life applications. The second application is related to COVID testing and sparse classification. We propose a fast and simple method for the detection of SARS-CoV-2 based on spectroscopy. This novel method builds on machine learning capabilities to deliver diagnosis in under a minute, without the use of any reagent, achieving a precision close to that of PCR. Sparse methods enable the detection of specific characteristics in the 3D structure of SARS-CoV-2 RNA and proteins. Given the importance PCA plays in our research and in machine learning in general, we also provide a new approach to tackle the sparse PCA problem. This approach is the first to generate several sparse principal components in one step, while existing techniques rely instead on deflation to iteratively generate principal components. The method proposed (GeoSPCA) generates high quality solutions that improves the variance explained by deflation techniques by more than an order of magnitude.
first_indexed	2024-09-23T13:06:42Z
format	Thesis
id	mit-1721.1/143157
institution	Massachusetts Institute of Technology
last_indexed	2024-09-23T13:06:42Z
publishDate	2022
publisher	Massachusetts Institute of Technology
record_format	dspace
spelling	mit-1721.1/1431572022-06-16T03:27:19Z Sparsity in Machine Learning: Theory and Applications Lahlou Kitane, Driss Bertsimas, Dimitris Massachusetts Institute of Technology. Operations Research Center Sparsity plays a key role in machine learning for several reasons including interpretability. Interpretability is sought either by practitioners or by scientists. Indeed, on one hand interpretability can be key in a practice such as in healthcare, in which black box models cannot be used for the prescription of a treatment for a patient. On the other hand, interpretability is essential in understanding of phenomena that are modelled using machine learning such as plasma electromagnetic emissions. Besides interpretability, sparsity has several other important applications such as improvement of the predictive power of models and reduction of operational and investment costs. Integer optimization is a highly effective tool in the conception of methods to tackle sparsity. It offers a rigorous framework to build sparse models and has proved to provide more accurate and sparse models than other approaches including the ones using sparsity-inducing regularization norms. This thesis focuses on the application of integer optimization to address sparsity problems. We provide two applications of sparse modeling. The first one is related to the application of Mixed Integer Optimization (MIO) sparse regression to Laser Induced Breakdown Spectroscopy (LIBS), a modern and important chemical analysis technique. We build a methodology for sparse and robust models in chemometrics and test it on various types of mineral ore. The MIO approach beats experts’ predictions while offering remarkably sparser models compared to 𝐿𝐴𝑆𝑆𝑂. As the 𝑅2 achieved is higher than .99 in some cases, this application is, to the best of our knowledge, the first application that brings empirical proof that a true support exists in nature as the optimization community has been questioning the existence of such a concept in real life applications. The second application is related to COVID testing and sparse classification. We propose a fast and simple method for the detection of SARS-CoV-2 based on spectroscopy. This novel method builds on machine learning capabilities to deliver diagnosis in under a minute, without the use of any reagent, achieving a precision close to that of PCR. Sparse methods enable the detection of specific characteristics in the 3D structure of SARS-CoV-2 RNA and proteins. Given the importance PCA plays in our research and in machine learning in general, we also provide a new approach to tackle the sparse PCA problem. This approach is the first to generate several sparse principal components in one step, while existing techniques rely instead on deflation to iteratively generate principal components. The method proposed (GeoSPCA) generates high quality solutions that improves the variance explained by deflation techniques by more than an order of magnitude. Ph.D. 2022-06-15T13:00:09Z 2022-06-15T13:00:09Z 2022-02 2022-01-06T00:05:12.917Z Thesis https://hdl.handle.net/1721.1/143157 In Copyright - Educational Use Permitted Copyright MIT http://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology
spellingShingle	Lahlou Kitane, Driss Sparsity in Machine Learning: Theory and Applications
title	Sparsity in Machine Learning: Theory and Applications
title_full	Sparsity in Machine Learning: Theory and Applications
title_fullStr	Sparsity in Machine Learning: Theory and Applications
title_full_unstemmed	Sparsity in Machine Learning: Theory and Applications
title_short	Sparsity in Machine Learning: Theory and Applications
title_sort	sparsity in machine learning theory and applications
url	https://hdl.handle.net/1721.1/143157
work_keys_str_mv	AT lahloukitanedriss sparsityinmachinelearningtheoryandapplications

Sparsity in Machine Learning: Theory and Applications

Similar Items