Sparsity in Machine Learning: Theory and Applications

Bibliographic Details
Main Author: Lahlou Kitane, Driss
Other Authors: Bertsimas, Dimitris
Format: Thesis
Published: Massachusetts Institute of Technology 2022
Online Access: https://hdl.handle.net/1721.1/143157
description Sparsity plays a key role in machine learning for several reasons, including interpretability. Interpretability is sought by both practitioners and scientists. On one hand, interpretability can be key in practice, for example in healthcare, where black-box models cannot be used to prescribe a treatment for a patient. On the other hand, interpretability is essential to understanding phenomena that are modeled using machine learning, such as plasma electromagnetic emissions. Besides interpretability, sparsity has several other important applications, such as improving the predictive power of models and reducing operational and investment costs. Integer optimization is a highly effective tool for designing methods that tackle sparsity. It offers a rigorous framework for building sparse models and has been shown to provide more accurate and sparser models than other approaches, including those using sparsity-inducing regularization norms. This thesis focuses on the application of integer optimization to sparsity problems. We provide two applications of sparse modeling. The first applies Mixed Integer Optimization (MIO) sparse regression to Laser Induced Breakdown Spectroscopy (LIBS), a modern and important chemical analysis technique. We build a methodology for sparse and robust models in chemometrics and test it on various types of mineral ore. The MIO approach beats experts’ predictions while offering remarkably sparser models than LASSO. Because the R² achieved is higher than 0.99 in some cases, this application is, to the best of our knowledge, the first to bring empirical proof that a true support exists in nature, a concept whose existence in real-life applications the optimization community has been questioning. The second application concerns COVID testing and sparse classification. We propose a fast and simple method for the detection of SARS-CoV-2 based on spectroscopy. This novel method builds on machine learning capabilities to deliver a diagnosis in under a minute, without the use of any reagent, achieving a precision close to that of PCR. Sparse methods enable the detection of specific characteristics in the 3D structure of SARS-CoV-2 RNA and proteins. Given the importance of PCA in our research and in machine learning in general, we also provide a new approach to the sparse PCA problem. This approach is the first to generate several sparse principal components in one step, whereas existing techniques rely on deflation to generate principal components iteratively. The proposed method (GeoSPCA) generates high-quality solutions that improve the variance explained by deflation techniques by more than an order of magnitude.
spelling mit-1721.1/143157 2022-06-16T03:27:19Z Sparsity in Machine Learning: Theory and Applications Lahlou Kitane, Driss Bertsimas, Dimitris Massachusetts Institute of Technology. Operations Research Center Ph.D. 2022-06-15T13:00:09Z 2022-06-15T13:00:09Z 2022-02 2022-01-06T00:05:12.917Z Thesis https://hdl.handle.net/1721.1/143157 In Copyright - Educational Use Permitted Copyright MIT http://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology