Fourier transform ion cyclotron resonance mass spectrometry for petroleomics

<p>The past two decades have witnessed tremendous advances in the field of high accuracy, high mass resolution data acquisition of complex samples such as crude oils and the human proteome. With the development of Fourier transform ion cyclotron resonance mass spectrometry, the rapidly growing...

Full description

Bibliographic Details
Main Author: Hauschild, J
Other Authors: Cartwright, H
Format: Thesis
Language:English
Published: 2012
Subjects:
_version_ 1797079730013339648
author Hauschild, J
author2 Cartwright, H
author_facet Cartwright, H
Hauschild, J
author_sort Hauschild, J
collection OXFORD
description <p>The past two decades have witnessed tremendous advances in the field of high accuracy, high mass resolution data acquisition of complex samples such as crude oils and the human proteome. With the development of Fourier transform ion cyclotron resonance mass spectrometry, the rapidly growing field of petroleomics has emerged, whose goal is to process and analyse the large volumes of complex and often poorly understood data on crude oils generated by mass spectrometry.</p><p>As global oil resources deplete, oil companies are increasingly moving towards the extraction and refining of the still plentiful reserves of heavy, carbon rich and highly contaminated crude oil. It is essential that the oil industry gather the maximum possible amount of information about the crude oil prior to setting up the drilling infrastructure, in order to reduce processing costs.</p><p>This project describes how machine learning can be used as a novel way to extract critical information from complex mass spectra which will aid in the processing of crude oils.</p><p>The thesis discusses the experimental methods involved in acquiring high accuracy mass spectral data for a large and key industry-standard set of crude oil samples. These data are subsequently analysed to identify possible links between the raw mass spectra and certain physical properties of the oils, such as pour point and sulphur content.</p><p>Methods including artificial neural networks and self organising maps are described and the use of spectral clustering and pattern recognition to classify crude oils is investigated. The main focus of the research, the creation of an original simulated annealing genetic algorithm hybrid technique (SAGA), is discussed in detail and the successes of modelling a number of different datasets using all described methods are outlined.</p><p>Despite the complexity of the underlying mass spectrometry data, which reflects the considerable chemical diversity of the samples themselves, the results show that physical properties can be modelled with varying degrees of success. When modelling pour point temperatures, the artificial neural network achieved an average prediction error of less than 10% while SAGA predicted the same values with an average accuracy of more than 85%. It did not prove possible to model any of the other properties with such statistical significance; however improvements to feature extraction and pre-processing of the spectral data as well as enhancement of the modelling techniques should yield more consistent and statistically reliable results.</p><p>These should in due course lead to a comprehensive model which the oil industry can use to process crude oil data using rapid and cost effective analytical methods.</p>
first_indexed 2024-03-07T00:49:54Z
format Thesis
id oxford-uuid:8604a373-fb6b-4bc0-8dc1-464a191b1fac
institution University of Oxford
language English
last_indexed 2024-03-07T00:49:54Z
publishDate 2012
record_format dspace
spelling oxford-uuid:8604a373-fb6b-4bc0-8dc1-464a191b1fac2022-03-26T22:01:20ZFourier transform ion cyclotron resonance mass spectrometry for petroleomicsThesishttp://purl.org/coar/resource_type/c_db06uuid:8604a373-fb6b-4bc0-8dc1-464a191b1facPhysical & theoretical chemistryComputational chemistryChemistry & allied sciencesPhysical SciencesMass spectrometryEnglishOxford University Research Archive - Valet2012Hauschild, JCartwright, H<p>The past two decades have witnessed tremendous advances in the field of high accuracy, high mass resolution data acquisition of complex samples such as crude oils and the human proteome. With the development of Fourier transform ion cyclotron resonance mass spectrometry, the rapidly growing field of petroleomics has emerged, whose goal is to process and analyse the large volumes of complex and often poorly understood data on crude oils generated by mass spectrometry.</p><p>As global oil resources deplete, oil companies are increasingly moving towards the extraction and refining of the still plentiful reserves of heavy, carbon rich and highly contaminated crude oil. It is essential that the oil industry gather the maximum possible amount of information about the crude oil prior to setting up the drilling infrastructure, in order to reduce processing costs.</p><p>This project describes how machine learning can be used as a novel way to extract critical information from complex mass spectra which will aid in the processing of crude oils.</p><p>The thesis discusses the experimental methods involved in acquiring high accuracy mass spectral data for a large and key industry-standard set of crude oil samples. These data are subsequently analysed to identify possible links between the raw mass spectra and certain physical properties of the oils, such as pour point and sulphur content.</p><p>Methods including artificial neural networks and self organising maps are described and the use of spectral clustering and pattern recognition to classify crude oils is investigated. The main focus of the research, the creation of an original simulated annealing genetic algorithm hybrid technique (SAGA), is discussed in detail and the successes of modelling a number of different datasets using all described methods are outlined.</p><p>Despite the complexity of the underlying mass spectrometry data, which reflects the considerable chemical diversity of the samples themselves, the results show that physical properties can be modelled with varying degrees of success. When modelling pour point temperatures, the artificial neural network achieved an average prediction error of less than 10% while SAGA predicted the same values with an average accuracy of more than 85%. It did not prove possible to model any of the other properties with such statistical significance; however improvements to feature extraction and pre-processing of the spectral data as well as enhancement of the modelling techniques should yield more consistent and statistically reliable results.</p><p>These should in due course lead to a comprehensive model which the oil industry can use to process crude oil data using rapid and cost effective analytical methods.</p>
spellingShingle Physical & theoretical chemistry
Computational chemistry
Chemistry & allied sciences
Physical Sciences
Mass spectrometry
Hauschild, J
Fourier transform ion cyclotron resonance mass spectrometry for petroleomics
title Fourier transform ion cyclotron resonance mass spectrometry for petroleomics
title_full Fourier transform ion cyclotron resonance mass spectrometry for petroleomics
title_fullStr Fourier transform ion cyclotron resonance mass spectrometry for petroleomics
title_full_unstemmed Fourier transform ion cyclotron resonance mass spectrometry for petroleomics
title_short Fourier transform ion cyclotron resonance mass spectrometry for petroleomics
title_sort fourier transform ion cyclotron resonance mass spectrometry for petroleomics
topic Physical & theoretical chemistry
Computational chemistry
Chemistry & allied sciences
Physical Sciences
Mass spectrometry
work_keys_str_mv AT hauschildj fouriertransformioncyclotronresonancemassspectrometryforpetroleomics