Curated dataset of asphaltene structures

Asphaltenes, a distinct class of molecules found in crude oil, exhibit insolubility in nonpolar solvents like n-heptane but are soluble in aromatic solvents such as toluene and benzene. Understanding asphaltenes is crucial in the petroleum industry due to their detrimental effects on oil processing,...

Full description

Bibliographic Details
Main Authors: Madison Franke, Selsela Arsala, Frozan Tahiry, Simon-Olivier Gingras, Arun K. Sharma
Format: Article
Language:English
Published: Elsevier 2024-02-01
Series:Data in Brief
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2352340923009502
_version_ 1797317145745424384
author Madison Franke
Selsela Arsala
Frozan Tahiry
Simon-Olivier Gingras
Arun K. Sharma
author_facet Madison Franke
Selsela Arsala
Frozan Tahiry
Simon-Olivier Gingras
Arun K. Sharma
author_sort Madison Franke
collection DOAJ
description Asphaltenes, a distinct class of molecules found in crude oil, exhibit insolubility in nonpolar solvents like n-heptane but are soluble in aromatic solvents such as toluene and benzene. Understanding asphaltenes is crucial in the petroleum industry due to their detrimental effects on oil processing, resulting in significant economic losses and production disruptions. While no singular structure defines asphaltenes, two major molecular architectures, namely archipelago and continental models, have gained wide acceptance for their consistency with various experimental investigations and subsequent use in computational studies.The archipelago model comprises two or more polyaromatic hydrocarbon entities interconnected via aliphatic side chains. In contrast, the island or continental model features a unified polyaromatic hydrocarbon moiety with 4 to 10 fused aromatic rings, averaging around 7 rings. To establish a comprehensive collection, we meticulously curated over 250 asphaltene structures derived from previous experimental and computational studies in this field. Our curation process involved an extensive literature survey, conversion of figures from publications into molecular structure files, careful verification of conversion accuracy, and structure editing to ensure alignment with molecular formulas. Our database provides digital structure files and optimized geometries for both predominant structural motifs. The optimization procedure commenced with the PM6 semi-empirical method, followed by further optimization utilizing density functional theory employing the B3LYP functional and the 6-31+G(d,p) basis set. Furthermore, we compiled a range of structural and electronic features for these molecules, serving as a valuable foundation for employing machine learning algorithms to investigate asphaltenes. This work provides a ready to use structural database of asphaltenes and sets the stage for future research endeavours in this domain.
first_indexed 2024-03-08T03:30:38Z
format Article
id doaj.art-bee3c234a7744227843641a20690dcc7
institution Directory Open Access Journal
issn 2352-3409
language English
last_indexed 2024-03-08T03:30:38Z
publishDate 2024-02-01
publisher Elsevier
record_format Article
series Data in Brief
spelling doaj.art-bee3c234a7744227843641a20690dcc72024-02-11T05:10:31ZengElsevierData in Brief2352-34092024-02-0152109907Curated dataset of asphaltene structuresMadison Franke0Selsela Arsala1Frozan Tahiry2Simon-Olivier Gingras3Arun K. Sharma4Department of Biology and Chemistry, California State University, Monterey Bay, Seaside, CA 93955, USADepartment of Biology and Chemistry, California State University, Monterey Bay, Seaside, CA 93955, USADepartment of Biology and Chemistry, California State University, Monterey Bay, Seaside, CA 93955, USADepartment of Biology and Chemistry, California State University, Monterey Bay, Seaside, CA 93955, USACorresponding author.; Department of Biology and Chemistry, California State University, Monterey Bay, Seaside, CA 93955, USAAsphaltenes, a distinct class of molecules found in crude oil, exhibit insolubility in nonpolar solvents like n-heptane but are soluble in aromatic solvents such as toluene and benzene. Understanding asphaltenes is crucial in the petroleum industry due to their detrimental effects on oil processing, resulting in significant economic losses and production disruptions. While no singular structure defines asphaltenes, two major molecular architectures, namely archipelago and continental models, have gained wide acceptance for their consistency with various experimental investigations and subsequent use in computational studies.The archipelago model comprises two or more polyaromatic hydrocarbon entities interconnected via aliphatic side chains. In contrast, the island or continental model features a unified polyaromatic hydrocarbon moiety with 4 to 10 fused aromatic rings, averaging around 7 rings. To establish a comprehensive collection, we meticulously curated over 250 asphaltene structures derived from previous experimental and computational studies in this field. Our curation process involved an extensive literature survey, conversion of figures from publications into molecular structure files, careful verification of conversion accuracy, and structure editing to ensure alignment with molecular formulas. Our database provides digital structure files and optimized geometries for both predominant structural motifs. The optimization procedure commenced with the PM6 semi-empirical method, followed by further optimization utilizing density functional theory employing the B3LYP functional and the 6-31+G(d,p) basis set. Furthermore, we compiled a range of structural and electronic features for these molecules, serving as a valuable foundation for employing machine learning algorithms to investigate asphaltenes. This work provides a ready to use structural database of asphaltenes and sets the stage for future research endeavours in this domain.http://www.sciencedirect.com/science/article/pii/S2352340923009502PetroleumMolecular structuresComputational chemistryCrude oil
spellingShingle Madison Franke
Selsela Arsala
Frozan Tahiry
Simon-Olivier Gingras
Arun K. Sharma
Curated dataset of asphaltene structures
Data in Brief
Petroleum
Molecular structures
Computational chemistry
Crude oil
title Curated dataset of asphaltene structures
title_full Curated dataset of asphaltene structures
title_fullStr Curated dataset of asphaltene structures
title_full_unstemmed Curated dataset of asphaltene structures
title_short Curated dataset of asphaltene structures
title_sort curated dataset of asphaltene structures
topic Petroleum
Molecular structures
Computational chemistry
Crude oil
url http://www.sciencedirect.com/science/article/pii/S2352340923009502
work_keys_str_mv AT madisonfranke curateddatasetofasphaltenestructures
AT selselaarsala curateddatasetofasphaltenestructures
AT frozantahiry curateddatasetofasphaltenestructures
AT simonoliviergingras curateddatasetofasphaltenestructures
AT arunksharma curateddatasetofasphaltenestructures