Curated dataset of asphaltene structures
Asphaltenes, a distinct class of molecules found in crude oil, exhibit insolubility in nonpolar solvents like n-heptane but are soluble in aromatic solvents such as toluene and benzene. Understanding asphaltenes is crucial in the petroleum industry due to their detrimental effects on oil processing,...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2024-02-01
|
Series: | Data in Brief |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2352340923009502 |
_version_ | 1797317145745424384 |
---|---|
author | Madison Franke Selsela Arsala Frozan Tahiry Simon-Olivier Gingras Arun K. Sharma |
author_facet | Madison Franke Selsela Arsala Frozan Tahiry Simon-Olivier Gingras Arun K. Sharma |
author_sort | Madison Franke |
collection | DOAJ |
description | Asphaltenes, a distinct class of molecules found in crude oil, exhibit insolubility in nonpolar solvents like n-heptane but are soluble in aromatic solvents such as toluene and benzene. Understanding asphaltenes is crucial in the petroleum industry due to their detrimental effects on oil processing, resulting in significant economic losses and production disruptions. While no singular structure defines asphaltenes, two major molecular architectures, namely archipelago and continental models, have gained wide acceptance for their consistency with various experimental investigations and subsequent use in computational studies.The archipelago model comprises two or more polyaromatic hydrocarbon entities interconnected via aliphatic side chains. In contrast, the island or continental model features a unified polyaromatic hydrocarbon moiety with 4 to 10 fused aromatic rings, averaging around 7 rings. To establish a comprehensive collection, we meticulously curated over 250 asphaltene structures derived from previous experimental and computational studies in this field. Our curation process involved an extensive literature survey, conversion of figures from publications into molecular structure files, careful verification of conversion accuracy, and structure editing to ensure alignment with molecular formulas. Our database provides digital structure files and optimized geometries for both predominant structural motifs. The optimization procedure commenced with the PM6 semi-empirical method, followed by further optimization utilizing density functional theory employing the B3LYP functional and the 6-31+G(d,p) basis set. Furthermore, we compiled a range of structural and electronic features for these molecules, serving as a valuable foundation for employing machine learning algorithms to investigate asphaltenes. This work provides a ready to use structural database of asphaltenes and sets the stage for future research endeavours in this domain. |
first_indexed | 2024-03-08T03:30:38Z |
format | Article |
id | doaj.art-bee3c234a7744227843641a20690dcc7 |
institution | Directory Open Access Journal |
issn | 2352-3409 |
language | English |
last_indexed | 2024-03-08T03:30:38Z |
publishDate | 2024-02-01 |
publisher | Elsevier |
record_format | Article |
series | Data in Brief |
spelling | doaj.art-bee3c234a7744227843641a20690dcc72024-02-11T05:10:31ZengElsevierData in Brief2352-34092024-02-0152109907Curated dataset of asphaltene structuresMadison Franke0Selsela Arsala1Frozan Tahiry2Simon-Olivier Gingras3Arun K. Sharma4Department of Biology and Chemistry, California State University, Monterey Bay, Seaside, CA 93955, USADepartment of Biology and Chemistry, California State University, Monterey Bay, Seaside, CA 93955, USADepartment of Biology and Chemistry, California State University, Monterey Bay, Seaside, CA 93955, USADepartment of Biology and Chemistry, California State University, Monterey Bay, Seaside, CA 93955, USACorresponding author.; Department of Biology and Chemistry, California State University, Monterey Bay, Seaside, CA 93955, USAAsphaltenes, a distinct class of molecules found in crude oil, exhibit insolubility in nonpolar solvents like n-heptane but are soluble in aromatic solvents such as toluene and benzene. Understanding asphaltenes is crucial in the petroleum industry due to their detrimental effects on oil processing, resulting in significant economic losses and production disruptions. While no singular structure defines asphaltenes, two major molecular architectures, namely archipelago and continental models, have gained wide acceptance for their consistency with various experimental investigations and subsequent use in computational studies.The archipelago model comprises two or more polyaromatic hydrocarbon entities interconnected via aliphatic side chains. In contrast, the island or continental model features a unified polyaromatic hydrocarbon moiety with 4 to 10 fused aromatic rings, averaging around 7 rings. To establish a comprehensive collection, we meticulously curated over 250 asphaltene structures derived from previous experimental and computational studies in this field. Our curation process involved an extensive literature survey, conversion of figures from publications into molecular structure files, careful verification of conversion accuracy, and structure editing to ensure alignment with molecular formulas. Our database provides digital structure files and optimized geometries for both predominant structural motifs. The optimization procedure commenced with the PM6 semi-empirical method, followed by further optimization utilizing density functional theory employing the B3LYP functional and the 6-31+G(d,p) basis set. Furthermore, we compiled a range of structural and electronic features for these molecules, serving as a valuable foundation for employing machine learning algorithms to investigate asphaltenes. This work provides a ready to use structural database of asphaltenes and sets the stage for future research endeavours in this domain.http://www.sciencedirect.com/science/article/pii/S2352340923009502PetroleumMolecular structuresComputational chemistryCrude oil |
spellingShingle | Madison Franke Selsela Arsala Frozan Tahiry Simon-Olivier Gingras Arun K. Sharma Curated dataset of asphaltene structures Data in Brief Petroleum Molecular structures Computational chemistry Crude oil |
title | Curated dataset of asphaltene structures |
title_full | Curated dataset of asphaltene structures |
title_fullStr | Curated dataset of asphaltene structures |
title_full_unstemmed | Curated dataset of asphaltene structures |
title_short | Curated dataset of asphaltene structures |
title_sort | curated dataset of asphaltene structures |
topic | Petroleum Molecular structures Computational chemistry Crude oil |
url | http://www.sciencedirect.com/science/article/pii/S2352340923009502 |
work_keys_str_mv | AT madisonfranke curateddatasetofasphaltenestructures AT selselaarsala curateddatasetofasphaltenestructures AT frozantahiry curateddatasetofasphaltenestructures AT simonoliviergingras curateddatasetofasphaltenestructures AT arunksharma curateddatasetofasphaltenestructures |