Modeling cannabinoids from a large-scale sample of Cannabis sativa chemotypes.
The widespread legalization of Cannabis has opened the industry to using contemporary analytical techniques for chemotype analysis. Chemotypic data has been collected on a large variety of oil profiles inherent to the cultivars that are commercially available. The unknown gene regulation and pharmac...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2020-01-01
|
Series: | PLoS ONE |
Online Access: | https://doi.org/10.1371/journal.pone.0236878 |
_version_ | 1818584058100187136 |
---|---|
author | Daniela Vergara Reggie Gaudino Thomas Blank Brian Keegan |
author_facet | Daniela Vergara Reggie Gaudino Thomas Blank Brian Keegan |
author_sort | Daniela Vergara |
collection | DOAJ |
description | The widespread legalization of Cannabis has opened the industry to using contemporary analytical techniques for chemotype analysis. Chemotypic data has been collected on a large variety of oil profiles inherent to the cultivars that are commercially available. The unknown gene regulation and pharmacokinetics of dozens of cannabinoids offer opportunities of high interest in pharmacology research. Retailers in many medical and recreational jurisdictions are typically required to report chemical concentrations of at least some cannabinoids. Commercial cannabis laboratories have collected large chemotype datasets of diverse Cannabis cultivars. In this work a data set of 17,600 cultivars tested by Steep Hill Inc., is examined using machine learning techniques to interpolate missing chemotype observations and cluster cultivars into groups based on chemotype similarity. The results indicate cultivars cluster based on their chemotypes, and that some imputation methods work better than others at grouping these cultivars based on chemotypic identity. Due to the missing data and to the low signal to noise ratio for some less common cannabinoids, their behavior could not be accurately predicted. These findings have implications for characterizing complex interactions in cannabinoid biosynthesis and improving phenotypical classification of Cannabis cultivars. |
first_indexed | 2024-12-16T08:15:08Z |
format | Article |
id | doaj.art-fb6d4311817146f3bb86dd7b1293c402 |
institution | Directory Open Access Journal |
issn | 1932-6203 |
language | English |
last_indexed | 2024-12-16T08:15:08Z |
publishDate | 2020-01-01 |
publisher | Public Library of Science (PLoS) |
record_format | Article |
series | PLoS ONE |
spelling | doaj.art-fb6d4311817146f3bb86dd7b1293c4022022-12-21T22:38:16ZengPublic Library of Science (PLoS)PLoS ONE1932-62032020-01-01159e023687810.1371/journal.pone.0236878Modeling cannabinoids from a large-scale sample of Cannabis sativa chemotypes.Daniela VergaraReggie GaudinoThomas BlankBrian KeeganThe widespread legalization of Cannabis has opened the industry to using contemporary analytical techniques for chemotype analysis. Chemotypic data has been collected on a large variety of oil profiles inherent to the cultivars that are commercially available. The unknown gene regulation and pharmacokinetics of dozens of cannabinoids offer opportunities of high interest in pharmacology research. Retailers in many medical and recreational jurisdictions are typically required to report chemical concentrations of at least some cannabinoids. Commercial cannabis laboratories have collected large chemotype datasets of diverse Cannabis cultivars. In this work a data set of 17,600 cultivars tested by Steep Hill Inc., is examined using machine learning techniques to interpolate missing chemotype observations and cluster cultivars into groups based on chemotype similarity. The results indicate cultivars cluster based on their chemotypes, and that some imputation methods work better than others at grouping these cultivars based on chemotypic identity. Due to the missing data and to the low signal to noise ratio for some less common cannabinoids, their behavior could not be accurately predicted. These findings have implications for characterizing complex interactions in cannabinoid biosynthesis and improving phenotypical classification of Cannabis cultivars.https://doi.org/10.1371/journal.pone.0236878 |
spellingShingle | Daniela Vergara Reggie Gaudino Thomas Blank Brian Keegan Modeling cannabinoids from a large-scale sample of Cannabis sativa chemotypes. PLoS ONE |
title | Modeling cannabinoids from a large-scale sample of Cannabis sativa chemotypes. |
title_full | Modeling cannabinoids from a large-scale sample of Cannabis sativa chemotypes. |
title_fullStr | Modeling cannabinoids from a large-scale sample of Cannabis sativa chemotypes. |
title_full_unstemmed | Modeling cannabinoids from a large-scale sample of Cannabis sativa chemotypes. |
title_short | Modeling cannabinoids from a large-scale sample of Cannabis sativa chemotypes. |
title_sort | modeling cannabinoids from a large scale sample of cannabis sativa chemotypes |
url | https://doi.org/10.1371/journal.pone.0236878 |
work_keys_str_mv | AT danielavergara modelingcannabinoidsfromalargescalesampleofcannabissativachemotypes AT reggiegaudino modelingcannabinoidsfromalargescalesampleofcannabissativachemotypes AT thomasblank modelingcannabinoidsfromalargescalesampleofcannabissativachemotypes AT briankeegan modelingcannabinoidsfromalargescalesampleofcannabissativachemotypes |