PolyDAT: A Generic Data Schema for Polymer Characterization

Polymers are stochastic materials that represent distributions of different molecules. In general, to quantify the distribution, polymer researchers rely on a series of chemical characterizations that each reveal partial information on the distribution. However, in practice, the exact set of charact...

Full description

Bibliographic Details
Main Authors: Lin, Tzyy-Shyang, Rebello, Nathan J, Beech, Haley K, Wang, Zi, El-Zaatari, Bassil, Lundberg, David J, Johnson, Jeremiah A, Kalow, Julia A, Craig, Stephen L, Olsen, Bradley D
Other Authors: Massachusetts Institute of Technology. Department of Chemical Engineering
Format: Article
Language:English
Published: American Chemical Society (ACS) 2021
Online Access:https://hdl.handle.net/1721.1/136049
_version_ 1826189993831301120
author Lin, Tzyy-Shyang
Rebello, Nathan J
Beech, Haley K
Wang, Zi
El-Zaatari, Bassil
Lundberg, David J
Johnson, Jeremiah A
Kalow, Julia A
Craig, Stephen L
Olsen, Bradley D
author2 Massachusetts Institute of Technology. Department of Chemical Engineering
author_facet Massachusetts Institute of Technology. Department of Chemical Engineering
Lin, Tzyy-Shyang
Rebello, Nathan J
Beech, Haley K
Wang, Zi
El-Zaatari, Bassil
Lundberg, David J
Johnson, Jeremiah A
Kalow, Julia A
Craig, Stephen L
Olsen, Bradley D
author_sort Lin, Tzyy-Shyang
collection MIT
description Polymers are stochastic materials that represent distributions of different molecules. In general, to quantify the distribution, polymer researchers rely on a series of chemical characterizations that each reveal partial information on the distribution. However, in practice, the exact set of characterizations that are carried out, as well as how the characterization data are aggregated and reported, is largely nonstandard across the polymer community. This scenario makes polymer characterization data highly disparate, thereby significantly slowing down the development of polymer informatics. In this work, a proposal on how structural characterization data can be organized is presented. To ensure that the system can apply universally across the entire polymer community, the proposed schema, PolyDAT, is designed to embody a minimal congruent set of vocabulary that is common across different domains. Unlike most chemical schemas, where only data pertinent to the species of interest are included, PolyDAT deploys a multi-species reaction network construct, in which every characterization on relevant species is collected to provide the most comprehensive profile on the polymer species of interest. Instead of maintaining a comprehensive list of available characterization techniques, PolyDAT provides a handful of generic templates, which align closely with experimental conventions and cover most types of common characterization techniques. This allows flexibility for the development and inclusion of new measurement methods. By providing a standard format to digitalize data, PolyDAT serves not only as an extension to BigSMILES that provides the necessary quantitative information but also as a standard channel for researchers to share polymer characterization data.
first_indexed 2024-09-23T08:33:23Z
format Article
id mit-1721.1/136049
institution Massachusetts Institute of Technology
language English
last_indexed 2024-09-23T08:33:23Z
publishDate 2021
publisher American Chemical Society (ACS)
record_format dspace
spelling mit-1721.1/1360492023-09-01T19:39:39Z PolyDAT: A Generic Data Schema for Polymer Characterization Lin, Tzyy-Shyang Rebello, Nathan J Beech, Haley K Wang, Zi El-Zaatari, Bassil Lundberg, David J Johnson, Jeremiah A Kalow, Julia A Craig, Stephen L Olsen, Bradley D Massachusetts Institute of Technology. Department of Chemical Engineering Polymers are stochastic materials that represent distributions of different molecules. In general, to quantify the distribution, polymer researchers rely on a series of chemical characterizations that each reveal partial information on the distribution. However, in practice, the exact set of characterizations that are carried out, as well as how the characterization data are aggregated and reported, is largely nonstandard across the polymer community. This scenario makes polymer characterization data highly disparate, thereby significantly slowing down the development of polymer informatics. In this work, a proposal on how structural characterization data can be organized is presented. To ensure that the system can apply universally across the entire polymer community, the proposed schema, PolyDAT, is designed to embody a minimal congruent set of vocabulary that is common across different domains. Unlike most chemical schemas, where only data pertinent to the species of interest are included, PolyDAT deploys a multi-species reaction network construct, in which every characterization on relevant species is collected to provide the most comprehensive profile on the polymer species of interest. Instead of maintaining a comprehensive list of available characterization techniques, PolyDAT provides a handful of generic templates, which align closely with experimental conventions and cover most types of common characterization techniques. This allows flexibility for the development and inclusion of new measurement methods. By providing a standard format to digitalize data, PolyDAT serves not only as an extension to BigSMILES that provides the necessary quantitative information but also as a standard channel for researchers to share polymer characterization data. 2021-10-27T20:30:34Z 2021-10-27T20:30:34Z 2021 2021-06-22T16:38:14Z Article http://purl.org/eprint/type/JournalArticle https://hdl.handle.net/1721.1/136049 en 10.1021/acs.jcim.1c00028 Journal of Chemical Information and Modeling Creative Commons Attribution-NonCommercial-NoDerivs License http://creativecommons.org/licenses/by-nc-nd/4.0/ application/pdf American Chemical Society (ACS) ACS
spellingShingle Lin, Tzyy-Shyang
Rebello, Nathan J
Beech, Haley K
Wang, Zi
El-Zaatari, Bassil
Lundberg, David J
Johnson, Jeremiah A
Kalow, Julia A
Craig, Stephen L
Olsen, Bradley D
PolyDAT: A Generic Data Schema for Polymer Characterization
title PolyDAT: A Generic Data Schema for Polymer Characterization
title_full PolyDAT: A Generic Data Schema for Polymer Characterization
title_fullStr PolyDAT: A Generic Data Schema for Polymer Characterization
title_full_unstemmed PolyDAT: A Generic Data Schema for Polymer Characterization
title_short PolyDAT: A Generic Data Schema for Polymer Characterization
title_sort polydat a generic data schema for polymer characterization
url https://hdl.handle.net/1721.1/136049
work_keys_str_mv AT lintzyyshyang polydatagenericdataschemaforpolymercharacterization
AT rebellonathanj polydatagenericdataschemaforpolymercharacterization
AT beechhaleyk polydatagenericdataschemaforpolymercharacterization
AT wangzi polydatagenericdataschemaforpolymercharacterization
AT elzaataribassil polydatagenericdataschemaforpolymercharacterization
AT lundbergdavidj polydatagenericdataschemaforpolymercharacterization
AT johnsonjeremiaha polydatagenericdataschemaforpolymercharacterization
AT kalowjuliaa polydatagenericdataschemaforpolymercharacterization
AT craigstephenl polydatagenericdataschemaforpolymercharacterization
AT olsenbradleyd polydatagenericdataschemaforpolymercharacterization