Autonomous data extraction from peer reviewed literature for training machine learning models of oxidation potentials
We present an automated data-collection pipeline involving a convolutional neural network and a large language model to extract user-specified tabular data from peer-reviewed literature. The pipeline is applied to 74 reports published between 1957 and 2014 with experimentally-measured oxidation pote...
Main Authors: | Siwoo Lee, Stefan Heinen, Danish Khan, O Anatole von Lilienfeld |
---|---|
Format: | Article |
Language: | English |
Published: |
IOP Publishing
2024-01-01
|
Series: | Machine Learning: Science and Technology |
Subjects: | |
Online Access: | https://doi.org/10.1088/2632-2153/ad2f52 |
Similar Items
-
Reducing training data needs with minimal multilevel machine learning (M3L)
by: Stefan Heinen, et al.
Published: (2024-01-01) -
Operator Quantum Machine Learning: Navigating the Chemical Space of Response Properties
by: Anders S. Christensen, et al.
Published: (2019-12-01) -
Many Molecular Properties from One Kernel in Chemical Space
by: Raghunathan Ramakrishnan, et al.
Published: (2015-04-01) -
Quantum Mechanical Treatment of Variable Molecular Composition: From 'Alchemical' Changes of State Functions to Rational Compound Design
by: K. Y. Samuel Chang, et al.
Published: (2014-09-01) -
Quasi-autonomous quantum thermal machines and quantum to classical energy flow
by: Max F Frenzel, et al.
Published: (2016-01-01)