MaterialBERT for natural language processing of materials science texts
A BERT (Bidirectional Encoder Representations from Transformers) model, which we named “MaterialBERT”, has been generated using scientific papers in wide area of material science as a corpus. A new vocabulary list for tokenizer was generated using material science corpus. Two BERT models with differ...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Taylor & Francis Group
2022-12-01
|
Series: | Science and Technology of Advanced Materials: Methods |
Subjects: | |
Online Access: | http://dx.doi.org/10.1080/27660400.2022.2124831 |