MaterialBERT for natural language processing of materials science texts

A BERT (Bidirectional Encoder Representations from Transformers) model, which we named “MaterialBERT”, has been generated using scientific papers in wide area of material science as a corpus. A new vocabulary list for tokenizer was generated using material science corpus. Two BERT models with differ...

Full description

Bibliographic Details
Main Authors: Michiko Yoshitake, Fumitaka Sato, Hiroyuki Kawano, Hiroshi Teraoka
Format: Article
Language:English
Published: Taylor & Francis Group 2022-12-01
Series:Science and Technology of Advanced Materials: Methods
Subjects:
Online Access:http://dx.doi.org/10.1080/27660400.2022.2124831