Transfer learning for named-entity recognition with neural networks

Recent approaches based on artificial neural networks (ANNs) have shown promising results for named-entity recognition (NER). In order to achieve high performances, ANNs need to be trained on a large labeled dataset. However, labels might be difficult to obtain for the dataset on which the user want...

Full description

Bibliographic Details
Main Authors: Lee, Ji Young, Dernoncourt, Franck, Szolovits, Peter
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Format: Article
Language:English
Published: European Language Resources Association 2020
Online Access:https://hdl.handle.net/1721.1/123340
_version_ 1811068448035307520
author Lee, Ji Young
Dernoncourt, Franck
Szolovits, Peter
author2 Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
author_facet Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Lee, Ji Young
Dernoncourt, Franck
Szolovits, Peter
author_sort Lee, Ji Young
collection MIT
description Recent approaches based on artificial neural networks (ANNs) have shown promising results for named-entity recognition (NER). In order to achieve high performances, ANNs need to be trained on a large labeled dataset. However, labels might be difficult to obtain for the dataset on which the user wants to perform NER: label scarcity is particularly pronounced for patient note de-identification, which is an instance of NER. In this work, we analyze to what extent transfer learning may address this issue. In particular, we demonstrate that transferring an ANN model trained on a large labeled dataset to another dataset with a limited number of labels improves upon the state-of-the-art results on two different datasets for patient note de-identification.
first_indexed 2024-09-23T07:56:09Z
format Article
id mit-1721.1/123340
institution Massachusetts Institute of Technology
language English
last_indexed 2024-09-23T07:56:09Z
publishDate 2020
publisher European Language Resources Association
record_format dspace
spelling mit-1721.1/1233402022-09-23T09:44:24Z Transfer learning for named-entity recognition with neural networks Lee, Ji Young Dernoncourt, Franck Szolovits, Peter Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Recent approaches based on artificial neural networks (ANNs) have shown promising results for named-entity recognition (NER). In order to achieve high performances, ANNs need to be trained on a large labeled dataset. However, labels might be difficult to obtain for the dataset on which the user wants to perform NER: label scarcity is particularly pronounced for patient note de-identification, which is an instance of NER. In this work, we analyze to what extent transfer learning may address this issue. In particular, we demonstrate that transferring an ANN model trained on a large labeled dataset to another dataset with a limited number of labels improves upon the state-of-the-art results on two different datasets for patient note de-identification. 2020-01-07T20:30:10Z 2020-01-07T20:30:10Z 2018-05 2019-07-11T12:20:15Z Article http://purl.org/eprint/type/ConferencePaper 979-10-95546-00-9 https://hdl.handle.net/1721.1/123340 Lee, Ji Young et al. "Transfer Learning for Named-Entity Recognition with Neural Networks." LREC 2018: Eleventh International Conference on Language Resources and Evaluation, May 2018, Miyazaki, Japan, European Language Resources Association, May 2018 © 2018 LREC en http://www.lrec-conf.org/proceedings/lrec2018/index.html LREC 2018: Eleventh International Conference on Language Resources and Evaluation Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/ application/pdf European Language Resources Association arXiv
spellingShingle Lee, Ji Young
Dernoncourt, Franck
Szolovits, Peter
Transfer learning for named-entity recognition with neural networks
title Transfer learning for named-entity recognition with neural networks
title_full Transfer learning for named-entity recognition with neural networks
title_fullStr Transfer learning for named-entity recognition with neural networks
title_full_unstemmed Transfer learning for named-entity recognition with neural networks
title_short Transfer learning for named-entity recognition with neural networks
title_sort transfer learning for named entity recognition with neural networks
url https://hdl.handle.net/1721.1/123340
work_keys_str_mv AT leejiyoung transferlearningfornamedentityrecognitionwithneuralnetworks
AT dernoncourtfranck transferlearningfornamedentityrecognitionwithneuralnetworks
AT szolovitspeter transferlearningfornamedentityrecognitionwithneuralnetworks