ACRF: Aggregated Conditional Random Field for Out of Vocab (OOV) Token Representation for Hindi NER
Named entities are random, like emerging entities and complex entities. Most of the large language model’s tokenizers have fixed vocab; hence, they tokenize out-of-vocab (OOV) words into multiple sub-words during tokenization. During fine-tuning for any downstream task, these sub-words (t...
Main Authors: | Sumit Singh, Uma Shanker Tiwary |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2024-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/10422739/ |
Similar Items
-
Banner: A Cost-Sensitive Contextualized Model for Bangla Named Entity Recognition
by: Imranul Ashrafi, et al.
Published: (2020-01-01) -
Named Entity Recognition (NER) von Warndienstmeldungen im Gartenbau: Eine empirische Studie zu Design, Entwicklung und Bewertung der statistischen und Deep-Learning benutzerdefinierten NER-Modelle
by: He-Bleinagel, Xia, et al.
Published: (2022-02-01) -
Composition distillation for semantic sentence embeddings
by: Vaanavan, Sezhiyan
Published: (2024) -
Delayed Combination of Feature Embedding in Bidirectional LSTM CRF for NER
by: Chirawan Ronran, et al.
Published: (2020-10-01) -
DEVELOPMENT OF A QUESTION ANSWERING CHATBOT FOR BLOCKCHAIN DOMAIN
by: Aigerim Mansurova, et al.
Published: (2023-09-01)