Semi-Supervised Chinese Word Segmentation in Geological Domain Using Pseudo-Lexicon and Self-Training Strategy
Chinese word segmentation (CWS), which involves splitting the sequence of Chinese characters into words, is a key task in natural language processing (NLP) for Chinese. However, the complexity and flexibility of geologic terms require that domain-specific knowledge be utilized in CWS for geoscience...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2025-01-01
|
Series: | Applied Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/2076-3417/15/3/1404 |