Evaluation of Filtering Methods Applied to the Unstructured Datasets in the Predictive Learning Services
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: | EDP Sciences, 2017-01-01 |
Series: | ITM Web of Conferences |
Online Access: | https://doi.org/10.1051/itmconf/20171002005 |
Summary: | Predictive learning services aggregate and homogenize open data from public sources, in particular from online recruitment agencies. However, a sample of vacancies may contain a varying percentage of noise due to the frequent occurrence of homonyms. This article considers two approaches to noise reduction: the first is based on cosine similarity and the second on contextual words. |
ISSN: | 2271-2097 |