WEB-BASED DUPLICATE RECORDS DETECTION WITH ARABIC LANGUAGE ENHANCEMENT
Sharing data between organizations is of growing importance in many data mining projects. Data from various heterogeneous sources often has to be linked and aggregated to improve data quality. The importance of data accuracy and quality has increased with the explosion of data size. The first...
Main Authors: | Azza Higazy, Amany Sarhan, Tarek El-Tobely |
---|---|
Format: | Article |
Language: | Arabic |
Published: | Faculty of Engineering, Tanta University, 2015-12-01 |
Series: | Journal of Engineering Research - Egypt |
Subjects: | |
Online Access: | https://erjeng.journals.ekb.eg/article_126816_f7f43ef1431f9249990df2ee1689f8e9.pdf |
Similar Items
- plantR: An R package and workflow for managing species records from biological collections
  by: Renato A. F. deLima, et al.
  Published: (2023-02-01)
- Analytics on Non-Normalized Data Sources: More Learning, Rather Than More Cleaning
  by: Alexis Cvetkov-Iliev, et al.
  Published: (2022-01-01)
- Duplicate Literature Detection for Cross-Library Search
  by: Liu Wei, et al.
  Published: (2016-06-01)
- mvp – an open‐source preprocessor for cleaning duplicate records and missing values in mass spectrometry data
  by: Geunho Lee, et al.
  Published: (2017-07-01)
- A Matching Algorithm Based on Voronoi Diagram for Multi-Scale Polygonal Residential Areas
  by: Jianhua Wu, et al.
  Published: (2018-01-01)