Redblock: a tool for online deduplication on large datasets
Online data deduplication aims to identify records that represent the same purpose on a continuous data flow environment. It must be able to process a range of information with high effectiveness and no delays. The purpose of this paper is to introduce a developed tool entitled Redblock, for real-ti...
Main Authors: | Luan Félix Pimentel, Igor Lemos Vicente, Guilherme Dal Bianco |
---|---|
Format: | Article |
Language: | English |
Published: |
Universidade de Passo Fundo (UPF)
2017-07-01
|
Series: | Revista Brasileira de Computação Aplicada |
Subjects: | |
Online Access: | http://seer.upf.br/index.php/rbca/article/view/7143 |
Similar Items
-
Dataset Deduplication with Datamodels
by: Liao, Yunxing
Published: (2022) -
Um olhar sobre a matemática no ensino integrado
by: Aline Picoli Sonza, et al.
Published: (2022-01-01) -
Integração transfronteiriça: ressignificar sentidos, com “novos” atores
by: Gustavo Oliveira Vieira
Published: (2019-06-01) -
Democracia, institucionalidade e processos decisórios no MERCOSUL e na União Europeia
by: Guilherme Rossi
Published: (2020-08-01) -
Multiculturalismo em educação: o atendimento escolar de alunos bolivianos e descendentes
by: Elaine Teresinha Dal Mas Dias, et al.
Published: (2018-08-01)