RLT-S: A Web System for Record Linkage.

Record linkage integrates records across multiple related data sources identifying duplicates and accounting for possible errors. Real life applications require efficient algorithms to merge these voluminous data sources to find out all records belonging to same individuals. Our recently devised hig...

Full description

Bibliographic Details
Main Authors: Abdullah-Al Mamun, Robert Aseltine, Sanguthevar Rajasekaran
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2015-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC4420456?pdf=render
_version_ 1819066989505675264
author Abdullah-Al Mamun
Robert Aseltine
Sanguthevar Rajasekaran
author_facet Abdullah-Al Mamun
Robert Aseltine
Sanguthevar Rajasekaran
author_sort Abdullah-Al Mamun
collection DOAJ
description Record linkage integrates records across multiple related data sources identifying duplicates and accounting for possible errors. Real life applications require efficient algorithms to merge these voluminous data sources to find out all records belonging to same individuals. Our recently devised highly efficient record linkage algorithms provide best-known solutions to this challenging problem.We have developed RLT-S, a freely available web tool, which implements our single linkage clustering algorithm for record linkage. This tool requires input data sets and a small set of configuration settings about these files to work efficiently. RLT-S employs exact match clustering, blocking on a specified attribute and single linkage based hierarchical clustering among these blocks.RLT-S is an implementation package of our sequential record linkage algorithm. It outperforms previous best-known implementations by a large margin. The tool is at least two times faster for any dataset than the previous best-known tools.RLT-S tool implements our record linkage algorithm that outperforms previous best-known algorithms in this area. This website also contains necessary information such as instructions, submission history, feedback, publications and some other sections to facilitate the usage of the tool.RLT-S is integrated into http://www.rlatools.com, which is currently serving this tool only. The tool is freely available and can be used without login. All data files used in this paper have been stored in https://github.com/abdullah009/DataRLATools. For copies of the relevant programs please see https://github.com/abdullah009/RLATools.
first_indexed 2024-12-21T16:11:07Z
format Article
id doaj.art-11d14e63044a45878ab86b38c050f053
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-21T16:11:07Z
publishDate 2015-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-11d14e63044a45878ab86b38c050f0532022-12-21T18:57:47ZengPublic Library of Science (PLoS)PLoS ONE1932-62032015-01-01105e012444910.1371/journal.pone.0124449RLT-S: A Web System for Record Linkage.Abdullah-Al MamunRobert AseltineSanguthevar RajasekaranRecord linkage integrates records across multiple related data sources identifying duplicates and accounting for possible errors. Real life applications require efficient algorithms to merge these voluminous data sources to find out all records belonging to same individuals. Our recently devised highly efficient record linkage algorithms provide best-known solutions to this challenging problem.We have developed RLT-S, a freely available web tool, which implements our single linkage clustering algorithm for record linkage. This tool requires input data sets and a small set of configuration settings about these files to work efficiently. RLT-S employs exact match clustering, blocking on a specified attribute and single linkage based hierarchical clustering among these blocks.RLT-S is an implementation package of our sequential record linkage algorithm. It outperforms previous best-known implementations by a large margin. The tool is at least two times faster for any dataset than the previous best-known tools.RLT-S tool implements our record linkage algorithm that outperforms previous best-known algorithms in this area. This website also contains necessary information such as instructions, submission history, feedback, publications and some other sections to facilitate the usage of the tool.RLT-S is integrated into http://www.rlatools.com, which is currently serving this tool only. The tool is freely available and can be used without login. All data files used in this paper have been stored in https://github.com/abdullah009/DataRLATools. For copies of the relevant programs please see https://github.com/abdullah009/RLATools.http://europepmc.org/articles/PMC4420456?pdf=render
spellingShingle Abdullah-Al Mamun
Robert Aseltine
Sanguthevar Rajasekaran
RLT-S: A Web System for Record Linkage.
PLoS ONE
title RLT-S: A Web System for Record Linkage.
title_full RLT-S: A Web System for Record Linkage.
title_fullStr RLT-S: A Web System for Record Linkage.
title_full_unstemmed RLT-S: A Web System for Record Linkage.
title_short RLT-S: A Web System for Record Linkage.
title_sort rlt s a web system for record linkage
url http://europepmc.org/articles/PMC4420456?pdf=render
work_keys_str_mv AT abdullahalmamun rltsawebsystemforrecordlinkage
AT robertaseltine rltsawebsystemforrecordlinkage
AT sanguthevarrajasekaran rltsawebsystemforrecordlinkage