Automatisiertes Record Linkage in prosopographischen Datenbeständen am Beispiel historischer Quellen Leipzigs

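This study presents an automated approach to record linkage in prosopographic datasets. It implements numerous genealogical rules for linking individuals, which makes it particularly suitable for datasets that contain a lot of genealogically relevant information about the individuals represented. To this end, a standardized data structure is defined into which the input data must be arranged. The algorithm identifies entries within this structure that refer to the same person and merges them automatically, formalizing genealogical heuristics in the process. The algorithm is successfully demonstrated on historical datasets from the city of Leipzig. The program code is written in Python and is freely available.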
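The record's description outlines the general idea: entries are arranged in a standardized structure, compared using genealogical rules, and merged when they refer to the same person. The following is a minimal sketch of that idea in Python, not the authors' published code. The `PersonRecord` fields, the one-year tolerance, the specific rules, and the function names are all assumptions made for illustration.

```python
from dataclasses import dataclass
from itertools import combinations
from typing import Optional


@dataclass(frozen=True)
class PersonRecord:
    """One source entry in a hypothetical standardized structure."""
    record_id: str
    given_name: str
    surname: str
    birth_year: Optional[int] = None
    death_year: Optional[int] = None
    father_name: Optional[str] = None


def normalize(name: str) -> str:
    """Lower-case a name and collapse runs of whitespace."""
    return " ".join(name.lower().split())


def years_agree(a: Optional[int], b: Optional[int], tolerance: int) -> bool:
    """True only if both years are known and differ by at most `tolerance`."""
    return a is not None and b is not None and abs(a - b) <= tolerance


def years_conflict(a: Optional[int], b: Optional[int], tolerance: int) -> bool:
    """True only if both years are known and differ by more than `tolerance`."""
    return a is not None and b is not None and abs(a - b) > tolerance


def same_person(r1: PersonRecord, r2: PersonRecord, tolerance: int = 1) -> bool:
    """Illustrative rule set: names must match, nothing may contradict,
    and at least one further attribute must corroborate the match."""
    if normalize(r1.given_name) != normalize(r2.given_name):
        return False
    if normalize(r1.surname) != normalize(r2.surname):
        return False
    if years_conflict(r1.birth_year, r2.birth_year, tolerance):
        return False
    if years_conflict(r1.death_year, r2.death_year, tolerance):
        return False
    # Plausibility rule: a merged person cannot die before being born.
    for born, died in ((r1.birth_year, r2.death_year),
                       (r2.birth_year, r1.death_year)):
        if born is not None and died is not None and died < born:
            return False
    fathers_known = bool(r1.father_name) and bool(r2.father_name)
    fathers_match = fathers_known and (
        normalize(r1.father_name) == normalize(r2.father_name))
    if fathers_known and not fathers_match:
        return False
    # Require corroboration beyond the name itself.
    return (years_agree(r1.birth_year, r2.birth_year, tolerance)
            or years_agree(r1.death_year, r2.death_year, tolerance)
            or fathers_match)


def link_records(records: list[PersonRecord]) -> list[list[str]]:
    """Group record ids into clusters using union-find over matching pairs."""
    parent = {r.record_id: r.record_id for r in records}

    def find(x: str) -> str:
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for r1, r2 in combinations(records, 2):
        if same_person(r1, r2):
            parent[find(r1.record_id)] = find(r2.record_id)

    clusters: dict[str, list[str]] = {}
    for r in records:
        clusters.setdefault(find(r.record_id), []).append(r.record_id)
    return sorted(sorted(ids) for ids in clusters.values())


# Invented example data, not taken from the Leipzig sources.
records = [
    PersonRecord("A", "Johann", "Müller", birth_year=1701, father_name="Hans Müller"),
    PersonRecord("B", "johann  müller".split()[0], "Müller", birth_year=1702, death_year=1760),
    PersonRecord("C", "Johann", "Müller", birth_year=1750, father_name="Johann Müller"),
    PersonRecord("D", "Anna", "Schmidt"),
]
print(link_records(records))  # [['A', 'B'], ['C'], ['D']]
```

Records A and B are linked because their names match and their birth years fall within the assumed tolerance; C shares the name but has a conflicting birth year. Because clusters are formed transitively, a chain of pairwise matches can bring together records that would conflict if compared directly, so a fuller implementation would need to re-check merged clusters for consistency.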

Bibliographic Details
Main Authors: Jan Michael Goldberg, Marcel Mernitz
Format: Article
Language: German
Published: Forschungsverbund Marbach Weimar Wolfenbüttel / Verband Digital Humanities im deutschsprachigen Raum e.V. 2023-01-01
Series: Zeitschrift für digitale Geisteswissenschaften
Subjects: duplicate detection, data linkage, personal data, algorithm, genealogy, historical studies
Online Access: https://www.zfdg.de/node/383
author Jan Michael Goldberg
Marcel Mernitz
collection DOAJ
description This study presents an automated approach to record linkage in prosopographic datasets. It implements numerous genealogical rules for linking individuals, which makes it particularly suitable for datasets that contain a lot of genealogically relevant information about the individuals represented. To this end, a standardized data structure is defined into which the input data must be arranged. The algorithm identifies entries within this structure that refer to the same person and merges them automatically, formalizing genealogical heuristics in the process. The algorithm is successfully demonstrated on historical datasets from the city of Leipzig. The program code is written in Python and is freely available.
first_indexed 2024-04-10T17:42:05Z
format Article
id doaj.art-5654e2ff1a0d4bc8ac69666d38814b48
institution Directory Open Access Journal
issn 2510-1358
language deu
last_indexed 2025-02-18T08:54:43Z
publishDate 2023-01-01
publisher Forschungsverbund Marbach Weimar Wolfenbüttel / Verband Digital Humanities im deutschsprachigen Raum e.V.
record_format Article
series Zeitschrift für digitale Geisteswissenschaften
doi 10.17175/2023_001
orcid https://orcid.org/0000-0002-4817-4283 (Jan Michael Goldberg)
https://orcid.org/0000-0001-6464-2844 (Marcel Mernitz)
title Automatisiertes Record Linkage in prosopographischen Datenbeständen am Beispiel historischer Quellen Leipzigs
topic duplicate detection
data linkage
personal data
algorithm
genealogy
historical studies
url https://www.zfdg.de/node/383