Automatisiertes Record Linkage in prosopographischen Datenbeständen am Beispiel historischer Quellen Leipzigs
In this study, an automated approach to record linkage in prosopographic datasets is presented. It implements numerous genealogical rules for linking individuals. This makes it particularly suitable for datasets that contain a lot of gen...
Main Authors: | , |
---|---|
Format: | Article |
Language: | deu |
Published: |
Forschungsverbund Marbach Weimar Wolfenbüttel / Verband Digital Humanities im deutschsprachigen Raum e.V.
2023-01-01
|
Series: | Zeitschrift für digitale Geisteswissenschaften |
Subjects: | |
Online Access: | https://www.zfdg.de/node/383 |
_version_ | 1826992714873307136 |
---|---|
author | Jan Michael Goldberg Marcel Mernitz |
author_facet | Jan Michael Goldberg Marcel Mernitz |
author_sort | Jan Michael Goldberg |
collection | DOAJ |
description | In this study, an automated approach to record linkage in prosopographic
datasets is presented. It implements numerous genealogical rules for linking
individuals. This makes it particularly suitable for datasets that contain a
lot of genealogically relevant information about the represented individuals.
For this purpose, a standardized data structure is defined into which the input
data is to be arranged. The algorithm recognizes entries pertaining to the same
persons within this data structure and merges them automatically. In this
process, a formalization of genealogical heuristics is performed. The
functionality of the algorithm is successfully demonstrated using historical
datasets from the city of Leipzig as an example. The program code has been
realized in Python and is freely available. |
first_indexed | 2024-04-10T17:42:05Z |
format | Article |
id | doaj.art-5654e2ff1a0d4bc8ac69666d38814b48 |
institution | Directory Open Access Journal |
issn | 2510-1358 |
language | deu |
last_indexed | 2025-02-18T08:54:43Z |
publishDate | 2023-01-01 |
publisher | Forschungsverbund Marbach Weimar Wolfenbüttel / Verband Digital Humanities im deutschsprachigen Raum e.V. |
record_format | Article |
series | Zeitschrift für digitale Geisteswissenschaften |
spelling | doaj.art-5654e2ff1a0d4bc8ac69666d38814b482024-11-02T23:56:16ZdeuForschungsverbund Marbach Weimar Wolfenbüttel / Verband Digital Humanities im deutschsprachigen Raum e.V.Zeitschrift für digitale Geisteswissenschaften2510-13582023-01-010110.17175/2023_0011819370283Automatisiertes Record Linkage in prosopographischen Datenbeständen am Beispiel historischer Quellen LeipzigsJan Michael Goldberghttps://orcid.org/0000-0002-4817-4283Marcel Mernitzhttps://orcid.org/0000-0001-6464-2844In this study, an automated approach to record linkage in prosopographic datasets is presented. It implements numerous genealogical rules for linking individuals. This makes it particularly suitable for datasets that contain a lot of genealogically relevant information about the represented individuals. For this purpose, a standardized data structure is defined into which the input data is to be arranged. The algorithm recognizes entries pertaining to the same persons within this data structure and merges them automatically. In this process, a formalization of genealogical heuristics is performed. The functionality of the algorithm is successfully demonstrated using historical datasets from the city of Leipzig as an example. The program code has been realized in Python and is freely available.https://www.zfdg.de/node/383duplikaterkennung datenverknüpfung personenbezogene daten algorithmus genealogie geschichtswissenschaft |
spellingShingle | Jan Michael Goldberg Marcel Mernitz Automatisiertes Record Linkage in prosopographischen Datenbeständen am Beispiel historischer Quellen Leipzigs Zeitschrift für digitale Geisteswissenschaften duplikaterkennung datenverknüpfung personenbezogene daten algorithmus genealogie geschichtswissenschaft |
title | Automatisiertes Record Linkage in prosopographischen
Datenbeständen am Beispiel historischer Quellen Leipzigs |
title_full | Automatisiertes Record Linkage in prosopographischen
Datenbeständen am Beispiel historischer Quellen Leipzigs |
title_fullStr | Automatisiertes Record Linkage in prosopographischen
Datenbeständen am Beispiel historischer Quellen Leipzigs |
title_full_unstemmed | Automatisiertes Record Linkage in prosopographischen
Datenbeständen am Beispiel historischer Quellen Leipzigs |
title_short | Automatisiertes Record Linkage in prosopographischen
Datenbeständen am Beispiel historischer Quellen Leipzigs |
title_sort | automatisiertes record linkage in prosopographischen datenbestanden am beispiel historischer quellen leipzigs |
topic | duplikaterkennung datenverknüpfung personenbezogene daten algorithmus genealogie geschichtswissenschaft |
url | https://www.zfdg.de/node/383 |
work_keys_str_mv | AT janmichaelgoldberg automatisiertesrecordlinkageinprosopographischendatenbestandenambeispielhistorischerquellenleipzigs AT marcelmernitz automatisiertesrecordlinkageinprosopographischendatenbestandenambeispielhistorischerquellenleipzigs |