Creating a typology of parishes in England and Wales: Mining 1881 census data

The paper presents the application of principal component analysis and cluster analysis to historical individual level census data in order to explore social and economic variations and patterns in household structure across mid-Victorian England and Wales. Principal component analysis is used in or...

Full description

Bibliographic Details
Main Authors: Kevin Schürer, Tatiana Penkova
Format: Article
Language:English
Published: International Instititute of Social History 2015-09-01
Series:Historical Life Course Studies
Subjects:
Online Access:https://hlcs.nl/article/view/9358
Description
Summary:The paper presents the application of principal component analysis and cluster analysis to historical individual level census data in order to explore social and economic variations and patterns in household structure across mid-Victorian England and Wales. Principal component analysis is used in order to identify and eliminate unimportant attributes within the data and the aggregation of the remaining attributes. By combining Kaiser’s rule and the Broken-stick model, four principal components are selected for subsequent data modelling. Cluster analysis is used in order to identify associations and structure within the data. A hierarchy of cluster structures is constructed with two, three, four and five clusters in 21-dimensional data space. The main differences between clusters are described in this paper.
ISSN:2352-6343