Testing Heaps’ law for cities using administrative and gridded population data sets

Abstract Since 2008 the number of individuals living in urban areas has surpassed that of rural areas and in the next decades urbanisation is expected to further increase, especially in developing countries. A country’s urbanisation depends both on the distribution of city sizes, describing the frac...

Full description

Bibliographic Details
Main Authors: Filippo Simini, Charlotte James
Format: Article
Language:English
Published: SpringerOpen 2019-07-01
Series:EPJ Data Science
Subjects:
Online Access:http://link.springer.com/article/10.1140/epjds/s13688-019-0203-y
_version_ 1818287187579371520
author Filippo Simini
Charlotte James
author_facet Filippo Simini
Charlotte James
author_sort Filippo Simini
collection DOAJ
description Abstract Since 2008 the number of individuals living in urban areas has surpassed that of rural areas and in the next decades urbanisation is expected to further increase, especially in developing countries. A country’s urbanisation depends both on the distribution of city sizes, describing the fraction of cities with a given population (or area), and the overall number of cities in the country. Here we present empirical evidence suggesting the validity of Heaps’ law for cities: the expected number of cities in a country is only a function of the country’s total population (or built-up area) and the distribution of city sizes. This implies the absence of correlations in the spatial distribution of cities. We show that this result holds at the country scale using the official administrative definition of cities provided by the Geonames dataset, as well as at the local scale, for areas of 128×128 $128 \times 128$ km2 in the United States, using a morphological definition of urban clusters obtained from the Global Rural-Urban Mapping Project (GRUMP) dataset. We also derive a general theoretical result applicable to all systems characterised by a Zipf distribution of group sizes, which describes the relationship between the expected number of groups (cities) and the total number of elements in all groups (population), providing further insights on the relationship between Zipf’s law and Heaps’ law for finite-size systems.
first_indexed 2024-12-13T01:36:30Z
format Article
id doaj.art-3ba67b27e3c042238ba96770e2c2d9e8
institution Directory Open Access Journal
issn 2193-1127
language English
last_indexed 2024-12-13T01:36:30Z
publishDate 2019-07-01
publisher SpringerOpen
record_format Article
series EPJ Data Science
spelling doaj.art-3ba67b27e3c042238ba96770e2c2d9e82022-12-22T00:03:52ZengSpringerOpenEPJ Data Science2193-11272019-07-018111310.1140/epjds/s13688-019-0203-yTesting Heaps’ law for cities using administrative and gridded population data setsFilippo Simini0Charlotte James1Department of Engineering Mathematics, University of BristolDepartment of Engineering Mathematics, University of BristolAbstract Since 2008 the number of individuals living in urban areas has surpassed that of rural areas and in the next decades urbanisation is expected to further increase, especially in developing countries. A country’s urbanisation depends both on the distribution of city sizes, describing the fraction of cities with a given population (or area), and the overall number of cities in the country. Here we present empirical evidence suggesting the validity of Heaps’ law for cities: the expected number of cities in a country is only a function of the country’s total population (or built-up area) and the distribution of city sizes. This implies the absence of correlations in the spatial distribution of cities. We show that this result holds at the country scale using the official administrative definition of cities provided by the Geonames dataset, as well as at the local scale, for areas of 128×128 $128 \times 128$ km2 in the United States, using a morphological definition of urban clusters obtained from the Global Rural-Urban Mapping Project (GRUMP) dataset. We also derive a general theoretical result applicable to all systems characterised by a Zipf distribution of group sizes, which describes the relationship between the expected number of groups (cities) and the total number of elements in all groups (population), providing further insights on the relationship between Zipf’s law and Heaps’ law for finite-size systems.http://link.springer.com/article/10.1140/epjds/s13688-019-0203-yHeaps’ lawZipf’s lawCitiesUrbanisationScaling
spellingShingle Filippo Simini
Charlotte James
Testing Heaps’ law for cities using administrative and gridded population data sets
EPJ Data Science
Heaps’ law
Zipf’s law
Cities
Urbanisation
Scaling
title Testing Heaps’ law for cities using administrative and gridded population data sets
title_full Testing Heaps’ law for cities using administrative and gridded population data sets
title_fullStr Testing Heaps’ law for cities using administrative and gridded population data sets
title_full_unstemmed Testing Heaps’ law for cities using administrative and gridded population data sets
title_short Testing Heaps’ law for cities using administrative and gridded population data sets
title_sort testing heaps law for cities using administrative and gridded population data sets
topic Heaps’ law
Zipf’s law
Cities
Urbanisation
Scaling
url http://link.springer.com/article/10.1140/epjds/s13688-019-0203-y
work_keys_str_mv AT filipposimini testingheapslawforcitiesusingadministrativeandgriddedpopulationdatasets
AT charlottejames testingheapslawforcitiesusingadministrativeandgriddedpopulationdatasets