Testing Heaps’ law for cities using administrative and gridded population data sets
Abstract Since 2008 the number of individuals living in urban areas has surpassed that of rural areas and in the next decades urbanisation is expected to further increase, especially in developing countries. A country’s urbanisation depends both on the distribution of city sizes, describing the frac...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
SpringerOpen
2019-07-01
|
Series: | EPJ Data Science |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1140/epjds/s13688-019-0203-y |
_version_ | 1818287187579371520 |
---|---|
author | Filippo Simini Charlotte James |
author_facet | Filippo Simini Charlotte James |
author_sort | Filippo Simini |
collection | DOAJ |
description | Abstract Since 2008 the number of individuals living in urban areas has surpassed that of rural areas and in the next decades urbanisation is expected to further increase, especially in developing countries. A country’s urbanisation depends both on the distribution of city sizes, describing the fraction of cities with a given population (or area), and the overall number of cities in the country. Here we present empirical evidence suggesting the validity of Heaps’ law for cities: the expected number of cities in a country is only a function of the country’s total population (or built-up area) and the distribution of city sizes. This implies the absence of correlations in the spatial distribution of cities. We show that this result holds at the country scale using the official administrative definition of cities provided by the Geonames dataset, as well as at the local scale, for areas of 128×128 $128 \times 128$ km2 in the United States, using a morphological definition of urban clusters obtained from the Global Rural-Urban Mapping Project (GRUMP) dataset. We also derive a general theoretical result applicable to all systems characterised by a Zipf distribution of group sizes, which describes the relationship between the expected number of groups (cities) and the total number of elements in all groups (population), providing further insights on the relationship between Zipf’s law and Heaps’ law for finite-size systems. |
first_indexed | 2024-12-13T01:36:30Z |
format | Article |
id | doaj.art-3ba67b27e3c042238ba96770e2c2d9e8 |
institution | Directory Open Access Journal |
issn | 2193-1127 |
language | English |
last_indexed | 2024-12-13T01:36:30Z |
publishDate | 2019-07-01 |
publisher | SpringerOpen |
record_format | Article |
series | EPJ Data Science |
spelling | doaj.art-3ba67b27e3c042238ba96770e2c2d9e82022-12-22T00:03:52ZengSpringerOpenEPJ Data Science2193-11272019-07-018111310.1140/epjds/s13688-019-0203-yTesting Heaps’ law for cities using administrative and gridded population data setsFilippo Simini0Charlotte James1Department of Engineering Mathematics, University of BristolDepartment of Engineering Mathematics, University of BristolAbstract Since 2008 the number of individuals living in urban areas has surpassed that of rural areas and in the next decades urbanisation is expected to further increase, especially in developing countries. A country’s urbanisation depends both on the distribution of city sizes, describing the fraction of cities with a given population (or area), and the overall number of cities in the country. Here we present empirical evidence suggesting the validity of Heaps’ law for cities: the expected number of cities in a country is only a function of the country’s total population (or built-up area) and the distribution of city sizes. This implies the absence of correlations in the spatial distribution of cities. We show that this result holds at the country scale using the official administrative definition of cities provided by the Geonames dataset, as well as at the local scale, for areas of 128×128 $128 \times 128$ km2 in the United States, using a morphological definition of urban clusters obtained from the Global Rural-Urban Mapping Project (GRUMP) dataset. We also derive a general theoretical result applicable to all systems characterised by a Zipf distribution of group sizes, which describes the relationship between the expected number of groups (cities) and the total number of elements in all groups (population), providing further insights on the relationship between Zipf’s law and Heaps’ law for finite-size systems.http://link.springer.com/article/10.1140/epjds/s13688-019-0203-yHeaps’ lawZipf’s lawCitiesUrbanisationScaling |
spellingShingle | Filippo Simini Charlotte James Testing Heaps’ law for cities using administrative and gridded population data sets EPJ Data Science Heaps’ law Zipf’s law Cities Urbanisation Scaling |
title | Testing Heaps’ law for cities using administrative and gridded population data sets |
title_full | Testing Heaps’ law for cities using administrative and gridded population data sets |
title_fullStr | Testing Heaps’ law for cities using administrative and gridded population data sets |
title_full_unstemmed | Testing Heaps’ law for cities using administrative and gridded population data sets |
title_short | Testing Heaps’ law for cities using administrative and gridded population data sets |
title_sort | testing heaps law for cities using administrative and gridded population data sets |
topic | Heaps’ law Zipf’s law Cities Urbanisation Scaling |
url | http://link.springer.com/article/10.1140/epjds/s13688-019-0203-y |
work_keys_str_mv | AT filipposimini testingheapslawforcitiesusingadministrativeandgriddedpopulationdatasets AT charlottejames testingheapslawforcitiesusingadministrativeandgriddedpopulationdatasets |