American cultural regions mapped through the lexical analysis of social media

Abstract Cultural areas represent a useful concept that cross-fertilizes diverse fields in social sciences. Knowledge of how humans organize and relate their ideas and behavior within a society can help us to understand our actions and attitudes toward different issues. However, the selection of com...

Full description

Bibliographic Details
Main Authors: Thomas Louf, Bruno Gonçalves, José J. Ramasco, David Sánchez, Jack Grieve
Format: Article
Language:English
Published: Springer Nature 2023-03-01
Series:Humanities & Social Sciences Communications
Online Access:https://doi.org/10.1057/s41599-023-01611-3
_version_ 1797853991097335808
author Thomas Louf
Bruno Gonçalves
José J. Ramasco
David Sánchez
Jack Grieve
author_facet Thomas Louf
Bruno Gonçalves
José J. Ramasco
David Sánchez
Jack Grieve
author_sort Thomas Louf
collection DOAJ
description Abstract Cultural areas represent a useful concept that cross-fertilizes diverse fields in social sciences. Knowledge of how humans organize and relate their ideas and behavior within a society can help us to understand our actions and attitudes toward different issues. However, the selection of common traits that shape a cultural area is somewhat arbitrary. What is needed is a method that can leverage the massive amounts of data coming online, especially through social media, to identify cultural regions without ad-hoc assumptions, biases, or prejudices. This work takes a crucial step in this direction by introducing a method to infer cultural regions based on the automatic analysis of large datasets from microblogging posts. The approach presented here is based on the principle that cultural affiliation can be inferred from the topics that people discuss among themselves. Specifically, regional variations in written discourse are measured in American social media. From the frequency distributions of content words in geotagged tweets, the regional hotspots of words’ usage are found, and from there, principal components of regional variation are derived. Through a hierarchical clustering of the data in this lower-dimensional space, this method yields clear cultural areas and the topics of discussion that define them. It uncovers a manifest North–South separation, which is primarily influenced by the African American culture, and further contiguous (East–West) and non-contiguous divisions that provide a comprehensive picture of modern American cultural areas.
first_indexed 2024-04-09T19:58:39Z
format Article
id doaj.art-752c59bab2bd4233948c2e740291d940
institution Directory Open Access Journal
issn 2662-9992
language English
last_indexed 2024-04-09T19:58:39Z
publishDate 2023-03-01
publisher Springer Nature
record_format Article
series Humanities & Social Sciences Communications
spelling doaj.art-752c59bab2bd4233948c2e740291d9402023-04-03T05:20:08ZengSpringer NatureHumanities & Social Sciences Communications2662-99922023-03-0110111110.1057/s41599-023-01611-3American cultural regions mapped through the lexical analysis of social mediaThomas Louf0Bruno Gonçalves1José J. Ramasco2David Sánchez3Jack Grieve4Institute for Cross-Disciplinary Physics and Complex Systems IFISC (UIB-CSIC)ISI FoundationInstitute for Cross-Disciplinary Physics and Complex Systems IFISC (UIB-CSIC)Institute for Cross-Disciplinary Physics and Complex Systems IFISC (UIB-CSIC)Department of English Language and Linguistics, University of BirminghamAbstract Cultural areas represent a useful concept that cross-fertilizes diverse fields in social sciences. Knowledge of how humans organize and relate their ideas and behavior within a society can help us to understand our actions and attitudes toward different issues. However, the selection of common traits that shape a cultural area is somewhat arbitrary. What is needed is a method that can leverage the massive amounts of data coming online, especially through social media, to identify cultural regions without ad-hoc assumptions, biases, or prejudices. This work takes a crucial step in this direction by introducing a method to infer cultural regions based on the automatic analysis of large datasets from microblogging posts. The approach presented here is based on the principle that cultural affiliation can be inferred from the topics that people discuss among themselves. Specifically, regional variations in written discourse are measured in American social media. From the frequency distributions of content words in geotagged tweets, the regional hotspots of words’ usage are found, and from there, principal components of regional variation are derived. Through a hierarchical clustering of the data in this lower-dimensional space, this method yields clear cultural areas and the topics of discussion that define them. It uncovers a manifest North–South separation, which is primarily influenced by the African American culture, and further contiguous (East–West) and non-contiguous divisions that provide a comprehensive picture of modern American cultural areas.https://doi.org/10.1057/s41599-023-01611-3
spellingShingle Thomas Louf
Bruno Gonçalves
José J. Ramasco
David Sánchez
Jack Grieve
American cultural regions mapped through the lexical analysis of social media
Humanities & Social Sciences Communications
title American cultural regions mapped through the lexical analysis of social media
title_full American cultural regions mapped through the lexical analysis of social media
title_fullStr American cultural regions mapped through the lexical analysis of social media
title_full_unstemmed American cultural regions mapped through the lexical analysis of social media
title_short American cultural regions mapped through the lexical analysis of social media
title_sort american cultural regions mapped through the lexical analysis of social media
url https://doi.org/10.1057/s41599-023-01611-3
work_keys_str_mv AT thomaslouf americanculturalregionsmappedthroughthelexicalanalysisofsocialmedia
AT brunogoncalves americanculturalregionsmappedthroughthelexicalanalysisofsocialmedia
AT josejramasco americanculturalregionsmappedthroughthelexicalanalysisofsocialmedia
AT davidsanchez americanculturalregionsmappedthroughthelexicalanalysisofsocialmedia
AT jackgrieve americanculturalregionsmappedthroughthelexicalanalysisofsocialmedia