Pyrlato: A novel methodology to collect real-world acoustic data

Bibliographic Details
Main Authors: Giuseppe Magistro, Claudia Crocco
Format: Article
Language: Catalan
Published: Universitat de Barcelona 2023-11-01
Series: Estudios de Fonética Experimental
Subjects: real-world data, ecological validity, data scraping
Online Access: https://revistes.ub.edu/index.php/experimentalphonetics/article/view/45019
_version_ 1797302126804729856
author Giuseppe Magistro
Claudia Crocco
author_facet Giuseppe Magistro
Claudia Crocco
author_sort Giuseppe Magistro
collection DOAJ
description In this paper, we present Pyrlato, a tool developed in Python for collecting acoustic data from YouTube. The development of this tool was motivated by the need to collect real-world spoken data conveniently. By running the Python code, researchers can obtain a spoken corpus of specific words, syllables, constituents, and more. We illustrate the main steps of the execution to show how the tool works and how to use it. Additionally, we provide a complete example for reference, demonstrating how to customize Pyrlato according to specific requirements. Finally, we discuss the developments planned for future versions of Pyrlato.
first_indexed 2024-03-07T23:32:17Z
format Article
id doaj.art-24998548dd42417abd0e952c96b55f1c
institution Directory Open Access Journal
issn 1575-5533
2385-3573
language Catalan
last_indexed 2024-03-07T23:32:17Z
publishDate 2023-11-01
publisher Universitat de Barcelona
record_format Article
series Estudios de Fonética Experimental
spelling doaj.art-24998548dd42417abd0e952c96b55f1c | 2024-02-20T11:51:28Z | cat | Universitat de Barcelona | Estudios de Fonética Experimental | 1575-5533 | 2385-3573 | 2023-11-01 | 32 | Pyrlato: A novel methodology to collect real-world acoustic data | Giuseppe Magistro (Ghent University) | Claudia Crocco (Ghent University) | In this paper, we present Pyrlato, a tool developed in Python for collecting acoustic data from YouTube. The development of this tool was motivated by the need to collect real-world spoken data conveniently. By running the Python code, researchers can obtain a spoken corpus of specific words, syllables, constituents, and more. We illustrate the main steps of the execution to show how the tool works and how to use it. Additionally, we provide a complete example for reference, demonstrating how to customize Pyrlato according to specific requirements. Finally, we discuss the developments planned for future versions of Pyrlato. | https://revistes.ub.edu/index.php/experimentalphonetics/article/view/45019 | real-world data | ecological validity | data scraping
spellingShingle Giuseppe Magistro
Claudia Crocco
Pyrlato: A novel methodology to collect real-world acoustic data
Estudios de Fonética Experimental
real-world data
ecological validity
data scraping
title Pyrlato: A novel methodology to collect real-world acoustic data
title_full Pyrlato: A novel methodology to collect real-world acoustic data
title_fullStr Pyrlato: A novel methodology to collect real-world acoustic data
title_full_unstemmed Pyrlato: A novel methodology to collect real-world acoustic data
title_short Pyrlato: A novel methodology to collect real-world acoustic data
title_sort pyrlato a novel methodology to collect real world acoustic data
topic real-world data
ecological validity
data scraping
url https://revistes.ub.edu/index.php/experimentalphonetics/article/view/45019
work_keys_str_mv AT giuseppemagistro pyrlatoanovelmethodologytocollectrealworldacousticdata
AT claudiacrocco pyrlatoanovelmethodologytocollectrealworldacousticdata