A pipeline for the creation of multimodal corpora from YouTube videos

This paper introduces an open-source pipeline for the creation of multimodal corpora from YouTube videos. It minimizes storage and bandwidth requirements, because the videos themselves need not be downloaded and can remain on YouTube’s servers. It also minimizes processing requirements by using YouT...

Celý popis

Podrobná bibliografie
Hlavní autoři: Dykes, N, Wilson, A, Uhrig, P
Médium: Conference item
Jazyk:English
Vydáno: Association for Computational Lingustics 2023