A pipeline for the creation of multimodal corpora from YouTube videos
This paper introduces an open-source pipeline for the creation of multimodal corpora from YouTube videos. It minimizes storage and bandwidth requirements, because the videos themselves need not be downloaded and can remain on YouTube’s servers. It also minimizes processing requirements by using YouT...
Hlavní autoři: | , , |
---|---|
Médium: | Conference item |
Jazyk: | English |
Vydáno: |
Association for Computational Lingustics
2023
|