A pipeline for the creation of multimodal corpora from YouTube videos

This paper introduces an open-source pipeline for the creation of multimodal corpora from YouTube videos. It minimizes storage and bandwidth requirements, because the videos themselves need not be downloaded and can remain on YouTube’s servers. It also minimizes processing requirements by using YouT...

Bibliographic details
Main authors: Dykes, N; Wilson, A; Uhrig, P
Format: Conference item
Language: English
Published: Association for Computational Linguistics, 2023