Forecasting COVID-19 caseloads using unsupervised embedding clusters of social media posts

We present a novel approach incorporating transformer-based language models into infectious disease modelling. Text-derived features are quantified by tracking high-density clusters of sentence-level representations of Reddit posts within specific US states’ COVID-19 subreddits. We benchmark these c...

وصف كامل

التفاصيل البيبلوغرافية
المؤلفون الرئيسيون: Drinkall, F, Zohren, S, Pierrehumbert, JB
التنسيق: Conference item
اللغة:English
منشور في: Association for Computational Linguistics 2022