HateCheck: functional tests for hate speech detection models

Detecting online hate is a difficult task that even state-of-the-art models struggle with. Typically, hate speech detection models are evaluated by measuring their performance on held-out test data using metrics such as accuracy and F1 score. However, this approach makes it difficult to identify spe...

وصف كامل

التفاصيل البيبلوغرافية
المؤلفون الرئيسيون:	Röttger, P, Vidgen, B, Dong, N, Waseem, Z, Margetts, H, Pierrehumbert, JB
التنسيق:	Conference item
اللغة:	English
منشور في:	Association for Computational Linguistics 2021

HateCheck: functional tests for hate speech detection models

مواد مشابهة