Hatemoji: A test suite and adversarially-generated dataset for benchmarking and detecting emoji-based hate

Detecting online hate is a complex task, and low-performing models have harmful consequences when used for sensitive applications such as content moderation. Emoji-based hate is an emerging challenge for automated detection. We present HatemojiCheck, a test suite of 3,930 short-form statements that...

Bibliographic Details
Main Authors: Kirk, HR, Vidgen, B, Rottger, P, Thrush, T, Hale, S
Format: Conference item
Language: English
Published: Association for Computational Linguistics 2022