Humans Need Context, What about Machines? Investigating Conversational Context in Abusive Language Detection - Intelligence Artificielle Access content directly
Conference Papers Year : 2024

Humans Need Context, What about Machines? Investigating Conversational Context in Abusive Language Detection

Abstract

A crucial aspect in abusive language on social media platforms (toxicity, hate speech, harmful stereotypes, etc.) is its inherent contextual nature. In this paper, we focus on the role of conversational context in abusive language detection, one of the most “direct” forms of context in this domain, as given by the conversation threads (e.g., directly preceding message, original post). The incorporation of surrounding messages has proven vital for the accurate human annotation of harmful content. However, many prior works have either ignored this aspect, collecting and processing messages in isolation, or have obtained inconsistent results when attempting to embed such contextual information into traditional classification methods. The reasons behind these findings have not yet been properly addressed. To this end, we propose an analysis of the impact of conversational context in abusive language detection, through: (1) an analysis of prior works and the limitations of the most common concatenation-based approach, which we attempt to address with two alternative architectures; (2) an evaluation of these methods on existing datasets in English, and a new dataset of French tweets annotated for hate speech and stereotypes; and (3) a qualitative analysis showcasing the necessity for context-awareness in ALD, but also its difficulties.
Fichier principal
Vignette du fichier
2024.lrec-main.740.pdf (477.99 Ko) Télécharger le fichier
Origin Files produced by the author(s)

Dates and versions

hal-04593250 , version 1 (29-05-2024)

Identifiers

  • HAL Id : hal-04593250 , version 1

Cite

Tom Bourgeade, Zongmin Li, Farah Benamara, Véronique Moriceau, Jian Su, et al.. Humans Need Context, What about Machines? Investigating Conversational Context in Abusive Language Detection. The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), ELRA and ICCL, May 2024, Turin, Italy. ⟨hal-04593250⟩
91 View
12 Download

Share

Gmail Mastodon Facebook X LinkedIn More