Predicting the Type and Target of Offensive Posts in Social Media
Zampieri, Marcos ; Malmasi, Shervin ; Nakov, Preslav ; Rosenthal, Sara ; Farra, Noura ; Kumar, Ritesh
Issue Date
2019-06-01
Abstract
As offensive content has become pervasive in social media, there has been much research on identifying potentially offensive messages. However, previous work on this topic did not consider the problem as a whole, but rather focused on detecting very specific types of offensive content, e.g., hate speech, cyberbullying, or cyber-aggression. In contrast, here we target several different kinds of offensive content. In particular, we model the task hierarchically, identifying the type and the target of offensive messages in social media. For this purpose, we compiled the Offensive Language Identification Dataset (OLID), a new dataset with tweets annotated for offensive content using a fine-grained three-layer annotation scheme, which we make publicly available. We discuss the main similarities and differences between OLID and pre-existing datasets for hate speech identification, aggression detection, and similar tasks. We further experiment with and compare the performance of different machine learning models on OLID.
Citation
Zampieri, M., Malmasi, S., Nakov, P., Rosenthal, S., Farra, N. and Kumar, R. (2019) Predicting the Type and Target of Offensive Posts in Social Media, Proceedings of NAACL-HLT 2019, Minneapolis, Minnesota, 2nd-7th June, 2019, pp. 1415–1420.
Type
Conference contribution
Language
English
Description
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers)