Overview of the HASOC subtrack at FIRE 2021: Hate speech and offensive content identification in English and Indo-Aryan languages and conversational hate speech
Shahi, Gautam Kishore
Jaiswal, Amit Kumar
MetadataShow full item record
AbstractThe HASOC track is dedicated to the evaluation of technology for finding Offensive Language and Hate Speech. HASOC is creating a multilingual data corpus mainly for English and under-resourced languages(Hindi and Marathi). This paper presents one HASOC subtrack with two tasks. In 2021, we organized the classification task for English, Hindi, and Marathi. The first task consists of two classification tasks; Subtask 1A consists of a binary and fine-grained classification into offensive and non-offensive tweets. Subtask 1B asks to classify the tweets into Hate, Profane and offensive. Task 2 consists of identifying tweets given additional context in the form of the preceding conversion. During the shared task, 65 teams have submitted 652 runs. This overview paper briefly presents the task descriptions, the data and the results obtained from the participant's submission.
CitationMandl, T., Modha, S., Shahi, G.K. et al. (2021) Overview of the HASOC subtrack at FIRE 2021: Hate speech and offensive content identification in English and Indo-Aryan languages and conversational hate speech. FIRE 2021: Forum for Information Retrieval Evaluation, Virtual Event India December 13th-17th, 2021.
PublisherAssociation for Computing Machinery
JournalACM International Conference Proceeding Series
DescriptionThis is an accepted manuscript of a paper published by ACM in the proceedings of FIRE 2021: Forum for Information Retrieval Evaluation on 13/12/2021, available online: https://doi.org/10.1145/3503162.3503176 The accepted manuscript of the publication may differ from the final published version.
Except where otherwise noted, this item's license is described as https://creativecommons.org/licenses/by/4.0/