All together against hate: ensemble based LLMs for multi class hate speech classification in the football context

Santos, Guto Leoni; Santos, Vitor Gaboardi dos; Kearns, Colm; Sinclair, Gary; Black, Jack; Doidge, Mark; Fletcher, Thomas E.; Kilvington, Daniel; Liston, Katie; Endo, Patricia; Lynn, Theo

Home
Browse By

Author

DCU Faculties and Centres

Theses

Subject

Year

Publication Type

Year of Award

Supervisors
About / FAQ
Statistics
Login (DCU Staff Only)

All together against hate: ensemble based LLMs for multi class hate speech classification in the football context

Santos, Guto Leoni ORCID: 0000-0002-0257-4214, Santos, Vitor Gaboardi dos, Kearns, Colm ORCID: 0000-0001-6819-8488, Sinclair, Gary ORCID: 0000-0002-2181-7736, Black, Jack ORCID: 0000-0002-1595-5083, Doidge, Mark ORCID: 0000-0002-6858-3914, Fletcher, Thomas E. ORCID: 0000-0002-4618-5480, Kilvington, Daniel ORCID: 0000-0003-3361-0860, Liston, Katie ORCID: 0000-0003-3898-0166, Endo, Patricia and Lynn, Theo ORCID: 0000-0001-9284-7580 (2026) All together against hate: ensemble based LLMs for multi class hate speech classification in the football context. Journal of Big Data, 13 . ISSN 2196-1115

Abstract
Metadata
Downloads
Documents
Metrics

[+][-]

Abstract

The rise of social media platforms like Twitter has transformed communication, fostering community engagement and knowledge sharing across diverse groups. However, it has also provided a stage for toxic content, including hate speech, which can manifest in harmful ways within specific contexts, such as discussions surrounding football. Hate speech in this domain often targets individuals or groups based on attributes such as race, ethnicity, or nationality, and is exacerbated by the emotionally charged nature of sports discourse. While binary classification models have traditionally been employed to detect hate speech, they struggle to address nuanced and context-specific forms of abuse, including microaggressions and intersectional hate speech. Multi-class classification enables a more detailed understanding by distinguishing between various types of hate speech, but these models face challenges such as lexico-semantic variability and rapidly evolving norms within the football community. In this paper, we propose an ensemble technique leveraging BERT-based transformers to improve hate speech detection in football-related discussions on Twitter. Our method integrates manually-annotated datasets and multiple classifiers within ensemble frameworks to enhance accuracy and robustness. The results demonstrate that our approach significantly improves the identification of diverse forms of hate speech in the football context, contributing to more effective content moderation and fostering safer online communities.

Metadata

Item Type:	Article (Published)
Refereed:	Yes
Uncontrolled Keywords:	BERT, Ensemble methods, Euros, Hate speech detection, Multi-class classification
Subjects:	Computer Science > Artificial intelligence Computer Science > Machine learning
DCU Faculties and Centres:	DCU Faculties and Schools > DCU Business School
Publisher:	SpringerOpen
Official URL:	https://link.springer.com/article/10.1186/s40537-0...
Copyright Information:	Authors
ID Code:	32806
Deposited On:	30 Jun 2026 10:37 by Tam Nguyen . Last Modified 30 Jun 2026 10:37

Documents

Full text available as:

Preview

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Creative Commons: Attribution-Noncommercial-No Derivative Works 4.0
2MB

Metrics

Downloads

Downloads per month over past year

Archive Staff Only: edit this record

DORAS | DCU Research Repository

All together against hate: ensemble based LLMs for multi class hate speech classification in the football context

Altmetric Badge

Dimensions Badge

Downloads