When the timeline meets the pipeline: a survey on automated cyberbullying detection

Fatma Elsafoury*, Stamos Katsigiannis, Zeeshan Pervez, Naeem Ramzan

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

35 Citations (Scopus)
40 Downloads (Pure)

Abstract

Web 2.0 helped user-generated platforms to spread widely. Unfortunately, it also allowed for cyberbullying to spread. Cyberbullying has negative effects that could lead to cases of depression and low self-esteem. It has become crucial to develop tools for automated cyberbullying detection. The research on developing these tools has been growing over the last decade, especially with the recent advances in machine learning and natural language processing. Given the large body of work on this topic, it is vital to critically review the literature on cyberbullying within the context of these latest advances. In this paper, we survey the automated detection of cyberbullying. Our survey sheds light on some challenges and limitations for the field. The challenges range from defining cyberbullying, data collection, and feature representation to model selection, training, and evaluation. We also provide some suggestions for improving the task of cyberbullying detection. In addition to the survey, we propose to improve the task of cyberbullying detection by addressing some of the raised limitations: 1) Using recent contextual language models like BERT for the detection of cyberbullying; 2) Using slang-based word embeddings to generate better representations of the cyberbullying-related datasets. Our results show that BERT outperforms state-of-the-art cyberbullying detection models and deep learning models. The results also show that deep learning models initialized with slang-based word embeddings outperform deep learning models initialized with traditional word embeddings.
Original languageEnglish
Pages (from-to)103541-103563
Number of pages23
JournalIEEE Access
Volume9
DOIs
Publication statusPublished - 21 Jul 2021

Keywords

  • cyberbullying detection
  • deep learning
  • social media
  • text classification

Fingerprint

Dive into the research topics of 'When the timeline meets the pipeline: a survey on automated cyberbullying detection'. Together they form a unique fingerprint.

Cite this