Skip to main navigation Skip to search Skip to main content

Does BERT pay attention to cyberbullying?

  • Fatma Elsafoury
  • , Stamos Katsigiannis
  • , Steven R. Wilson
  • , Naeem Ramzan

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    77 Downloads (Pure)

    Abstract

    Social media have brought threats like cyberbullying, which can lead to stress, anxiety, depression, and in some severe cases, suicide attempts. Detecting cyberbullying can help to warn/ block bullies and provide support to victims. However, very few studies have used self-attention-based language models like BERT for cyberbullying detection and they typically only report BERT's performance without examining in depth the reasons for its performance. In this work, we examine the use of BERT for cyberbullying detection on various datasets and attempt to explain its performance by analyzing its attention weights and gradient-based feature importance scores for textual and linguistic features. Our results show that attention weights do not correlate with feature importance scores and thus do not explain the model's performance. Additionally, they suggest that BERT relies on syntactical biases in the datasets to assign feature importance scores to class-related wordsrather than cyberbullying-related linguistic features.

    Original languageEnglish
    Title of host publicationSIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval
    Place of PublicationNew York, NY
    PublisherAssociation for Computing Machinery
    Pages1900-1904
    Number of pages5
    ISBN (Electronic)9781450380379
    DOIs
    Publication statusPublished - 11 Jul 2021
    Event44th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2021 - Virtual, Online, Canada
    Duration: 11 Jul 202115 Jul 2021

    Publication series

    NameSIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval

    Conference

    Conference44th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2021
    Country/TerritoryCanada
    CityVirtual, Online
    Period11/07/2115/07/21

    UN SDGs

    This output contributes to the following UN Sustainable Development Goals (SDGs)

    1. SDG 3 - Good Health and Well-being
      SDG 3 Good Health and Well-being

    Keywords

    • cyberbullying
    • text classification
    • BERT
    • NLP

    Fingerprint

    Dive into the research topics of 'Does BERT pay attention to cyberbullying?'. Together they form a unique fingerprint.

    Cite this