AI enabled facial emotion recognition using low-cost thermal cameras

James Thomas Black*, Muhammad Zeeshan Shakir

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

While expensive hardware has historically dominated emotion recognition, our research explores the viability of cost-effective alternatives by utilising IoT-based low-resolution cameras with Vision Transformers (ViTs) and Convolutional Neural Networks (CNNs). In this work, we introduce a novel dataset specifically for thermal facial expression recognition and conduct a comprehensive performance analysis using ResNet, a standard ViT model developed by Google, and a modified ViT model tailored to be trained on smaller dataset sizes. This allows us to compare the efficacy of the more recent ViT architecture against the traditional CNN. Our findings reveal that not only do ViT models learn more swiftly than ResNet, but they also demonstrate superior performance across all metrics on our dataset. Furthermore, our investigation extends to the Kotani Thermal Facial Emotion (KTFE) test set, where we evaluate the generalisation capability of these models when trained using a hybrid approach that combines our dataset with the KTFE dataset. Both ResNet and the ViT model by Google achieved high performance on the KTFE test samples, suggesting that leveraging diverse data sources can significantly strengthen model robustness and adaptability. This study highlights three critical implications: the promising role of accessible and affordable thermal imaging technology in emotion classification; the potential of ViT models to redefine state-of-the-art approaches in this domain; and the importance of dataset diversity in training models with greater generalisation power. By bridging the gap between affordability and sophistication, this research contributes valuable insights into the fields of emotion recognition and affective computing.
Original languageEnglish
Article number2025.0019
Number of pages16
JournalComputing&AI Connect
Volume2
Early online date30 Jun 2025
Publication statusE-pub ahead of print - 30 Jun 2025

Keywords

  • vision transformers
  • convolutional neural network
  • emotion recognition
  • thermal camera
  • facial expression recognition
  • internet-of-things

Fingerprint

Dive into the research topics of 'AI enabled facial emotion recognition using low-cost thermal cameras'. Together they form a unique fingerprint.

Cite this