Optimisation of spatial-exploitation CNN models through hyperparameter-tuning and human-in-the-loop combination

    Research output: Contribution to journalArticlepeer-review

    6 Downloads (Pure)

    Abstract

    Spatial-exploitation convolutional neural networks (CNNs) have a simplified architecture compared to other CNN models. However, devices with limited computational resources could struggle with processing spatial-exploitation CNNs. To address this, we investigate two methods to optimise spatial-exploitation CNN models for time efficiency and classification accuracy: hyperparameter-tuning, and human-in-the-loop (HITL). We apply grid-search to optimise the hyperparameter space, whilst HITL is used to identify whether the time-to-accuracy relationship of the optimised model can be improved. To show the versatility of combining the two methods, CIFAR-10, MNIST, and Imagenette are used as model input. This paper contributes to spatial-exploitation CNN optimisation by combining hyperparameter-tuning and HITL. Results show that this combination improves classification accuracy by 1.47%-2.34% and reduces the time taken to conduct this task by 27%-28%, depending on dataset. We conclude that combining hyperparameter-tuning and HITL are a viable approach to optimise spatial-exploitation CNNs for devices with limited computational resources.
    Original languageEnglish
    Pages (from-to)1-13
    Number of pages13
    JournalInternational Journal of Artificial Intelligence and Soft Computing
    Volume8
    Issue number2
    DOIs
    Publication statusPublished - 4 Jul 2024

    Keywords

    • deep learning
    • convolutional neural network (CNN)
    • image classification
    • hyperparameter-tuning
    • human-in-the-loop (HITL)

    Fingerprint

    Dive into the research topics of 'Optimisation of spatial-exploitation CNN models through hyperparameter-tuning and human-in-the-loop combination'. Together they form a unique fingerprint.

    Cite this