Machine-learning-based top-view safety monitoring of ground workforce on complex industrial sites

Gelayol Golcarenarenji, I Martinez-Alpiste, Q Wang, J.M. Alcaraz Calero

    Research output: Contribution to journal › Article › peer-review

    6 Citations (Scopus)
    42 Downloads (Pure)

    Abstract

    Telescopic cranes are powerful lifting facilities employed in construction, transportation, manufacturing and other industries. Because the ground workforce cannot be fully aware of its surroundings during crane operations on busy and complex sites, accidents and even fatalities cannot always be avoided. Hence, deploying an automatic and accurate top-view human detection solution would significantly improve the health and safety of the workforce on such industrial operational sites. The proposed method (CraneNet) is a new machine-learning-based solution that increases the visibility available to a crane operator in complex industrial operational environments while addressing the challenges of top-view human detection on a resource-constrained small-form PC, chosen to meet the space constraint in the operator's cabin. CraneNet consists of 4 modified ResBlock-D modules to fulfill the real-time requirements. To increase the detection accuracy for small humans seen from high altitudes, which is crucial for this use case, a PAN (Path Aggregation Network) was designed and added to the architecture. This enhances the structure of CraneNet by adding a bottom-up path to propagate low-level information. Furthermore, three output layers were employed in CraneNet to further improve the accuracy on small objects. Spatial Pyramid Pooling (SPP) was integrated at the end of the backbone stage, which increases the receptive field of the backbone and thereby the accuracy. CraneNet achieved 92.59% accuracy at 19 FPS on a portable device. The proposed machine learning model has been trained with the Stanford Drone Dataset (SDD) and VisDrone 2019 to further show the efficacy of the smart crane approach. Consequently, the proposed system is able to detect people in complex industrial operational areas at a distance of up to 50 meters between the camera and the person. This system is also applicable to the detection of other objects from an overhead camera.
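The abstract states that SPP at the end of the backbone enlarges the receptive field, but it does not give the layer configuration. The following is only a minimal pure-Python sketch of the general SPP idea: stride-1 max pooling at several window sizes, with the results concatenated to the input along the channel axis. The kernel sizes (5, 9, 13) are an assumption borrowed from common YOLO-style SPP blocks, and the names `max_pool_same` and `spp` are hypothetical; none of this is taken from CraneNet itself.

```python
def max_pool_same(fmap, k):
    """Stride-1 max pooling over one 2D channel, keeping the input size.

    Clipping the window at the borders is equivalent to padding with -inf.
    """
    h, w = len(fmap), len(fmap[0])
    r = k // 2
    out = []
    for i in range(h):
        row = []
        for j in range(w):
            window = [
                fmap[y][x]
                for y in range(max(0, i - r), min(h, i + r + 1))
                for x in range(max(0, j - r), min(w, j + r + 1))
            ]
            row.append(max(window))
        out.append(row)
    return out


def spp(channels, kernel_sizes=(5, 9, 13)):
    """Return the input channels followed by one pooled copy per kernel size.

    kernel_sizes is an assumed setting, not a value reported for CraneNet.
    """
    out = list(channels)
    for k in kernel_sizes:
        out.extend(max_pool_same(ch, k) for ch in channels)
    return out


# Toy input: one 13x13 channel with a single activation at the centre.
fmap = [[0.0] * 13 for _ in range(13)]
fmap[6][6] = 1.0
out = spp([fmap])
print(len(out))  # 4 channels: the input plus three pooled copies
```

In this toy input the single activation reaches progressively more output cells as the pooling window grows, which is the sense in which each location after SPP summarises a wider region of the feature map.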
    Original language: English
    Pages (from-to): 4207-4220
    Number of pages: 14
    Journal: Neural Computing and Applications
    Volume: 34
    Issue number: 6
    Early online date: 22 Oct 2021
    DOIs
    Publication status: Published - 31 Mar 2022

    Keywords

    • smart telescopic crane
    • human detection
    • complex industrial sites
    • deep learning
