Abstract
This research addresses the pitfalls and strategies for machine learning with small data sets in the context of sensor-based fatigue detection. It is shown that many existing studies in this area rely on small data sets and that classification results can vary considerably depending on the evaluation method. Our analysis is based on a study with 46 subjects performing multiple sets of squat exercises in a laboratory setting. Data from ratings of perceived exertion, inertial measurement units, and pose estimation were used to train and compare different classifiers. Our findings suggest that commonly used evaluation methods, such as leave-one-subject-out, should be used with caution and may not lead to generalizable classifiers. Furthermore, challenges related to imbalanced data and oversampling are discussed.
Original language | English |
---|---|
Title of host publication | PETRA '23 |
Subtitle of host publication | Proceedings of the 16th International Conference on PErvasive Technologies Related to Assistive Environments |
Place of Publication | New York, NY |
Publisher | Association for Computing Machinery |
Pages | 364–373 |
Number of pages | 10 |
ISBN (Print) | 9798400700699 |
DOIs | |
Publication status | Published - 10 Aug 2023 |