Abstract
Background: Accurate classification of physical behavior from accelerometer data is crucial for health and behavioral research. While machine learning models often perform well within the populations they are trained on, they are rarely validated on independent populations, and their generalizability remains poorly understood. Therefore, we aimed to externally validate a widely used random forest model for physical behavior classification, and to assess whether its performance varied by participants’ age, sex, or body mass index. Methods: We validated the random forest classifier, trained by Ellis et al., which achieved a balanced accuracy of 79% for classifying sitting, standing, and walking/running from hip-worn accelerometer data in the original training population. For the external validation, we obtained ActiGraph recordings for 610 participants from four European countries from the WEALTH (WEarable sensor Assessment of physicaL and eaTing beHaviors) project, which were labeled with the corresponding free-living behavior using ecological momentary assessment. Classifier performance was assessed using confusion matrices, precision, recall, F-score, and balanced accuracy. Results: In the WEALTH population, the random forest classifier achieved a balanced accuracy of 40% and an average F-score of 0.33. Precision and recall were highest for sitting, followed by walking/running and standing. Performance was consistent across subpopulations defined by age, sex, and body mass index. Conclusion: The substantial reduction in accuracy demonstrates the limited generalizability of the existing random forest classifier. Our findings underscore the need for external validation and more diverse training data to ensure robust application of machine learning models in physical behavior research.
| Original language | English |
|---|---|
| Article number | jmpb.2025-0030 |
| Pages (from-to) | 1-9 |
| Number of pages | 9 |
| Journal | Journal for the Measurement of Physical Behaviour |
| Volume | 8 |
| Issue number | 1 |
| DOIs | |
| Publication status | Published - Jan 2025 |
Keywords
- ActiGraph
- activity recognition
- free-living assessment
- model generalizability
- random forest
Fingerprint
Dive into the research topics of 'External Validation of a Hip-Worn Accelerometry-Based Machine Learning Model for Physical Behavior Classification in Free-Living Conditions'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver