Although previous research has laid the foundation for vision-based monitoring systems using convolutional neural networks (CNNs), little attention has been paid to the challenges of data imbalance and varying object sizes in far-field monitoring. To fill this knowledge gap, this paper investigates various loss functions and designs a customized loss function that addresses these challenges. Scaffold installation operations recorded by camcorders in a far-field surveillance setting were selected as the subject of analysis. The data imbalance among workers, hardhats, harnesses, straps, and hooks was confirmed to cause poor performance, especially for small objects. This problem was mitigated by including region-based loss and Focal loss terms in the loss function of the segmentation models. The findings illustrate the importance of loss function design in improving the performance of CNN models for far-field construction site monitoring.
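To make the idea of combining a region-based term with a Focal loss term concrete, the following is a minimal NumPy sketch, not the loss used in the paper. It assumes soft Dice as the region-based term (the abstract does not say which region-based loss was used), and the function names, the mixing parameter `weight`, and the focusing parameter `gamma` are hypothetical choices for illustration.

```python
import numpy as np

def focal_loss(probs, onehot, gamma=2.0, eps=1e-7):
    """Multi-class focal loss averaged over pixels.

    probs, onehot: arrays of shape (num_pixels, num_classes).
    """
    # Predicted probability of the true class for each pixel.
    p_t = np.clip((probs * onehot).sum(axis=1), eps, 1.0)
    # Well-classified pixels (p_t near 1) are down-weighted by (1 - p_t)^gamma.
    return float(np.mean(-((1.0 - p_t) ** gamma) * np.log(p_t)))

def dice_loss(probs, onehot, eps=1e-7):
    """Soft Dice loss averaged over classes (assumed region-based term)."""
    intersection = (probs * onehot).sum(axis=0)
    denom = probs.sum(axis=0) + onehot.sum(axis=0)
    dice = (2.0 * intersection + eps) / (denom + eps)
    # Averaging per class gives small classes the same weight as large ones.
    return float(np.mean(1.0 - dice))

def combined_loss(probs, onehot, weight=0.5, gamma=2.0):
    """Weighted sum of the two terms; `weight` is a hypothetical mixing factor."""
    region = dice_loss(probs, onehot)
    focal = focal_loss(probs, onehot, gamma=gamma)
    return weight * region + (1.0 - weight) * focal
```

Under these assumptions, the Dice term is computed per class, so a small class such as hooks contributes as much as a large class such as workers, while the focal term reduces the influence of pixels that are already classified confidently. With `gamma=0` the focal term reduces to ordinary cross-entropy.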