The use of cameras for safety monitoring, progress tracking, and site security has grown significantly on construction and civil infrastructure sites over the past decade. Localization of construction resources is a crucial prerequisite for many applications in automated construction management. However, most existing vision-based methods perform the analysis in the image plane, overlooking the effect of perspective and depth. The manual and labor-intensive process of traditional calibration techniques, as well as the busy and restrictive construction environment, makes this a challenging task. This study proposes a framework for automatic camera calibration with no manual intervention. The framework utilizes convolutional neural networks for geometrical scene analysis and object detection, which are used to estimate the location of horizon line, vertical vanishing point, as well as objects with known height distributions. This enables automatic estimation of camera parameters and retrieval of scale. The proposed framework is evaluated on images from two major construction projects in Melbourne, Australia. Results show that the proposed method achieves a minimum accuracy of 90% in estimating proximity of points on the ground and can facilitate further development of vision-based solutions for safety and productivity analysis.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.