Following its use in several applications, including video coding in wireless surveillance, moving object detection (MOD) has become a popular video analysis topic. Despite the considerable progress in the accuracy of MOD for video coding, its implementation in constrained sensors is a real challenge owing to their high complexity and energy consumption. Therefore, there is a great need to address the trade-off between the accuracy and the energy efficiency of MOD approaches for video coding in constrained systems. In this work, an energy-efficient region-of-interest (ROI) detection algorithm as a pre-encoder for wireless visual surveillance (WVS) is proposed. The algorithm ensures a trade-off between detection accuracy and computational complexity. To this end, we propose constructing an activity map by measuring each block activity between successive frames. The map scores are processed using a combination of a fast Gaussian smoother and a rank-order filter to improve accuracy. Only the blocks in motion are coded and counted for transmission. The accuracy of our approach has been evaluated on a large dataset using key performance metrics. It has been found that our algorithm outperforms other state-of-theart techniques in terms of true positive rate (TPR), with 80.84% on sensitivity metric, while exhibiting a well-balanced accuracy for all categories. A careful examination of the computational complexity confirms the low overhead. The energy and bitrate savings could achieve nearly 90% and 98%, respectively.