Despite the benefits of ultrasound imaging systems for medical diagnosis and treatment, ultrasound images are prone to low resolution and contrast due to ultrasound’s inherent attributes, as well as affected by speckle noise that directly influences their quality. In retrospective studies, diverse filters have been applied to minimize the effects of speckle noise and enhance the quality of ultrasound images. In this article, we propose a method of enhancing ultrasound images inspired by synthetic aperture imaging, which provides high-resolution images by adding low-resolution images and measuring the probe’s movement. Our proposed method does not involve synthetic aperture imaging but compensates for the motion effect in the temporal dimension, aligns consecutive images, and stacks aligned images to suppress speckle noise and consequently enhance the resolution of ultrasound images. We exploited deep neural network models to estimate motion parameters between consecutive ultrasound images. In a new database of ultrasound images, we also collected the images’ position-related information implicitly measured in inertial measurement units, which was exploited as a ground truth for motion parameters between consecutive images. Compared with other image-enhancing techniques involving conventional filters and deep neural network modalities, our method demonstrated superiority in enhancing the quality of ultrasound images. We also found that estimating motion parameters directly influenced the success of the image-stacking process. As in ablation studies in deep neural networks, we additionally investigated the effect of dropping some images in the temporal dimension, which revealed that contextual differences and excessive rates of movement in successive ultrasound images weakens the image-stacking process and thus the potential enhancement of ultrasound images.