A problem of single image super-resolution is considered, where the goal is to recover one high-resolution image from one low-resolution image. Whereas this problem has been successfully solved so far by the known VDSR network, such an approach still cannot give an overall beneficial effect compared to bicubic interpolation. This is so due to the fact that the image reconstruction quality has been estimated separately by three subjective factors. Moreover, the original VDSR network consisting of 20 convolutional layers is apparently not optimal by its depth. This is why here those factors are aggregated, and the network performance is deemed by a single estimator. Then the depth is tried to be decreased (truncation) along with adjusting the learning rate drop factor. Finally, a plausible improvement of the VDSR network is confirmed. The best truncated network, performing by almost 3.2 % better than bicubic interpolation, occupies less memory space and is about 1.44 times faster than the original VDSR network for images of a medium size.