This paper presents a viewpoint from computer vision to the radar echo extrapolation task in the precipitation nowcasting domain. Inspired by the success of some convolutional recurrent neural network models in this domain, including convolutional LSTM, convolutional GRU and trajectory GRU, we designed a new sequence-to-sequence neural network structure to leverage these models in a realistic data context. In this design, we decreased the numbers of channels in high abstract recurrent layers rather than increasing them. We formulated the task as a problem of encoding five radar images and predicting 10 steps ahead at the pixel level, and found that using only the common mean squared error can misguide the training and mislead the testing. Especially, the image quality of last predictions usually degraded rapidly. As a solution, we employed some visual image quality assessment techniques including Structural Similarity (SSIM) and multi-scale SSIM to train our models. Experimental results show that our structure was more tolerant to increasing uncertainty in the data, and the use of image quality metrics can significantly reduce the blurry image issue. Moreover, we found that using SSIM was very effective and a combination of SSIM with mean squared error and mean absolute error yielded the best prediction quality.
This article presents an investigation into the problem of 3D radar echo extrapolationin precipitation nowcasting, using recent AI advances, together with a viewpoint from ComputerVision. While Deep Learning methods, especially convolutional recurrent neural networks, havebeen developed to perform extrapolation, most works use 2D radar images rather than 3D images.In addition, the very few ones which try 3D data do not show a clear picture of results. Throughthis study, we found a potential problem in the convolution-based prediction of 3D data, which issimilar to the cross-talk effect in multi-channel radar processing but has not been documented well inthe literature, and discovered the root cause. The problem was that, when we generated differentchannels using one receptive field, some information in a channel, especially observation errors,might affect other channels unexpectedly. We found that, when using the early-stopping technique toavoid over-fitting, the receptive field did not learn enough to cancel unnecessary information. If weincreased the number of training iterations, this effect could be reduced but that might worsen theover-fitting situation. We therefore proposed a new output generation block which generates eachchannel separately and showed the improvement. Moreover, we also found that common imageaugmentation techniques in Computer Vision can be helpful for radar echo extrapolation, improvingtesting mean squared error of employed models at least 20% in our experiments.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.