The wind microclimate plays an important role in architectural design, and computational fluid dynamics is a method commonly used for analyzing the issue. However, due to its high technical difficulty and time-consuming nature, it limits the interaction and exploration between designers and environment performance analyses. To address the issue, scholars have proposed a series of approximation models based on machine learning that have partially improved computational efficiency. However, these methods face challenges in terms of balancing applicability, prediction accuracy, and sample size. In this paper, we propose a method based on the classic Vggnet deep convolutional neural network as the backbone to construct an approximate model for predicting steady-state flow fields in urban areas. The method is trained on a small amount of sample data and can be extended to calculate the wind environment performance. Furthermore, we investigated the differences between geometric representation methods, such as the Boolean network representation and signed distance function, as well as different structure models, such as Vgg-CFD-11, Vgg-CFD-13, Vgg-CFD-16, and Vgg-CFD-19. The results indicate that the model can be trained using a small amount of sample data, and all models generally possess the ability to predict the wind environment. The best performance on the validation set and test set was achieved with an RMSE (Root Mean Square Error) of 0.7966 m/s and 2.2345 m/s, respectively, and an R-Squared score of 0.9776 and 0.8455. Finally, we embedded the best-performing model into an architect-friendly urban comprehensive analysis platform, URBAN NEURAL-CFD.