A Large-scale base map is needed by metropolitan cities such as Surabaya City for urban planning and smart city development. The most needed information from a large-scale base map is road geospatial information. Road network extraction is a challenging task for many reasons, including the heterogeneous attribute of the geometry and spectral, the complexity of objects that are difficult to model, and poor sensor data. Visual interpretation by operators is still a commonly used approach for extracting information from orthophoto. Interpretation accuracy depends on the skill and experience of the operator. So, there can be inconsistencies in the data generated by different operators. In recent years, the automatic extraction of road from orthophoto or VHR image has become an important and challenging research issue. Many recent studies have explored deep learning to improve the quality of building and road extraction. In this study, we applied a Mask Region-based Convolutional Neural Network (Mask R-CNN) model for the road network extraction using orthophoto in urban area in Surabaya City. The quality of the extracted geometry needs to be improved. Several post-processing strategies, including polygon regularization using Douglas-Peucker algorithm and polygon smoothing are designed to achieve optimal extraction results. The method produces a good performance for road extraction, the precision is 90.28%; recall 85.85%; F1-score 88.01%; and IoU 78.59%; and the overall accuracy is 95.25% and the kappa value is 90.5%.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.