Indonesia is a country that possesses natural wonders and historical buildings, which made Indonesia become one of the popular tourist destination. The scenery classification is a challenging task where the feature distribution from each image may spread. In addition, Indonesian tourism spots are plentiful. In this research, we proposed to utilize deep learning for tourism spot classification using Convolutional Neural Network (CNN) as feature extraction. The dataset consists of different tourism varieties, from man-made objects such as monuments to natural objects such as mountains or beaches. The dataset was selfgathered from the internet and various social media with different angles and does not include any images that dominantly contain people. In addition, in the context of CNN as the basis of feature extractor, we also compared the result with pre-trained CNN architecture trained with Place-365 and ImageNet dataset. The first test was conducted with shallow CNN achieving 48% for the non-augmented dataset and 51% for the augmented dataset. The second test performed with VGG16 and ResNet, combining data augmentation and a pre-trained network. The result reveals data augmentation improves the validation accuracy. Pre-trained VGG16 with Place-365 achieved the highest validation of 90% compared to the other combination. A pretrained network with an augmentation combination improves the model performances significantly by a rough margin.