Modern Earth Observation systems continuously generate huge amounts of data. A notable example is the Sentinel-2 mission, which provides images at high spatial resolution (up to 10 m) with a high revisit frequency (every 5 days), and these images can be organized into Satellite Image Time Series (SITS). While SITS have been proven beneficial for Land Use/Land Cover (LULC) map generation, the machine learning approaches commonly used in the remote sensing field fail to take advantage of the spatio-temporal dependencies present in such data.

Recently, new-generation deep learning methods have significantly advanced research in this field. These approaches have generally focused on a single type of neural network, i.e., Convolutional Neural Networks (CNNs) or Recurrent Neural Networks (RNNs), which model different but complementary information: spatial autocorrelation (CNNs) and temporal dependencies (RNNs).

In this work, we propose the first deep learning architecture for the analysis of SITS data, namely DuPLO (DUal view Point deep Learning architecture for time series classificatiOn), that combines Convolutional and Recurrent neural networks to exploit their complementarity. Our hypothesis is that, since CNNs and RNNs capture different aspects of the data, a combination of both models would produce a more diverse and complete representation of the information for the underlying land cover classification task. Experiments carried out on two study sites with different land cover characteristics (i.e., the Gard site in France and Reunion Island in the Indian Ocean) demonstrate the significance of our proposal.

Classification, Sentinel-2

... monitoring and land management planning [1,2,3,4,5,6,7,8,9]. In the context of Land Use/Land Cover (LULC) classification, exploiting SITS can help to discriminate among classes that exhibit different temporal behaviors [10], compared with the results that can be obtained using a single image.
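To make the dual-view idea concrete, the following NumPy sketch computes a spatial descriptor with a convolutional branch and a temporal descriptor with a recurrent branch, then concatenates them before a softmax classifier. It is only an illustration under stated assumptions and not the DuPLO architecture itself: the single convolution layer, the plain Elman recurrence, the use of the central pixel for the temporal branch, the random untrained weights, and all function and parameter names (`cnn_branch`, `rnn_branch`, `dual_view_features`, `random_params`) are hypothetical choices made for brevity.

```python
import numpy as np


def cnn_branch(patch_series, filters):
    """Spatial view. patch_series: (T, B, H, W); filters: (F, T*B, k, k)."""
    T, B, H, W = patch_series.shape
    x = patch_series.reshape(T * B, H, W)  # dates and bands stacked as channels
    k = filters.shape[-1]
    # All k x k windows: shape (T*B, H-k+1, W-k+1, k, k)
    windows = np.lib.stride_tricks.sliding_window_view(x, (k, k), axis=(1, 2))
    fmap = np.einsum("chwij,fcij->fhw", windows, filters)  # valid convolution
    fmap = np.maximum(fmap, 0.0)                           # ReLU
    return fmap.mean(axis=(1, 2))                          # global average pool -> (F,)


def rnn_branch(pixel_series, W_in, W_rec, b):
    """Temporal view. pixel_series: (T, B). Returns the last hidden state (D,)."""
    h = np.zeros(W_rec.shape[0])
    for x_t in pixel_series:  # one step per acquisition date
        h = np.tanh(W_in @ x_t + W_rec @ h + b)
    return h


def dual_view_features(patch_series, params):
    """Concatenate the spatial (CNN) and temporal (RNN) descriptors."""
    _, _, H, W = patch_series.shape
    spatial = cnn_branch(patch_series, params["filters"])
    center = patch_series[:, :, H // 2, W // 2]  # (T, B) series of the central pixel
    temporal = rnn_branch(center, params["W_in"], params["W_rec"], params["b"])
    return np.concatenate([spatial, temporal])


def classify(features, W_out, b_out):
    """Softmax over land cover classes."""
    logits = W_out @ features + b_out
    exp = np.exp(logits - logits.max())
    return exp / exp.sum()


def random_params(T, B, k, F, D, n_classes, rng):
    """Untrained random weights, for shape checking only (assumption)."""
    return {
        "filters": rng.normal(scale=0.1, size=(F, T * B, k, k)),
        "W_in": rng.normal(scale=0.1, size=(D, B)),
        "W_rec": rng.normal(scale=0.1, size=(D, D)),
        "b": np.zeros(D),
        "W_out": rng.normal(scale=0.1, size=(n_classes, F + D)),
        "b_out": np.zeros(n_classes),
    }


rng = np.random.default_rng(0)
T, B, H, W = 6, 4, 5, 5  # 6 dates, 4 bands, 5 x 5 pixel patch (illustrative sizes)
params = random_params(T, B, k=3, F=8, D=16, n_classes=3, rng=rng)
patch_series = rng.normal(size=(T, B, H, W))

features = dual_view_features(patch_series, params)
probs = classify(features, params["W_out"], params["b_out"])
print(features.shape, probs.shape)
```

In this sketch the two branches see the same sample from different viewpoints: the convolutional branch summarizes the spatial neighborhood across all dates, while the recurrent branch follows the temporal trajectory of one pixel. In practice both branches would be trained jointly on labeled samples.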
In [7], the authors propose to exploit SITS data to extract land units that are homogeneous in terms of phenological patterns and, subsequently, to classify those land units automatically according to their land cover. The effectiveness of Sentinel-2 SITS for producing land cover maps at country scale has been shown in [8], demonstrating the practical interest of such a data source. In [9], the authors combine multi-source optical (Landsat-8) and radar (Sentinel-1) SITS in order to improve land cover maps in the agricultural domain. Another example is given in [3], where optical SITS are leveraged to characterize grassland areas as a proxy indicator for biodiversity, food production, and the global carbon cycle.

Despite the usefulness of the temporal trends that can be derived from remote sensing time series, most of the strategies proposed for SITS analysis tasks [11,12,8,7] directly apply standard machine learning approaches (e.g., Random Forest, SVM) to the stacked images, thus ignoring any temporal dependencies that may be discovered in the data. Indeed, such algorithms make the assumption that the infor...