“…For the above reasons, the desired specifications that such a trajectory clustering algorithm should hold, in order to be able to predict the movement of future trajectories, are the following: There have been some approaches to deal with the problem of subtrajectory clustering in a centralized way [1,21,32], however, all the above subtrajectory clustering approaches are centralized and do not scale with the size of today's trajectory data, thus calling for parallel and distributed algorithms. For this reason, we utilize the work presented in [36], coined DSC, which introduces an efficient and highly scalable approach to deal with the problem of Distributed Subtrajectory Clustering, by means of MapReduce. More specifically, the authors of [36] split the original problem to three sub-problems, namely Subtrajectory Join, Trajectory Segmentation and Clustering and Outlier Detection, and deal with each one in a distributed fashion by utilizing the MapReduce programming model.…”