Today, public areas, such as airports, hospitals, city centers are monitored by surveillance systems. The widespread use of surveillance systems reduces security concerns while creating an amount of video data that cannot be examined by people in real-time. Therefore, the concept of automatic understanding of video activities has raised the standards of security camera systems. In this paper, we propose a framework (OF-ConvAE-LSTM) to detect anomalies using Convolutional Autoencoder and Convolutional Long Short-Term Memory in an unsupervised manner. Besides the deep learning model, the feature extraction stage based on dense optical flow is applied in the framework to obtain the velocity and direction information of foreground objects. The experiments were carried out on three popular public datasets consisting of Avenue, UCSD Ped1, and UCSD Peds2. The experimental results have shown that the proposed framework models the complex distribution of the pattern of regular motion changes with high accuracy. Besides, this method was observed to outperform state-of-the-art approaches based on unsupervised and semi-supervised deep learning models.INDEX TERMS Abnormal event detection, convolutional autoencoder, long short-term memory, optical flow.