“…After colour images are captured by the camera and changed to their greyscale forms, a background modelling is built, and according to the principle of consistency of time, a short image sequence is used to generate the initial background model as follows: where N is the number of observed image ALTF in the background model and K is the specified frame interval of taking a sample from the video, is the ALTF sample at a location in the first frame, and is the ALTF sample in the frame. To avoid the generation of a ghost, the Pixel‐Based Adaptive Segmentation [32] and dual sample consensus model [33] initialise the background model with the image values at each pixel in the first N frames. However, those methods also lead to ghost in urban traffic scenes due to slow‐moving or temporarily stopped vehicles.…”