In massive multiple-input multiple-output (MIMO) systems with time shifted pilot (TSP) schemes, the inter-group interference caused by the pilot contamination can be eliminated when the number of base station (BS) antennas M approaches infinity. However, M is finite in practice and the effectiveness of the TSP is limited by channel estimation errors. In this paper, it is analytically shown that the mean square channel estimation error (MSCEE) of the TSP is dominated by the intergroup data interference. To reduce the MSCEE in the finite antenna massive MIMO systems, an interference cancellation based channel estimation for the TSP (IC-TSP) is proposed, where the dominant inter-group data interference is canceled based on BS cooperation. To show the advantage of the IC-TSP, the additional overhead of IC-TSP is evaluated by considering different M and the coherence time of BS-BS channels. Furthermore, the impact of sectorization and compressed sensing based BS-BS channel estimation are also discussed. We show that when 128 ≤ M ≤ 2048, with the inter-group data interference from the nearest two cell layers being canceled, the IC-TSP achieves a spectral efficiency gain of more than 1.2 bps/Hz over the TSP.