Massive multiple-input multiple-output (massive-MIMO) is foreseen as a potential technology for future 5G cellular communication networks due to its substantial benefits in terms of increased spectral and energy efficiency. These advantages of massive-MIMO are a consequence of equipping the base station (BS) with quite a large number of antenna elements, thus resulting in an aggressive spatial multiplexing. In order to effectively reap the benefits of massive-MIMO, an adequate estimate of the channel impulse response (CIR) between each transmit-receive link is of utmost importance. It has been established in the literature that certain specific multipath propagation environments lead to a sparse structured CIR in spatial and/or delay domains. In this paper, implicit training and compressed sensing based CIR estimation techniques are proposed for the case of massive-MIMO sparse uplink channels. In the proposed superimposed training (SiT) based techniques, a periodic and low power training sequence is superimposed (arithmetically added) over the information sequence, thus avoiding any dedicated time/frequency slots for the training sequence. For the estimation of such massive-MIMO sparse uplink channels, two greedy pursuits based compressed sensing approaches are proposed, viz: SiT based stage-wise orthogonal matching pursuit (SiT-StOMP) and gradient pursuit (SiT-GP). In order to demonstrate the validity of proposed techniques, a performance comparison in terms of normalized mean square error (NCMSE) and bit error rate (BER) is performed with a notable SiT based least squares (SiT-LS) channel estimation technique. The effect of channels' sparsity, training-to-information power ratio (TIR) and signal-to-noise ratio (SNR) on BER and NCMSE performance of proposed schemes is thoroughly studied. For a simulation scenario of: 4 × 64 massive-MIMO with a channel sparsity level of 80% and signal-to-noise ratio (SNR) of 10 dB, a performance gain of 18 dB and 13 dB in terms of NCMSE over SiT-LS is observed for the proposed SiT-StOMP and SiT-GP techniques, respectively. Moreover, a performance gain of about 3 dB and 2.5 dB in SNR is achieved by the proposed SiT-StOMP and SiT-GP, respectively, for a BER of 10 −2 , as compared to SiT-LS. This performance gain NCME and BER is observed to further increase with an increase in channels' sparsity.