Massive multiple-input multiple-output (massive-MIMO) is foreseen as a potential technology for future 5G cellular communication networks due to its substantial benefits in terms of increased spectral and energy efficiency. These advantages of massive-MIMO are a consequence of equipping the base station (BS) with quite a large number of antenna elements, thus resulting in an aggressive spatial multiplexing. In order to effectively reap the benefits of massive-MIMO, an adequate estimate of the channel impulse response (CIR) between each transmit–receive link is of utmost importance. It has been established in the literature that certain specific multipath propagation environments lead to a sparse structured CIR in spatial and/or delay domains. In this paper, implicit training and compressed sensing based CIR estimation techniques are proposed for the case of massive-MIMO sparse uplink channels. In the proposed superimposed training (SiT) based techniques, a periodic and low power training sequence is superimposed (arithmetically added) over the information sequence, thus avoiding any dedicated time/frequency slots for the training sequence. For the estimation of such massive-MIMO sparse uplink channels, two greedy pursuits based compressed sensing approaches are proposed, viz: SiT based stage-wise orthogonal matching pursuit (SiT-StOMP) and gradient pursuit (SiT-GP). In order to demonstrate the validity of proposed techniques, a performance comparison in terms of normalized mean square error (NCMSE) and bit error rate (BER) is performed with a notable SiT based least squares (SiT-LS) channel estimation technique. The effect of channels’ sparsity, training-to-information power ratio (TIR) and signal-to-noise ratio (SNR) on BER and NCMSE performance of proposed schemes is thoroughly studied. For a simulation scenario of: 4 × 64 massive-MIMO with a channel sparsity level of 80 % and signal-to-noise ratio (SNR) of 10 dB , a performance gain of 18 dB and 13 dB in terms of NCMSE over SiT-LS is observed for the proposed SiT-StOMP and SiT-GP techniques, respectively. Moreover, a performance gain of about 3 dB and 2.5 dB in SNR is achieved by the proposed SiT-StOMP and SiT-GP, respectively, for a BER of 10 − 2 , as compared to SiT-LS. This performance gain NCME and BER is observed to further increase with an increase in channels’ sparsity.