Due to clutter inhomogeneity, the clutter suppression ability of space–time adaptive processing (STAP) is usually constrained by the insufficient number of independent and identically distributed (IID) clutter training samples and, as a result, is sacrificed to achieve the demanded sample reduction. Moreover, since clutter heterogeneity is exacerbated in the real environment, the IID training sample size can be heavily reduced, leading to the deterioration in clutter suppression. To solve this problem, a novel robust space–time joint sparse processing method with airborne active array is proposed. This method has several outstanding advantages: (1) only the single snapshot cell under test (CUT) data is used for the superior clutter suppression performance; and (2) the proposed method completely removes the dependence of the system processing ability on IID training samples. In this paper, the signal model of uniform transmitting subarray diversity is first established to obtain the single snapshot echo observed CUT data. Then, with the matched reconstruction, the single snapshot data are equivalently converted into multi-frame echo data. Finally, a fast multi-frame echo data joint sparse Bayesian algorithm is used to achieve heterogeneous clutter suppression. Numerous experiments were performed to verify the advantages of the proposed method.