Seismic inversion can be divided into time-domain inversion and frequency-domain inversion based on different transform domains. Time-domain inversion has stronger stability and noise resistance compared to frequency-domain inversion. Frequency domain inversion has stronger ability to identify small-scale bodies and higher inversion resolution. Therefore, the research on the joint inversion method in the time-frequency domain is of great significance for improving the inversion resolution, stability, and noise resistance. The introduction of prior information constraints can effectively reduce ambiguity in the inversion process. However, the existing model-driven time-frequency joint inversion assumes a specific prior distribution of the reservoir. These methods do not consider the original features of the data and are difficult to describe the relationship between time-domain features and frequency-domain features. Therefore, this paper proposes a high-resolution seismic inversion method based on joint data-driven in the time-frequency domain. The method is based on the impedance and reflectivity samples from logging, using joint dictionary learning to obtain adaptive feature information of the reservoir, and using sparse coefficients to capture the intrinsic relationship between impedance and reflectivity. The optimization result of the inversion is achieved through the regularization term of the joint dictionary sparse representation. We have finally achieved an inversion method that combines constraints on time-domain features and frequency features. By testing the model data and field data, the method has higher resolution in the inversion results and good noise resistance.