Recent Parkinson's disease (PD) research has focused on recognizing vocal defects from people's prolonged vowel phonations or running speech since 90% of Parkinson's patients demonstrate vocal dysfunction in the early stages of the illness. This research provides a hybrid analysis of time and frequency and deep learning techniques for PD signal categorization based on ResNet50. The recommended strategy eliminates manual procedures to perform feature extraction in machine learning. 2D time-frequency graphs give frequency and energy information while retaining PD morphology. The method transforms 1D PD recordings into 2D time-frequency diagrams using hybrid HT/Wigner-Ville distribution (WVD). We obtained 91.04% accuracy in five-fold cross-validation and 86.86% in testing using RESNET50. F1-score achieved 0.89186. The suggested approach is more accurate than state-of-the-art models.