Emphysema is normally employed in the sense of morphology, and thus, imaging the modalities has a significant part in detecting the disorder. Most importantly, the High-Resolution Computed Tomography (HRCT) is a trustworthy component for illustrating the emphysema pathology. Hence, the pulmonary emphysema diagnosis at the starting stage is very important and can secure the life of a human. When there is a failure in the technique, there is a possibility of the death that displays the importance of effective diagnosis. The endotracheal intubation’s delay is prevented by implementing distinct mechanisms that result in various requirements. The presented work develops a deep learning-aided pulmonary emphysema disease classification model that is implemented to defeat the above difficulties. At first, the images are accumulated from a standard dataset. Then, the collected images are provided to the phase of segmentation. Here, the newly developed Adaptive Trans-Residual Dense UNet (ATRDUNet) is employed to perform the segmentation process. Also, the recommended ATRDUNet model helps to locate the disease in the appropriate location. Moreover, the performance of the segmentation process is enriched to fine-tune the parameter using the Fitness Revised Position of Wild Horse Optimizer (FRP-WHO). While optimizing the parameters effectively, it tends to handle the complex issues to strengthen the model accurately. The segmented results from the ATRDUNet are passed to Vision Transformer-based Residual DenseNet with LSTM layer (ViTrans-RDLSTM) for getting the final classified outcome. To this end, the analysis of the suggested model is correlated to find the effectiveness of the recommended model. However, these results are examined with the help of standard performance measures. Also, the experimentation is carried out using the standard “Automatic emphysema detection using weakly labeled HRCT lung images” dataset to attain better outcomes in the developed model. The findings of the recommended approach show 95% in terms of accuracy. Also, the statistical analysis of the developed model shows 23.9%, 12.3%, 15.2%, and 13.0% better performance than BWO, CO, HBA, and WHO in terms of mean analysis to show enriched performance.