In this study, we introduce an innovative deep-learning model architecture aimed at enhancing the accuracy of detecting and classifying organizing pneumonia (OP), a condition characterized by the presence of Masson bodies within the alveolar spaces due to lung injury. The variable morphology of Masson bodies and their resemblance to adjacent pulmonary structures pose significant diagnostic challenges, necessitating a model capable of discerning subtle textural and structural differences. Our model incorporates a novel architecture that integrates advancements in three key areas: Semantic segmentation, texture analysis, and structural feature recognition. We employed a dataset of whole slide imaging from 20 patients, totaling 100 slides of OP, segmented into training, validation, and testing sets to reflect real-world application scenarios. Our approach utilizes a modified multi-head self-attention mechanism combined with ResUNet for semantic segmentation, enhanced by superpixel concepts. This method facilitates the generation of representative token features through iterative super-token blocks, creating high-resolution token maps that leverage local and high-level feature information for improved accuracy. Benefiting from token features and distribution for enhanced texture alignment with fewer false-positives, the super-token transformer (STT) model achieved a mean intersection over union (mIOU) of 72.42%, with a sensitivity of 47.81%, specificity of 99.83%, positive predictive value of 64.03%, and negative predictive value of 99.94%, highlighting superior efficacy in Masson body segmentation in complex cross-tissue analyses. Our team developed an iterative learning model based on the STT approach, emphasizing token features of super token, including texture and distribution, that enable enhanced alignment with the unique textures of Masson bodies to improve sensitivity and mIOU, The development of this STT model presents a significant advancement in the field of medical image analysis for OP that offers a promising avenue for improving diagnostic precision and patient outcomes in pulmonary pathology.