Breast cancer (BC) is the most frequently occurring cancer in women after lung cancer. Among the different stages, invasive ductal BC causes the most deaths in women. In this work, three deep learning (DL) models, namely Vision Transformer (ViT), ConvMixer, and Visual Geometry Group-19 (VGG-19), are implemented for the detection and classification of breast cancer tumors using the Breast Cancer Histopathological (BreakHis) image database. The performance of each model is evaluated using an 80:20 training scheme and measured in terms of accuracy, precision, recall, loss, F1-score, and area under the curve (AUC). In the simulation results, ViT showed the best performance for binary classification of breast cancer tumors, with accuracy, precision, recall, and F1-score of 99.89 %, 98.29 %, 98.29 %, and 98.29 %, respectively. ViT also showed the best performance for eight-class classification, with an accuracy of 98.21 %, average precision of 89.84 %, average recall of 89.97 %, and average F1-score of 88.75 %. Moreover, we built an ensemble of the ViT and ConvMixer models and observed that it performed worse than the ViT model alone. We also compared the performance of the best proposed model with existing models reported by other research groups. The study will help identify suitable models for increasing accuracy in the early diagnosis of BC. We hope it will also help minimize human error in the early diagnosis of this fatal disease and support the administration of appropriate treatment. The proposed model may also be applied to the detection of other diseases with improved accuracy.
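
To illustrate how the reported accuracy, precision, recall, and F1-score relate to one another, the following minimal sketch computes them from lists of true and predicted labels. It is not the evaluation code used in this work: the function names and the toy labels are hypothetical, and it assumes that the "average" values for the multi-class case are macro averages (unweighted means over classes), which the text does not specify. Loss and AUC are omitted because they require the models' predicted probabilities.

```python
def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label equals the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def per_class_metrics(y_true, y_pred, labels):
    """Return {label: (precision, recall, f1)} using one-vs-rest counts."""
    pairs = list(zip(y_true, y_pred))
    metrics = {}
    for c in labels:
        tp = sum(1 for t, p in pairs if t == c and p == c)  # true positives
        fp = sum(1 for t, p in pairs if t != c and p == c)  # false positives
        fn = sum(1 for t, p in pairs if t == c and p != c)  # false negatives
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        metrics[c] = (precision, recall, f1)
    return metrics


def macro_average(metrics):
    """Unweighted mean of (precision, recall, f1) over all classes."""
    n = len(metrics)
    return tuple(sum(m[i] for m in metrics.values()) / n for i in range(3))


# Toy labels for illustration only; these are NOT BreakHis results.
y_true = ["benign", "benign", "malignant", "malignant", "malignant"]
y_pred = ["benign", "malignant", "malignant", "malignant", "benign"]

acc = accuracy(y_true, y_pred)
per_class = per_class_metrics(y_true, y_pred, ["benign", "malignant"])
macro_p, macro_r, macro_f1 = macro_average(per_class)
print(f"accuracy={acc:.4f} precision={macro_p:.4f} "
      f"recall={macro_r:.4f} f1={macro_f1:.4f}")
```

Under macro averaging, the average F1-score is the mean of the per-class F1-scores rather than the harmonic mean of the averaged precision and recall, so it can fall below both averages, as it does in the eight-class figures reported above.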