Pulmonary sarcoidosis is a multisystem granulomatous interstitial lung disease (ILD) with a variable presentation and prognosis. The early accurate detection of pulmonary sarcoidosis may prevent progression to pulmonary fibrosis, a serious and potentially life-threatening form of the disease. However, the lack of a gold-standard diagnostic test and specific radiographic findings poses challenges in diagnosing pulmonary sarcoidosis. Chest computed tomography (CT) imaging is commonly used but requires expert, chest-trained radiologists to differentiate pulmonary sarcoidosis from lung malignancies, infections, and other ILDs. In this work, we develop a multichannel, CT and radiomics-guided ensemble network (RadCT-CNNViT) with visual explainability for pulmonary sarcoidosis vs. lung cancer (LCa) classification using chest CT images. We leverage CT and hand-crafted radiomics features as input channels, and a 3D convolutional neural network (CNN) and vision transformer (ViT) ensemble network for feature extraction and fusion before a classification head. The 3D CNN sub-network captures the localized spatial information of lesions, while the ViT sub-network captures long-range, global dependencies between features. Through multichannel input and feature fusion, our model achieves the highest performance with accuracy, sensitivity, specificity, precision, F1-score, and combined AUC of 0.93 ± 0.04, 0.94 ± 0.04, 0.93 ± 0.08, 0.95 ± 0.05, 0.94 ± 0.04, and 0.97, respectively, in a five-fold cross-validation study with pulmonary sarcoidosis (n = 126) and LCa (n = 93) cases. A detailed ablation study showing the impact of CNN + ViT compared to CNN or ViT alone, and CT + radiomics input, compared to CT or radiomics alone, is also presented in this work. Overall, the AI model developed in this work offers promising potential for triaging the pulmonary sarcoidosis patients for timely diagnosis and treatment from chest CT.