Purpose To study the feasibility of a quality control framework composed of an anthropomorphic breast phantom and a mathematical model observer (MO) for the objective evaluation of detectability of calcification-like signals in system-processed mammographic images. Methods A 3D-printed anthropomorphic breast phantom was imaged with two brands of digital mammography systems (Selenia, Hologic, USA & Amulet Innovality, Fujifilm, Japan) at four different dose levels. The phantom consisted of two transverse slabs, allowing for the insertion of a sheet with calcification-like signals. These signals were 0.25 mm diameter gold discs deposited on aluminum squares of 1 cm width. From the acquired processed and unprocessed images, signal-present and signal-absent regions of interest (ROIs) were extracted. The ROIs were evaluated with a channelized Hotelling observer (CHO) MO using four different formulations of difference-of-Gaussian (DoG) channel-sets, and by three human observers in two-alternative forced-choice experiments. We compared the human and CHO performance in detecting calcification-like discs in the processed and unprocessed ROIs. Detection performance was measured using the proportion of correct responses of the humans ( PC H ) and the CHO ( PC CHO ) . The correlation between PC H and PC CHO was analyzed using a mixed-effect regression model. Correlation results, including the goodness of fit (r 2 ) of PC H and PC CHO for all parameters investigated, were evaluated to determine whether CHO MO can be used to predict human observer performance. Results A strong correlation of the CHO detection performance with the performance of the humans was obtained, with r 2 = 0.93–0.96. However, the slight, but not significant, differences in intercept and slope suggest some system dependence (p > 0.05). The correlation was not considerably affected by the choice of the DoG channel-set. The image processing did not have a significant impact on the PC H or PC CHO results (p > 0.05). Conclusions The CHO with DoG channel-sets can predict human performance on the detection of calcification-like signals under the tested conditions. Therefore, there is potential for the framework to be used for the evaluation of detectability of calcification-like signals in system-processed mammographic images. However, additional systems need to be evaluated to investigate this system-dependency in the correlation between human and the MO, further.