A comparison of 4 metal mixture toxicity models (that were based on the biotic ligand model [BLM] and the Windermere humic aqueous model using the toxicity function [WHAM-FTOX ]) was presented in a previous paper. In the present study, a streamlined version of the 4 models was developed and applied to multiple data sets and test conditions to examine key assumptions and calibration strategies that are crucial in modeling metal mixture toxicity. Results show that 1) a single binding site on or in the organism was a useful and oftentimes sufficient framework for predicting metal toxicity; 2) a linear free energy relationship (LFER) for bidentate binding of metals and cations to the biotic ligand provided a good first estimate of binding coefficients; 3) although adjustments in metal binding coefficients or adjustments in chemical potency factors can both be used in model calibration for single-metal exposures, changing metal binding coefficients or chemical potency factors had different effects on model predictions for metal mixtures; and 4) selection of a mixture toxicity model (based on concentration addition or independent action) was important in predicting metal mixture toxicity. Moving forward, efforts should focus on reducing uncertainties in model calibration, including development of better methods to characterize metal binding to toxicologically active binding sites, conducting targeted exposure studies to advance the understanding of metal mixture toxicity, and further developing LFERs and other tools to help constrain the model calibration.