For the purpose of sediment quality assessment, the prediction of toxicity risk-levels for aquatic organisms based on simple environmental measurements is desirable. One commonly used approach is the comparison of total contaminant concentrations with corresponding water and sediment quality guideline values, serving as a Line of Evidence (LoE) based on chemistry-toxicity effects relationships. However, the accuracy of toxicity predictions can be improved by considering the factors that modify contaminant bioavailability. In this study we used paired chemistry-ecotoxicity data sets for sediments to evaluate the improvement in toxicity risk predictions using bioavailability-modified guidelines. The sediments were predominantly contaminated with metals, and measurements of sediment particle size, total organic carbon (TOC) and acid volatile sulfide (AVS) were used to modify hazard quotients (HQ). To further assess the predictive efficacy of the bioavailability-modified guideline models, sediments with differing contamination levels were tested for toxicity to a benthic amphipod's reproduction. To account for differences between laboratory exposure and field exposure scenarios, where the latter creates greater dilution, both static-renewal and flow-through test procedures were employed, and flow-through resulted in lower dissolved metal concentrations in the overlying waters. We also investigated how lower AVS concentration by oxidation modified the toxicity. This study reaffirmed that consideration of factors that influence contaminant bioavailability improves toxicity risk predictions, however the improvements may be modest. The sediment particle size data had the greatest influence on the modified HQ, indicating that higher percentage of fine particle size (<63 μm) contributed most to a lower predicted toxicity. The comparison of the static-renewal and flow-through test results continue to raise important questions about the relevance of static or static-renewal toxicity test results for risk assessment decisions, as both these test designs may cause unrealistically high contributions of dissolved metals in overlying waters to toxicity. Overall, this study underscores the value of incorporating outcomes from simple and routine sediment analysis (e.g., particle size, TOC, and consideration of AVS) to enhance the predictive efficacy of toxicity risk assessments in the context of sediment quality risk assessment.