The virtual scale endoscope (VSE) helps endoscopists measure colorectal polyp size more accurately than visual assessment (VA). However, previous studies were not adequately powered to evaluate polyp sizing at clinically relevant size thresholds or relative accuracy within size subgroups. We created 64 artificial polyps of varied sizes and Paris class morphologies in a colon model. Six endoscopists blinded to true size measured the polyps, which were randomly assigned 1:1 to measurement with VSE or VA, yielding 383 measurement datapoints. We added data from two previous trials (480 measurement datapoints). We evaluated correct classification of polyps into size groups at the 3 mm, 5 mm, 10 mm, and 20 mm thresholds, and relative size measurement accuracy for diminutive polyps (≤5 mm), small polyps (5-9 mm), large polyps (10-19 mm), and polyps ≥20 mm. VSE had significantly fewer size group misclassifications than VA at the 5 mm and 10 mm thresholds (28 percent vs. 45 percent, P = 0.0159 and 26 percent vs. 44 percent, P = 0.0135, respectively). At the 3 mm and 20 mm thresholds, VSE also had fewer misclassifications, but the differences were not statistically significant (36 percent vs. 46 percent, P = 0.3853 and 38 percent vs. 41 percent, P = 0.2705, respectively). Relative size measurement accuracy was significantly higher for VSE than for VA in all size subgroups (diminutive, P < 0.01; small, P < 0.01; 10-19 mm, P < 0.01; ≥20 mm, P < 0.01). VSE outperforms VA in categorizing polyps into size groups at the clinically relevant size thresholds of 5 mm and 10 mm, and it yields significantly higher relative measurement accuracy in all size subgroups.
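
The two outcome measures described above can be illustrated with a minimal Python sketch. The abstract does not give the exact definitions, so this assumes that a polyp is misclassified when its true and measured sizes fall on opposite sides of a threshold (with the boundary value counted in the upper group), and that relative accuracy is one minus the absolute error divided by the true size. The function names and the sample measurements are hypothetical and are not taken from the study data.

```python
def is_misclassified(true_mm, measured_mm, threshold_mm):
    """Return True if the measurement places the polyp on the wrong side
    of the threshold. Assumption: sizes >= threshold form the upper group."""
    return (true_mm >= threshold_mm) != (measured_mm >= threshold_mm)


def misclassification_rate(pairs, threshold_mm):
    """Fraction of (true, measured) pairs misclassified at the threshold."""
    wrong = sum(is_misclassified(t, m, threshold_mm) for t, m in pairs)
    return wrong / len(pairs)


def relative_accuracy(true_mm, measured_mm):
    """Assumed definition: 1 - |measured - true| / true."""
    return 1.0 - abs(measured_mm - true_mm) / true_mm


# Hypothetical (true size, measured size) pairs in mm, for illustration only.
pairs = [(4.0, 5.0), (6.0, 6.0), (9.0, 10.0), (12.0, 11.0)]

for threshold in (3.0, 5.0, 10.0, 20.0):
    rate = misclassification_rate(pairs, threshold)
    print(f"{threshold:>4} mm threshold: {rate:.0%} misclassified")

for true_mm, measured_mm in pairs:
    acc = relative_accuracy(true_mm, measured_mm)
    print(f"true {true_mm} mm, measured {measured_mm} mm: accuracy {acc:.2f}")
```

Under these assumptions, a 1 mm error counts against a diminutive polyp far more than against a large one, which is why accuracy is reported per size subgroup rather than pooled.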