Although similarity could improve visual working memory (VWM) performance, it remains unclear how the spatial configuration of visual information influences the similarity effect in VWM. We explored this question by manipulating the orderliness of spatial configuration (ordered vs. scrambled) in the simultaneous (Experiment 1) and sequential (Experiment 2) change detection tasks. The results showed that similarity improved VWM performance when memory items were presented simultaneously and sequentially. For the simultaneous memory array containing similar and dissimilar items, the performance of the ordered spatial configuration was better than that of the scrambled spatial configuration when probing dissimilar items, while no such difference was found when probing similar items. Further, the similarity effect value in the scrambled spatial configuration was higher than that in the ordered spatial configuration. For the sequential memory array containing similar and dissimilar items, spatial configuration did not affect the similarity effect in VWM. Taken together, these findings suggest that spatial configuration could modulate the similarity effect when memory items are presented simultaneously, in which the ordered spatial configuration protects representations of dissimilar items and reduces the similarity effect in VWM. Our study provides additional evidence for the role of spatial configuration in the similarity effect in VWM, and supports the hierarchical model.