Modulation of visual attention in the Visual World Paradigm relies on parallel processing of linguistic and visual information. Previous studies have argued that the human linguistic capacity includes an aspect of anticipation of upcoming material. Such anticipation can be triggered by both lexical and grammatical/morphosyntactic cues. In this study, we investigated the relationship between comprehension and prediction by testing how subtle changes in visual representations can affect the processing of grammatical case cues in Russian by Russian-German bilingual children (n = 49, age 8-13). The linguistic manipulation followed previous designs, contrasting SVO and OVS sentences, where the first NP (NP1) was marked with nominative or accusative case, respectively. Three types of visual displays were compared: (i) individual referents (potential agent/theme); (ii) pairs of referents (NP1 + potential agent/theme); and (iii) events (representing interactions between the referents). Participants were significantly more sensitive to the case manipulation when presented with events compared to the other two types of visual display. This suggests that they were able to quickly integrate the thematic role information signaled by grammatical case in the event representations. However, they were less likely to use the case information to anticipate upcoming arguments when the target pictures represented individual referents or pairs of noninteracting referents. We hypothesize that the process of argument anticipation is mediated by the activation of syntactic templates (SVO or OSV, depending on the case marking on NP1). The relatively weak anticipation effect observed may be attributed to the absence, or weak representation, of the noncanonical OVS template in the bilingual children's long-term memory.