Previous work on visual short-term memory (VSTM) has encompassed various stimulus attributes including spatial frequency, color, and contrast, revealing specific time courses and a dependence on stimulus parameters. This study investigates visual short-term memory for binocular depth, using dynamic random dot stereograms (DRDS) featuring disparity planes in front of or behind the plane of fixation. In a delayed match-to-sample paradigm, we employed four distinct reference disparities (17.5', 28.8' either crossed or uncrossed) at two contrast levels (20%, 80%), spanning interstimulus intervals (ISI) of up to 4 s. Test stimuli represented a range of equally spaced values centered around the reference disparity of the ongoing trial. In addition, the impact of a memory masking stimulus was also tested in a separate experiment. Accuracy and point of subjective equality (PSE) served as performance markers. The performance, indicated by the accuracy of responses, was better for smaller reference disparities (±17.5') compared to larger ones (±28'), but both deteriorated as a function of ISI. The PSE demonstrated a consistent shift with increasing ISIs, irrespective of the magnitude of the initial disparity, converging gradually toward the range of 20-22' and deviating from the reference disparity. Notably, the influence of masking stimuli on the PSE was more marked when the mask disparity diverged from the reference value. The findings from our study indicate that the retention of absolute disparity in memory is imprecise, it deteriorates with retention time or due to perturbation by dissimilar masking stimuli. As a result, the memory trace is gradually replaced by a default depth value. This value could potentially signify an optimal point within low-level perceptual memory, however, our results are better explained by perceptual averaging whereby the visual system computationally derives a statistical summary of the presented disparities over time. The latter mechanism would aid in the computation of relative disparity in a dynamically changing environment.