In-vivo non-invasive verification of endolymphatic hydrops (ELH) by means of intravenous delayed gadolinium (Gd) enhanced magnetic resonance imaging of the inner ear (iMRI) is rapidly developing into a standard clinical tool to investigate peripheral vestibulo-cochlear syndromes. In this context, methodological comparative studies providing standardization and comparability between labs seem even more important, but so far very few are available. One hundred eight participants [75 patients with Meniere's disease (MD; 55.2 ± 14.9 years) and 33 vestibular healthy controls (HC; 46.4 ± 15.6 years)] were examined. The aim was to understand (i) how variations in acquisition protocols influence endolymphatic space (ELS) MR-signals; (ii) how ELS quantification methods correlate to each other or clinical data; and finally, (iii) how ELS extent influences MR-signals. Diagnostics included neuro-otological assessment, video-oculography during caloric stimulation, head-impulse test, audiometry, and iMRI. Data analysis provided semi-quantitative (SQ) visual grading and automatic algorithmic quantitative segmentation of ELS area [2D, mm2] and volume [3D, mm3] using deep learning-based segmentation and volumetric local thresholding. Within the range of 0.1–0.2 mmol/kg Gd dosage and a 4 h ± 30 min time delay, SQ grading and 2D- or 3D-quantifications were independent of signal intensity (SI) and signal-to-noise ratio (SNR; FWE corrected, p < 0.05). The ELS quantification methods used were highly reproducible across raters or thresholds and correlated strongly (0.3–0.8). However, 3D-quantifications showed the least variability. Asymmetry indices and normalized ELH proved the most useful for predicting quantitative clinical data. ELH size influenced SI (cochlear basal turn p < 0.001), but not SNR. SI could not predict the presence of ELH. In conclusion, (1) Gd dosage of 0.1–0.2 mmol/kg after 4 h ± 30 min time delay suffices for ELS quantification. (2) A consensus is needed on a clinical SQ grading classification including a standardized level of evaluation reconstructed to anatomical fixpoints. (3) 3D-quantification methods of the ELS are best suited for correlations with clinical variables and should include both ears and ELS values reported relative or normalized to size. (4) The presence of ELH increases signal intensity in the basal cochlear turn weakly, but cannot predict the presence of ELH.