Stereotactic radiosurgery (SRS) is a widely employed strategy for intracranial metastases, utilizing linear accelerators and volumetric modulated arc therapy (VMAT). Ensuring precise linear accelerator performance is crucial, given the small planning target volume (PTV) margins. Rapid dose falloff is vital to minimize brain radiation necrosis. Despite advances in SRS planning, tools for end-to-end testing of SRS treatments are lacking, hindering confidence in the procedure. This study introduces a novel end-to-end three-dimensional (3D) anthropomorphic dosimetry system for characterization of a radiosurgery platform, aiming to measure planning metrics, dose gradient index (DGI), brain volumes receiving at least 10 and 12Gy (V10, V12), as well as assess delivery uncertainties in multitarget treatments. The study also compares metrics from benchmark plans to enhance understanding and confidence in SRS treatments. The developed anthropomorphic 3D dosimetry system includes a modified Stereotactic End-to-End Verification (STEEV) phantom with a customized insert integrating 3D dosimeters and a fiber optic CT scanner. Labview and MATLAB programs handle optical scanning, image preprocessing, and dosimetric analysis. SlicerRT is used for 3D dose visualization and analysis. A film stack insert was used to validate the 3D dosimeter measurements at specific slices. Benchmark plans were developed and measured to investigate off-axis errors, dose spillage, small field dosimetry, and multi-target delivery. The accuracy of the developed 3D dosimetry system was rigorously assessed using radiochromic films. Two two-dimensional (2D) dose planes, extracted from the 3D dose distribution, were compared with film measurements, resulting in high passing rates of 99.9% and 99.6% in gamma tests. The mean relative dose difference between film and 3D dosimeter measurements was -1%, with a standard deviation of 2.2%, well within dosimeter uncertainties. In the first module, evaluating single-isocenter multitarget treatments, a 1.5mm dose distribution shift was observed when targets were 7cm off-axis. This shift was attributed to machine mechanical errors and image-guided system uncertainties, indicating potential limitations in conventional gamma tests. The second module investigated discrepancies in intermediate-to-low dose spillage, revealing higher measured doses in the connecting region between closely positioned targets. This discrepancy was linked to uncertainties in treatment planning system (TPS) modeling of out-of-field dose and multileaf collimator (MLC) characteristics, resulting in lower DGI values and higher V10 and V12 values compared to TPS calculations. In the third module, irradiating multiple targets showed consistent V10 and V12 values within 1 cm3 agreement with dose calculations. However, lower DGI values from measurements compared to calculations suggested intricacies in the treatment process. Conducting vital end-to-end testing demonstrated the anthropomorphic 3D dosimetry system's capacity to assess overall treatment uncertainty, offering a valuable tool for enhancing treatment accuracy in radiosurgery platforms. The study introduces a novel anthropomorphic 3D dosimetry system for end-to-end testing of a radiosurgery platform. The system effectively measures plan quality metrics, captures mechanical errors, and visualizes dose discrepancies in 3D space. The comprehensive evaluation capability enhances confidence in the commissioning and verification process, ensuring patient safety. The system is recommended for commissioning new radiosurgery platforms and remote auditing of existing programs.