Over the last decade, the global fight against climate change through electrification has led to an increase in research on building heating, ventilation, and air conditioning (HVAC) systems that utilize intelligent control algorithms to provide demand-side grid service while also maintaining the thermal comfort of building occupants. As the pivotal point between building electricity consumption and indoor thermal comfort, high-efficiency electrical vapor-compression heat pumps are at the center of these emerging studies, and various grid-interactive and occupant-comfort control algorithms have been developed for them. The impact of these algorithms on heat pump operation and performance when subjected to different weather conditions, building loads, and grid requests calls for investigation and verification via experimental testing with actual heat pumps integrated with real-time building and grid responses. This study introduces a Water-Source Heat Pump (WSHP) Hardware-in-The-Loop (HIL) Test Facility that is the first of its kind. This testbed utilizes a 2-ton variable speed water-to-air heat pump that is capable of interacting with a virtual environment currently comprised of an EnergyPlus (E+) building simulation, an agent-based occupant behavioral model, and a single U-tube ground-loop heat exchanger (GLHE) model. Detailed descriptions of the testbed’s physical design and operation, virtual environement, as well as their mutual communication is provided. An uncertainty analysis is also performed under manufacturer specified heating and cooling design conditions. This analysis shows that the total load across the WSHP’s demand side heat exchanger, i.e., the sum of its latent and sensible components, can be measured with a relative uncertainty of ± 10.4% and ± 3.6% in cooling and heating mode respectively. The WSHP’s coefficient of performance (COP) can be measured with relative uncertainties of ± 10.4% in cooling mode, and ± 3.7% in heating mode. A preliminary 24-h experimental demonstration is then performed utilizing the DOE prototype small commercial office building model in E+. The simulation takes place in Atlanta, GA on the date of 08/26/15 from 12:00 AM to 11:59 PM using TMY3 weather data. . The results from this demonstration show that over the course of this experiment the simulated outputs of zone dry-bulb temperature, zone humidity ratio, and WSHP inlet water temperature can be tracked by testbed emulators up to a root mean squared error (RMSE) of ± 0.27 °C, ± 0.376 g/kg, and ± 0.85 °C respectively. The WSHP’s dynamic behavioral characteristics and performance are also captured, and correspond well with the authors’ previous understanding of heat pump efficiency as a function of evaporator and condenser fluid inlet conditions respectively.