To avoid collisions, pedestrians intending to cross a road need to accurately estimate the time-to-collision (TTC) of an approaching vehicle. For TTC estimation, auditory information can be considered particularly relevant when the approaching vehicle accelerates. The sound of vehicles with internal combustion engine (ICEVs) provides characteristic auditory information about the acceleration state (increasing rotational speed and engine load). However, for electric vehicles (EVs), the acoustic signature during acceleration is less salient. Although the auditory detection of EVs has been studied extensively, there is no research on potential effects of the altered acoustic signature of EVs on TTC estimation. To close this gap, we compared TTC estimates for ICEVs and for EVs with and without activated acoustic vehicle alerting system (AVAS). We implemented a novel interactive audiovisual virtual-reality system for studying the human perception of approaching vehicles. Using acoustic recordings of real vehicles as source signals, the dynamic spatial sound field corresponding to a vehicle approaching in an urban setting is generated based on physical modeling of the sound propagation between vehicle and pedestrian (listener) and is presented via sound field synthesis (higher-order Ambisonics). In addition to the auditory simulations, the scene was visually presented on a head-mounted display with head tracking. Participants estimated the TTC of vehicles that either approached at a constant speed or accelerated positively. In conditions with constant speed, TTC estimates for EVs with and without AVAS were similar to those for ICEVs. In contrast, for accelerating vehicles, there was a substantial effect of the vehicle type on the TTC estimates. For the EVs, the mean TTC estimates showed a significant overestimation. Thus, subjects on average perceived the time of arrival of the EV at their position as longer than it actually was. The extent of overestimation increased with acceleration and presented TTC. This pattern is similar to a first-order TTC estimation representing a failure to consider the acceleration, which is consistently reported in the literature for visual-only presentations of accelerating objects. In comparison, the overestimation of TTC was largely reduced for the accelerating ICEVs. The AVAS somewhat improved the TTC estimates for the accelerating EVs, but without reaching the same level of accuracy as for the ICEVs. In real traffic scenarios, overestimations of the TTC of approaching vehicles might lead to risky road-crossing decisions. Therefore, our finding that pedestrians are significantly less able to use the acoustic information emitted by accelerating EVs for their TTC judgments, compared to accelerating ICEVs, has important implications for road safety and for the design of AVAS technologies.