Cardiac computed tomography (CT) exams are some of the most complex CT exams due to the need to carefully time the scan when the heart chambers are near the peak contrast concentration. With current "bolus tracking" and "timing bolus" techniques, after contrast medium is injected, a target vessel or chamber is scanned periodically, and images are reconstructed to monitor the opacification. Both techniques have opportunities for improvement, such as reducing the contrast medium volume, the exam time, the number of manual steps, and improving the robustness of correctly timing the peak opacification. The objective of our study is to (1) develop a novel autonomous cardiac CT clinical workflow to track contrast bolus dynamics directly from pulsed x-ray projections, (2) develop a new five-dimensional virtual cardiac CT data generation tool with programmable cardiac profiles and bolus dynamics, and (3) demonstrate the feasibility of projection-domain prospective bolus tracking using a neural network trained and tested with the virtual data to find the contrast peak. In our proposed workflow, pulsed mode projections (PMPs) are acquired with a wide-open collimator under sparse view conditions (monitoring phase). Each time a new PMP is acquired, the neural network is used to estimate the contrast enhancement inside the target chambers. To train such a network, we introduce a new approach to generate clinically realistic virtual scan data based on a five-dimensional cardiac model, by synthesizing user-defined contrast bolus dynamics and patient electrocardiogram profiles. In this study, we investigated a scenario with one single PMP per rotation. To find the optimal PMP view angle, 20 angles were explored. For each angle, 300 virtual exams were generated from 115 human subject datasets and divided into training, validation, and testing groups. Twenty neural networks were trained and evaluated in total to find the optimal network. Finally, a simple bolus peak time estimation algorithm was developed and evaluated by comparing to the ground truth bolus peak time. To evaluate the accuracy of a bolus time-intensity curve estimated by the network, the cosine similarity between the estimation and the ground truth was computed. The cosine similarity was larger than 0.97 for all projection angles. A view angle corresponding to the x-ray tube at 30 degrees from vertical (left-anterior of subject) showed the lowest errors. The amplitude of the estimated bolus curves (in Hounsfield Units) was not always correctly predicted, but the shape was accurately predicted. This resulted in an RMSE of 1.23s for the left chambers and 0.78s for the right chambers in the contrast peak time estimation. In this study, we proposed an innovative real-time way to predict the contrast bolus peak in cardiac CT as well as an innovative approach to train a neural network using virtual but clinically realistic data. Our trained network successfully estimated the shape of the time-intensity curve for the target chambers, which led to accurate bolus peak time estimation. This technique could be used for autonomous diagnostic cardiac CT to trigger a diagnostic scan for optimal contrast enhancement.