The prediction of system responses for a given fatigue test bench drive signal is a challenging problem, since highly dynamic loads from measurement campaigns must be reproduced accurately. Linear frequency response function models are commonly used for this system identification task, but energy intensive experimental iterations are required to account for system non-linearities. Two novel hybrid modeling strategies are suggested, which augment existing approaches using non-linear Long Short-Term Memory networks. These are trained and deployed on short subsequences of measurement data and recombined using a windowing technique, which enables their application to measurement data with high sampling rates. In addition to fatigue test bench commissioning, these hybrid models can also be employed in the field of virtual sensing. The approach is tested using non-linear experimental data from a servo-hydraulic test rig and this dataset is made publicly available. A variety of metrics in time and frequency domains, as well as fatigue strength under variable amplitudes, are employed in the evaluation. It is shown, that hybrid models can successfully use frequency response function models as a linear baseline estimate, which is further improved by Long Short-Term Memory networks to enable non-linear predictions.