The impact of non-linear soil behavior on site response may be described by the non-linear to linear site response ratio RSRNL introduced in Régnier et al. (2013). This ratio most often exhibits a typical shape with an amplitude above one below a site-specific frequency fNL, and an amplitude below one beyond fNL. This paper presents an investigation of the correlation between this RSRNL ratio and various parameters used to characterize the site (Site Condition Proxies: SCPs) and the seismic loading level (Ground Motion Intensity Measures: GMIMs).The data used in this analysis come from sites of the Japanese Kiban–Kyoshin (KiK-net) network, for which the nonlinear to linear site-response ratio (RSRNL) is obtained by comparing the surface/down-hole Fourier spectral ratio for strong events and for weak events. The five SCPs are VS30, the minimum velocity of the soil profile (Vsmin), an index of the velocity gradient over the top 30 m (B30), the fundamental frequency f0HV, as measured from the H/V earthquake ratio, and the corresponding amplitude A0HV. The seven GMIMs are PGA, PGV, PGV/VS30 (peak strain proxy), IA (Arias Intensity), CAV (Cumulative Absolute Velocity), arms (Root Mean Square Acceleration) and Trifunac-Brady Duration (DT). The original data set consists of a total of 2927 RSRNL derived from KiK-net recordings at 132 sites. To assign an equal weight to each site, and to avoid any bias linked to sites with many recordings, for each GMIM, this original data set is grouped in 15 different intervals corresponding to fixed fractiles of the statistical distribution of the considered GMIM (every 10% from F10 to F50, and every 5% from F55 to F100). In each group, the average RSRNL-GM for each site is computed. For each of these seven advanced data sets, a neural network approach is used to predict the behavior of RSRNL-GM as a function of the corresponding GMIM, and one or two SCPs. The performance of each model is quantified through the average variance reduction coefficient μ(Rc) in a fixed frequency range. This sensitivity study is performed in the normalized frequency (f/fNL) domain to identify the best combinations (GMIM, SCPs) providing the largest variance reduction, and then in the absolute frequency domain for the final optimal combination. The optimal combinations [GMIM, two-SCPs] are triplets [PGV/VS30, VS30-f0HV; μ(Rc) = 18.6%], [PGV/VS30, VS30-A0HV; μ(Rc) = 18.16%], [PGV, VS30-f0HV; μ(Rc) = 17.3%] and [PGA, B30-A0HV; μ(Rc) = 17.2%]. The final absolute frequency model with the best triplet makes it possible to predict the non-linear response of a given site knowing its linear, weak-motion response, and two site proxy parameters, for wide ranges of the considered ground motion parameters.