The aim of this work is to design a fractional delay second order Volterra filter that takes a discrete time sequence as input and its output is as close as possible to the output of a given nonlinear unknown system which may have higher degree nonlinearities in the least square sense. The basic reason for such a design is that rather than including higher than second degree nonlinearities in the designed system, we use the fractional delay degrees of freedom to approximate the given system. The advantage is in terms of obtaining a better approximation of the given nonlinear system than is possible by using only integer delays ( since we are giving more degrees of freedom via the fractional delays ) and simultaneously it does not require to incorporate higher degree nonlinearities than two. This work hinges around the fact that if the input signal is a decimated version of another signal by a factor of M, then fractional delays can be regarded as delays by integers less than M. Using the well known formula for calculating the discrete time Fourier transform ( TFT ) of a decimated signal, we then arrive at an expression for the DTFT of the output of a fractional delay system in terms of the unknown first and second order Volterra system coefficients and the fractional delays. The final energy function to be minimized is the norm square of the difference between the DTFT of the given output and the DTFT of the output of the fractional delay system. Minimization over the filter coefficients is a linear problem and thus the final problem is to minimize a highly nonlinear function of the fractional delays which is accomplished using search techniques like the gradient-search and nature inspired optimization algorithms. The effectiveness of the proposed method is demonstrated using two nonlinear benchmark systems tested with five different input signals. The accuracy of the stated models using the globally convergent metaheuristic, cuckoo-search algorithm ( CSA ) are observed to be superior when compared with other techniques such as real-coded genetic algorithm ( RGA ), particle swarm optimization ( PSO ) and gradient-search ( GS ) methods. Finally, statistical analysis affirms the potential of the proposed designs for its successful implementation.