A novel parallel approach to run continuous ant colony optimisation CACO algorithm on graphic processing unit GPU is presented in this paper for solving large scale continuous optimisation problem. CACO which is an extension to continuous domains from standard ACO is a kind of population-based meta-heuristics in essence. The mechanism of algorithm is described in detail. Its parallel implementation on compute unified device architecture CUDA is proposed in our work. The experiment results on actual hardware to optimise many-dimensions test functions are given. The results and analyses show the excellent performance of algorithm.