An Advanced Conjugate Gradient Training Algorithm Based on a Modified Secant Equation

Ioannis E Livieris,Panagiotis Pintelas

doi:10.5402/2012/486361

Ioannis E Livieris, Panagiotis Pintelas

Open Access

PDF Available

https://doi.org/10.5402/2012/486361

Copy DOI

Export

Save

Cite

Journal: ISRN Artificial Intelligence	Publication Date: Dec 8, 2011
Citations: 6	License type: CC BY 3.0

Affiliation: University of Patras

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

Conjugate gradient methods constitute excellent neural network training methods characterized by their simplicity, numerical efficiency, and their very low memory requirements. In this paper, we propose a conjugate gradient neural network training algorithm which guarantees sufficient descent using any line search, avoiding thereby the usually inefficient restarts. Moreover, it achieves a high-order accuracy in approximating the second-order curvature information of the error surface by utilizing the modified secant condition proposed by Li et al. (2007). Under mild conditions, we establish that the proposed method is globally convergent for general functions under the strong Wolfe conditions. Experimental results provide evidence that our proposed method is preferable and in general superior to the classical conjugate gradient methods and has a potential to significantly enhance the computational efficiency and robustness of the training process.

Full Text