Abstract
This paper presents a more detailed analysis than has previously appeared of a class of minimization algorithms that includes the DFP (Davidon-Fletcher-Powell) method as a special case. Only quadratic functions are considered, but particular attention is paid to the magnitude of successive errors and their dependence upon the initial matrix. On this basis a possible explanation of some of the observed characteristics of the class is tentatively suggested.

Introduction

Probably the best-known algorithm for determining the unconstrained minimum of a function of many variables, where explicit expressions are available for the first partial derivatives, is that of Davidon (1959) as modified by Fletcher & Powell (1963). This algorithm has many virtues. It is simple and does not require at any stage the solution of linear equations. It minimizes a quadratic function exactly in a finite number of steps, and this property makes convergence of the algorithm rapid, when applied to more general functions, in the neighbourhood of the solution. It is, at least in theory, stable, since the iteration matrix H_i, which transforms the ith gradient into the ith step direction, may be shown to be positive definite.

In practice the algorithm has been generally successful, but it has exhibited some puzzling behaviour. Broyden (1967) noted that H_i does not always remain positive definite, and attributed this to rounding errors. Pearson (1968) found that for some problems the solution was obtained more efficiently if H_i was reset to a positive definite matrix, often the unit matrix, at intervals during the computation. Bard (1968) noted that H_i could become singular, attributed this to rounding error, and suggested the use of suitably chosen scaling factors as a remedy.

In this paper we analyse the more general algorithm given by Broyden (1967), of which the DFP algorithm is a special case, and determine how, for quadratic functions, the choice of an arbitrary parameter affects convergence. We investigate how the successive errors depend, again for quadratic functions, upon the initial choice of iteration matrix, paying particular attention to the cases where this is either the unit matrix or a good approximation to the inverse Hessian. We finally give a tentative explanation of some of the observed experimental behaviour in the case where the function to be minimized is not quadratic.
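To make the class concrete, the following is a minimal numerical sketch, not taken from the paper and using one common modern parametrization of the Broyden family rather than the paper's notation: a rank-two quasi-Newton update with a free parameter phi applied to a quadratic function, where phi = 0 recovers the DFP update. The function name broyden_class_minimize, the parameter phi, and the quadratic test problem are all illustrative assumptions.

```python
import numpy as np

def broyden_class_minimize(A, b, x0, H0, phi=0.0, tol=1e-10, max_iter=50):
    """Minimize f(x) = 0.5 x'Ax - b'x (A symmetric positive definite)
    with a Broyden-class quasi-Newton update; phi = 0 gives DFP.
    Illustrative sketch only, not the paper's algorithm or notation."""
    x = x0.astype(float)
    H = H0.astype(float)
    g = A @ x - b                       # gradient of the quadratic
    for _ in range(max_iter):
        if np.linalg.norm(g) < tol:
            break
        d = -H @ g                      # H maps the gradient to the step direction
        # exact line search along d for a quadratic: alpha = -g.d / d.A.d
        alpha = -(g @ d) / (d @ A @ d)
        s = alpha * d                   # step taken
        x = x + s
        g_new = A @ x - b
        y = g_new - g                   # change in gradient
        sy = s @ y
        Hy = H @ y
        yHy = y @ Hy
        # DFP rank-two update
        H = H + np.outer(s, s) / sy - np.outer(Hy, Hy) / yHy
        # one-parameter correction term: phi = 0 recovers DFP
        v = s / sy - Hy / yHy
        H = H + phi * yHy * np.outer(v, v)
        g = g_new
    return x, H

# Example: with H0 the unit matrix, exact line searches on an n-variable
# quadratic reach the minimum in at most n steps, and in exact arithmetic
# the final H equals the inverse Hessian A^{-1}.
A = np.array([[3.0, 1.0], [1.0, 2.0]])
b = np.array([1.0, 1.0])
x, H = broyden_class_minimize(A, b, x0=np.zeros(2), H0=np.eye(2))
```

The choice of phi in such a sketch plays the role of the arbitrary parameter in Broyden's (1967) class, and the choice of H0 corresponds to the initial iteration matrix whose influence on the successive errors is the subject of the analysis.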