gauss_newton

A generalization of the Gauss-Newton method that solves extended least-squares problems

Abstract

Modeling the mean of a random variable as a function of unknown parameters leads to a nonlinear least-squares objective function. The Gauss-Newton method reduces nonlinear least-squares problems to a sequence of linear least-squares problems and requires only first-order information about the model functions. In a more general heteroscedastic setting, there are also unknown parameters in a model for the variance. This leads to an objective function that is no longer a sum of squares. We present an extension of the Gauss-Newton method that minimizes this objective function by reducing the problem to a sequence of linear least-squares problems. The extension requires only first-order information. This represents a new result because other methods that reduce this problem to a sequence of linear least-squares problems do not necessarily converge.

Paper

gauss_newton.pdf