Gene Golub Stanford University

Abstract:

The standard least-squares problem is to minimize tex2html_wrap_inline77 , where A is an tex2html_wrap_inline81 matrix, with m greater than n, b is an m-dimensional vector, and tex2html_wrap_inline89 indicates the Euclidean norm of the vector. Frequently in practice, the data A and b are not known exactly. The TLS algorithm and regularization-based methods are popular schemes for handling such uncertainties.

We propose several new formulations of least-squares type problems which explicitly include bounds on the uncertainties and which regularize the solution. We also provide efficient algorithms for their solution.

The direct solution of the least squares problem may lead to a vector x that is severely contaminated with noise. Tikhonov regularization addresses this problem by solving instead the linear least squares problem

displaymath97

The solution  tex2html_wrap_inline99 satisfies the equation

displaymath101

The choice of an appropriate regularization parameter  tex2html_wrap_inline103 is crucial, and many methods have been proposed for this purpose. One method handles the case where the norm of the noise vector e is known:

Morozov's discrepancy principle
The value of  tex2html_wrap_inline103 is chosen such that the norm of the residual  tex2html_wrap_inline109 equals the norm of the error term:

displaymath111

Generalized cross-validation
The value of  tex2html_wrap_inline103 is computed as the global minimizer of

eqnarray35

Another formulation is the following min-max problem

displaymath115

which can be shown to be equivalent to

displaymath117

Its solution can be written in the form

displaymath119

where tex2html_wrap_inline103 is a non-negative real number obtained as the root of a certain secular equation. We have efficient and reliable numerical algorithms for finding tex2html_wrap_inline103 , which in fact can be interpreted as an automatic regularization parameter that minimises the worst-case residual.

For large scale problems iterative methods become necessary. We will discuss how the Lanczos algorithm can be used to approximate the functions  tex2html_wrap_inline125 . In most cases we can compute lower and upper bounds on  tex2html_wrap_inline125 , and these bounds become tighter as the number of Lanczos iterations increases. In the case of GCV a stochastic trace estimator is used to approximate the denominator in  tex2html_wrap_inline129 .

This talk represents joint work with S. Chandrasekaran, M. Gu, A. H. Sayed and U. von Matt.

[Gene Golub, Department of Computer Science, Stanford University, Stanford, CA 94305-2140, golub@na-net.ornl.gov]



Grace Wahba
Wed Apr 22 14:09:47 CDT 1998