Previous: 3.1 Importance of assumptions
Up: 3. Improving
Next: 3.3 Response surfaces
  Contents
Subsections
Computing an affine regression line is equivalent to playing the following
game. Alice knows the values of two secret coefficients
and Bob doesn't. Alice discards some pairs of numbers
.
The
are exactly defined and the
are (secretely)
computed as
where the
are independent identically distributed random variables, with mean
0 and variance
.
What Bob must do is finding an estimate of
in the
form of a confidence interval around a "most probable value".
The well known solution is to determine the pair
that minimizes :
Using notations of Theorem 1.3.19, we have :
and therefore :
The key point here is the assumption that the reality (not alone the
model) is affine, and the errors are iid. In all what follows the
same assumptions are ever done : the measures are
the sum of an iid random term and a linear combination of exactly
known terms (whatever complicated and non linear these terms can be
relative to the exactly known
variables).
Proof.

is a quadratic form over the

and it reduces,
by the very definitions, into a quadratic form with

terms.
In the special case where

, this formula is "more true
than ever" : this

indeterminate result describes exactly
what is known about the uncertainties : nothing.
Theorem 3.2.3
In the general case, the variance of the estimate
computed
at point
using the least squares formula is :
Proof.
From the assumptions done, the coefficients

computed according
to (
1.1), and

itself are affine functions
of the iid variables

. Therefore :
and it is immediate that column

equals

.
Remark 3.2.4
The preceding results were obtained without any hypothesis on the
distribution of the discrepancies, apart from being centered iid random
variables. However, it is essential that the

are exactly known.
Remark 3.2.5
Assuming further that noise is normal, it can be seen that quantity

follows a Student-Fischer
law with

degrees of freedom. This allows the computation of
confidence intervals when

is small.
Previous: 3.1 Importance of assumptions
Up: 3. Improving
Next: 3.3 Response surfaces
  Contents
douillet@ensait.fr
2008-03-13