The difficulty to face with is that, actually, the experimental points are never aligned, and that for two kinds of reasons. On the one hand, the co-ordinates of each point are submitted to measurement uncertainties. On the other hand, the linear model is only relevant for a limited interval. When stress is too low, the phenomenon is not yet established and when stress is too high, i.e. approaches it's maximum value, the process that will lead to rupture becomes prevalent.
Therefore, our opinion is as follows : the process of measure must
provide the range of relevance as well as the slope itself. Therefore
four quanta of knowledge are to be extracted from
data, and not only one: obviously the slope itself, but also the end
points
,
of the interval of best validity
of the linear model and a quality factor
qualifying the
obtained goodness of fit.
To facilitate comparisons between different materials, this interval
of best fit will be expressed as a percentage of the maximum force,
i.e. as
. This interval will
be obtained by a compromise between two contradictory requirements.
Increasing the size of the interval will minimize the influence of
uncertainties dues to measurements, but will also take into account
zones in which the non-linearity becomes noticeable, increasing uncertainties
due to the model.
A quality factor
has to be defined that summarize this
compromise, and the best
will
be chosen by maximizing
. The
's selected by
our research team, as well as the
selected by [3]
are not everywhere smooth, and may have many local extrema. Therefore,
a sound algorithm of maximization is requested.
We have chosen to subdivide the interval
in 25 parts, leading to 300 ordered pairs
,
after eliminating pairs such that
. This
reduction of the searching space allows to obtain a first guess for
by exhaustion. Thereafter,
a second pass can be done locally (
being smooth near the
absolute maximum).
This searching algorithm has an over-important by-product. Considering
the best values obtained for the slope
(for example, the
top
when sorted according to quality factor), we obtain
a cloud of values around the best fit value. In our opinion, this
cloud indicates values that are not really discernible from each other
and therefore provides a confidence interval around the best fit value.
In any problem of regression, the reference model consists in approximating
the variable
by a constant
. The average quadratic
error for this model is given by
.
It is well-known that :
Carrying now an approximation by a linear function, we are brought
to minimize
by an efficient choice of
. An elementary calculation
leads to:
Since the key point of this process is the decrease of remaining unexplained variance, it is convenient to define the variance reduction factor as :
When applying the process described in the former paragraph to the
traction test, we obviously have to determine the slope
relative
to the raw data, i.e. the lengthening/force pairs. But, to facilitate
comparisons with the values familiar to end users, all numerical results
given in the present paper have been normalized after computation
and expressed in terms of Young's modulus
(and therefore
relative to the deformation/stress pairs).
As previously said, we have chosen to compute
regressions,
corresponding to
choices of the interval
and therefore obtained
estimates for
. Concerning
the first test piece, these estimates are ranging into an interval
such that
, leading
to an amplitude close to
.
But, of course, this amplitude must be broken up into an uncertainty
resulting from the model (caused by nonrelevant choices of the interval
) and an uncertainty from the
measuring instruments (caused by the repercussions of uncertainties
carried by the primary measures). To this end, we can select a given
proportion among all the obtained values, for example one of four,
retaining the top
values, ranked according to the chosen
quality factor.
Using
of EQ. 3 as the quality factor
to determine the bounds
of
the pertinence interval, one obtains the level lines of FIG. 3,
each graph being relative to a test piece, namely
(left),
(center) and
(right).
It can be seen that
presents a more or less marked maximum,
located in the neighborhood of (respectively)
,
of
and of
.
The corresponding values of
are
,
et
.
It has been shown in [1] that
(where
in the number of points in the interval) is a more efficient
choice for the quality factor. In fact, both criterion are not leading
to very different central values for
. But they differ heavily
when used to determine the
best values that define a confidence
interval around this central value.
This behavior can be observed in FIG. 4, that
gives the equipotential surfaces of
(top) and of
(bottom) according to the values of
.
It can be seen that surfaces of the second type are much more regular,
and that the "parasitic peaks" corresponding to
small intervals have disappeared.
A visualization of these results is given by FIG. 5.
The first part (upper left) shows a plot of the pairs
obtained for the first test piece. The vertical lines are the limits
of the range of the
best values for
while the
horizontal line stands at the 75th best
. The second part
(upper right) gathers the three graphs corresponding to the three
test pieces of the set, while the third part (bottom) uses
as quality factor.
The use of the quality factor
leads to the global range
, i.e. a relative
amplitude around
, the relative amplitudes concerning only
one test piece being around
. For example, the best formula
for the first test piece of the set is: