The first quality that is expected from an engineer is the ability
to submit his/her findings in a clear and scientific language. All
listings, computations, charts and other "printer outputs"
can in no way replace a statement of conclusions, coined in an accurate
and scientific language.
The examination was about three different models for a single round
of tests. It was therefore necessary to coordinate different types
of materials in order to produce an organized response. Different
methods have been used. Doing screenshots and integrating them it
into a word processor is fine.
Doing cut and paste with scissors and glue (and traces of thumb) is
EQUALLY fine... and is probably the fastest way. You
can also incorporate comments into the listing of execution. You can
also ... All methods are right. What turns wrong is the lack of a
communication method.
Some comments are "poorly written", with a huge
number of orthographic/grammatical mistakes. However, a lack of mistakes
due to a lack of comments is not what is required.
Check the printers at the beginning of the evaluation, and print each
piece as soon as possible. At exactly the specified time, printers
will be disconnected.
On any printed document, the FAMILY_NAME/Given_name of the student
must appear (especially in the title of the figures).
Students are advised that the network traffic of their computer
is likely to be recorded during the evaluation.
How many trials () are occurring ? How many factors ()
? What is, for each factor, the number of levels? What is the size
() of the code suitable for an affine model without correlation
?
The given design includes trials. There are factors,
each of them with three levels that are labeled . The size
of the whole product space to be explored is .
If we consider these levels as mere identifiers, the corresponding
reduced Boolean coding necessitates
unknowns (one
for the affine constant of the model, and two for the independent
levels of each factor).
Read the file into Scilab (by modifying the procedure for reading
... or by editing the file by hand). Give matrix coding the
design.
It was necessary to transform the "space" into something
else, such as "underscore". Then convert text separators,
formerly "space / semi-colon / space" into Scilab
tokens, such as "space".
In case of failure, it was suggested to modify the file by hand, using
a text-processor such as edit or notepad or scipad
or ...
The result is a string matrix datas whose size is
,
then a numerical matrix ma whose size is
(cf Table 1, left).
Table 1:
Matrices ma and Ma
Give some Cartesian representations of the design under study.
What can be observed? What do you think of this allocation?
The Cartesian map
, that use the
first two factors for the first axis and the last two for the second
axis, does not reveal any defect.
On the other hand, Cartesian maps using three of the four factors
are illustrating the qualities of the design. There are four ways
to choose three of the four terms. And for every restriction a three
factors complete design is obtained.
Recall what is the measure typically used to determine the quality
of a design of experiment. Get, by a random pick, an alternative design.
How good is it? Compare with the design described by the given file.
The measure usually used to determine the quality of a design is the
minimum eigenvalue of the matrix ms=ma'*ma. The greater
is this value, the lower is the propagation of error.
The command [Rdatas,Rma,Rms]=randplan() provides a design
among all the possibilities.
When comparing the values of the two matrices ms (left) and
Rms (right), we see that the proposed design ms
is far better that the design Rms obtained at random.
Use the least squares method and determinethe
affine model without interactions mx that provides
the best fit with the experimental results.
The least squares method is
ms=ma'*ma ; mx=(1/ms)*ma'*mb
This formula is used to determine the model which provides the best
fit with the experimental data.
One obtains:
Assess the confidence that can be given to this model. Give the details
of computations, draw and print any useful graphic.
The confidence that can be given to a model is related to its ability
to predict the results of the measures that have not been made. The
assessment of this ability is mainly based on its ability to retrospectively
predict the past.
A first graph is obtained by reporting the experimental results on
the x-axis and the discrepancies between model and reality on the
y-axis. This yields Figure 1.
Figure 1:
Discrepancies versus results
The estimation of the goodness of fit is based on the ratio of the
variances :
VRF=variance(mb)/variance(mdelta)
A visual estimation can be obtained by using the ratio of the amplitudes :
(max(mb)-min(mb))/(max(mdelta)-min(mdelta))
roughly speaking, this ratio is not very different from the square
root of the VRF.
The value VRF quantifies the ability to back-cast
past events. The fore-cast ability is obtained by using a correcting
factor, due to the degrees of freedom that have been burnt to obtain
the coefficients of the model. This yields to :
Even so reduced, the quality of the model is good (VRF clearly above
1).
Evaluate the (isolated) influence of each factor (plot, print
and comment any useful graphic).
The influence of each factor is estimated by examining the amplitude
of variations obtained while averaging all others factors (cf Figure 2).
Figure 2:
Influences of the four factors
Factor D is the most influencing, while factor B is about six times
less influent.
The parameters are now considered as continuous quantities,
and the numbers are now perceived as the values taken
by a continuous variable, rather than only labels for a stepwise parameter.
Trial
is now coded by
in order to use centered variables. Obtain the new coding matrixmAand the corresponding new modelmX.
Show that the design already studied can be used to evaluate the pairwise
interactions of the parameters.
A quadratic model requires
interaction coefficients
and curvature coefficients together with the already
used coefficients of the affine model. This results into degrees
of freedom burnt to compute coefficients, leaving 10 of them for averaging
the errors.
We can also forecast that curvature coefficients will be very small,
and only take the interaction coefficients. Acting that way leads
to a coefficients model.
Conduct the necessary calculations and obtain the coefficients
of this new model.
Procedure auh(j,k) can be used to build stepwise the Zma
(without square terms) and maa (with square terms) matrices.
It is better to work with centered variables. The are centered
by design. Therefore, the are centered either. On the
other hand, the square terms are not centered. We rather use
and correct matrix maa accordingly into matrix Yma.
The spectra of matrices Yms and Zms are given by :
The corresponding coefficients are given in Table 2.
It can be seen that affine, only crossproducts and full quadratic
models are "nested". This happens because the corresponding
groups of columns are linearly independent (Yms is a block
diagonal matrix).
Table 2:
Coefficients mX, Zmx and Ymx of the three
models
Some concluding words would be welcome.
The latest model, with four more factors, leads to a residual variance
slightly smaller. But what counts is not to better predict the past.
However, the model without square terms has a better ability to predict
the future because its VRF is higher (446 instead of 395).
In any case, a second degree model is far better than a first degree
one since the terms expressing interactions 12 and 34 have an influence
comparable with the first degree terms, and are therefore essential
for an efficient modeling.