Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
2, 1996 63
Experimental design
II. Optimization
Pedro W. Araujo, Richard 2. Optimization
G. Brereton *
Bristol, UK 2.1. Sequential designs
a) x1
D
B B=79
D
4
C
x2
I,
A=64
c=93
x2
Fig. 1. Basic simplex for two factors: (a) initial simplex; (b) simulated responses; and (c) reflection of the worst
vertex to obtain the new condition.
l If the experimental conditions provided by D its response was the worst, and so V, is obtained
produce a worst response again, reject the next reflecting VI through the mid-point between V, and
to the worst vertex (B in our example). V,. This new experiment is run and V, is rejected
Sometimes when the new vertices are calculated, and a new vertex V, is found. Again, the new exper-
the values obtained lie outside the allowable exper- imental point is run and so until reaching some
imental domain, for instance, the new volume experimental condition which produce the opti-
obtained is higher than the capacity of the appara- mum response. The best response in this example
tus. In this case, the simplex should be contracted was Vlo. An important aspect of the simplex
and a new vertex must be generated. designs is to know when the procedure has finished.
An illustrative example based in the rules There are many different stopping rules [ 6-91, but
described above can be seen in Table 1 and Fig. 2, sometimes the best method involves the experi-
which show the progress of a simplex design in the menter’s skill in judging for him/herself whether
maximization of a hypothetical response. Two fac- a true optimum has been found [ 61.
tors have been considered which are described in The most notable disadvantages of the simplex
the vertices ( Vi) of a triangle (N+ 1 = 3). For a designs are that the optimum is confined to a local
better understanding, the variation of the factors is point instead of the global region, often a large
presented in coded levels (from - 1 to + 1) and number of experiments is required, replicate infor-
the response is expressed as a percentage in area. mation is not obtained, there is no information
Table 1 shows the initial simplex ( V, V,V,) and the about how confident we are in the model and with
simplex progress followed in this example. Follow-
ing the algorithm described, responses at vertices
V, to V, were obtained and V, was rejected because
L
0
Table 1
Progress in a simplex design in the maximization of an exper- k 0.6
imental response when 2 factors are varied ‘c
completely. No. Xl X2 X3
The steepest ascent (or descent) method has also
been proposed as an approach to finding an opti- 1 0 0 0
mum [ 4,10,1 11. The main principle is to use local 2 -1 0 0
3 +1 0 0
response surface modelling to determine the gra-
4 0 -1 0
dient of an overall design. Above, designs were 5 0 +1 0
performed over the entire experimental region 6 0 0 -1
which then produced a mathematical model for the 7 0 0 +1
optimum. Steepest ascent methods starts over a
small region rather like the simplex approach. For
two factors this nominally involves performing 4 where !I,,,&, is the second-order term. The follow-
experiments arranged as a 2 level factorial design ing rules can help to construct a star design.
to give a response of the form presented in Eq. 1 0 Choose the number of factors to be studied (N),
of Part I of this article. Taking derivatives, it is e.g., 3 factors.
possible to determine the gradient over each factor Establish the number of experiments to be done
and setting dv/Jx,, and ;?Y/&x~at 0, to determine according to the expression 2N+ 1, e.g., for 3
the direction of maximum change. There has been factors it will be necessary to perform 7
some discussion in the literature as to precisely how experiments.
to estimate this gradient [ 121. Once the gradient is Locate a central experimental point and a step
determined, new experiments are performed cen- size, in the space of the coded variables, from
tred on which other factor combinations are generated
by moving the same positive and negative dis-
x:,,, = GM + aAx (I)
tance ( = step size), e.g., for 3 factors the exper-
where a is the step size and Ax is obtained from imental arrangement is shown in Table 2, where
the gradient. This is continued until an optimum is the centre corresponds to the point (0, 0, 0).
reached. The experiments performed according to Table 2
The main disadvantage to the analytical chemist are exactly sufficient to estimate 2N+ 1 para-
is that if more than 2 factors are significant, the meters of the model, e.g., for 3 factors we expect
number of experiments per estimate increases by 2 7 parameters, bO, b,, bZ, b,, b, , , bz2 and b,,.
for each factor. Hence a 4 factor process requires The main disadvantage of these designs is that
16 experiments each time. This is prohibitively they do not allow for models involving interac-
large, so this approach is not popular. tions. This is evident from Table 2, where the mul-
tiplication of any column by another in order to
2.2. Simultaneous designs obtain the interaction factor will produce a result
of zero in all the experimental points. This lack of
The basic idea in simultaneous designs is to rec- estimation effects of interaction can cause the
ord one or more selected experimental responses model to fail if they have significance. Also, there
for a set of experiments carried out in a systematic are no degrees of freedom to estimate how well the
way in order to predict the optimum and the inter- model is obeyed. To overcome this problem, a bet-
action effects using regression analysis generally. ter approximation to a multifactor space model has
Factorial designs at 2 levels are suitable in the been developed [ 131. Belonging to this approach
estimation of first-order and interaction effects, but are the central composite designs, the face-centred
they fail when additional effects such as second- cube designs, rotatable central composite designs,
order effects are significant. For optimization some Doehlert designs and mixture designs. For N fac-
second order effects are necessary. tors we can allow an extension of the model to
Star designs provide a simple way to fit a model
described by the following equation: j=b,,+b,x] +b2x2 +b3x3+...+b,,++bIZx,x2
Table 3 Table 4
Central composite designforthreefactors Face-centredcubedesignforthreefactors
No. Xl X2 X3 No. Xi X2 X3
1 0 0 0 1 0 0 0
2 -1 0 0 2 -1 0 0
3 +1 0 0 3 +1 0 0
4 0 -1 0 4 0 -1 0
5 0 +1 0 5 0 +1 0
6 0 0 -1 6 0 0 -1
7 0 0 +1 7 0 0 +1
8 -0.5 -0.5 -0.5 8 -1 -1 -1
9 -0.5 +0.5 -0.5 9 -1 i-1 -1
10 +0.5 +0.5 -0.5 10 +1 +1 -1
11 +0.5 -0.5 -0.5 11 t-1 -1 -1
12 -0.5 -0.5 +0.5 12 -1 -1 +I
13 -0.5 +0.5 +0.5 13 -1 +1 +1
14 i-o.5 +0.5 +0.5 14 +1 +1 +1
15 +0.5 -0.5 +0.5 15 +1 -1 +1
Central composite designs are the result of rotatable central composite design can be used.
superimposing a factorial design at 2 levels ( 2N) Rotatable central composite designs are identical
on a star design (2N+ 1). The rules followed in its with a faced-centred cube design except that the
construction are the same as explained for both points defined for the star design must be changed
kinds of design and the total number of factor com- to ( + [2N] “4 ,...,O) and those generated by the fac-
binations will be given by 2N + 2N+ 1 to define the torial design remain unchanged. In this way the
linear, quadratic and interaction terms of the model design generates information equally in all direc-
together with an estimation of the error in the terms. tions, e.g., the variance of the estimated response
An experimental arrangement for 3 factors in a is the same at all points on a sphere centred at the
central composite design is shown in Table 3, origin. This is why these designs are called rotat-
where experiments 1 to 7 define the star and 8 to able, because a rotation of design about the origin
15 define the factorial design. In this example the does not alter the variance contours [ 171.
distance from the centre to the negative and posi- Taking into account the above considerations
tive extreme in each design is different ( + 1 in the about rotatability in Table 4 all the levels + 1 in
star and f 0.5 in the factorial design), but when experiments 8-l 5 must change to ) 1.68, whereas
the same positive and negative distance is taken experiments l-7 do not change (see Table 5). As
from the centre of the design it is called a face-
Table 5
centred cube design. The arrangement of the exper-
Rotatablecentralcomposite
designforthreefactors
imental points of this design is the same as for the
central composite except that the coded levels from No. Xl x2 X3
the factorial part must be the same than the star part
1 0 0 0
changing the levels f 0.5 by + 1 (see Table 4 and
2 -1 0 0
compare with Table 3). This kind of design has 3 +1 0 0
been used in the optimization of new methods 4 0 -1 0
[ 14,151. 5 0 +1 0
Sometimes the centre in this kind of design is 6 0 0 -1
7 0 0 +1
replicated 5 or more times, because the design
8 -1.68 -1.68 -1.68
assumes that the pure experimental uncertainty is 9 -1.68 +1.68 -1.68
related to the distance from the centre of the design, 10 +I.68 +I.68 -1.68
but sometimes it is also a function of the distance 11 +1.68 -1.68 -1.68
from the other experimental points [ 161. To fulfil 12 -1.68 -1.68 +I.68
13 -1.68 +1.68 +1.68
the condition of dependence only on the distance
14 +1.68 +1.68 +1.68
from the centre of the design and independence of 15 +I.68 -1.68 +1.68
the location of the other experimental points, a
trends in analytical chemistry, vol. 15, no. 2, 1996 67
Table 6 Table 7
Rotatable central composite design and yields obtained in Experimental field in a Doehlert matrix for three factors
enamine synthesis by the TiCId method
No. Xl X2 X3
No. Xl X2 Yield (%)
1 0 0 0
1 -1 0 73.4 2 -1 0 0
2 +1 0 69.7 3 -0.5 - 0.866 0
3 -1 -1 88.7 4 -0.5 - 0.289 -0.816
4 +1 -1 98.7 5 1 0 0
5 - 1.41 +1 76.8 6 0.5 0.866 0
6 +1.41 +1 84.9 7 0.5 0.289 0.816
7 0 0 56.6 8 -0.5 0.866 0
8 0 0 81.3 9 -0.5 0.289 0.816
9 0 - 1.41 96.3 10 0 - 0.577 0.816
10 0 + 1.41 96.4 11 0.5 - 0.866 0
11 0 0 87.5 12 0.5 - 0.289 -0.816
12 0 0 96.1 13 0 0.577 -0.816
13 0 0 90.5
an example of this design we present the synthesis between 0.50 and 1.OOmol and morpholine/ketone
of enamine from methyl isobutyl ketone and mor- (x2) varied between 3.00 and 8.00 mol. The exper-
pholine by the TiCI4 method [ 181. This synthesis imental design and yields obtained are shown in
was carried out considering the effect of two fac- Table 6, where the factors are given in coded levels.
tors, the amount of TiCl,/ketone (x,) varied The quadratic response surface model fitted to the
experiments was
I(1 II IO II
conditions: 2-S% (v/v) HCl, ashing temperature
.Q@y5&ls
4 3 4 3 4 3 I3 14
400-l 300°C and atomization temperature 2800°C.
The last type of design discussed in this section
is mixture designs. These designs are particularly
useful when the experimenter has to deal with pro-
portions which are restricted to some limits in the
Fig. 4. Extension of the experimental Doehlert matrix possible combination of the factors. For example,
to another experimental domain by using previous the proportion of three solvents A, B and C cannot
adjacents points. exceed lOO%, so if the proportion of A and B is
established then the proportion of C must be
0.212 0.200 cv NV lOO- (A+B)%.
0.065
0.346 ” 0.341
. \
0.285 0.289
0.0 2.5 5
Table 8
Capacity factor results for seven mobile phases in Fig. 7
I II III IV V VI VII
There are several types of mixtures designs one vents and the responses measured were the sepa-
of the most common being the simplex-lattice ration factor and the value of the capacity factor
design, which was introduced by Scheffi [ 2 1,221 between two adjacent bands which are shown in
and formed the base upon which the theory of Table 8. The conclusion reached in this work was
experimental design for mixtures was built. Some that none of the seven standard mobile phases alone
rules can be considered in the building of this kind provides a complete separation of the sample com-
of design: ponents, so the optimum solvent composition is
l Choose the number of factors (N), e.g., three given as 0.83% acetonitrile, 0.67% methyl tert.-
solvents: acetone, water and methanol. butyl ether and 3.05% methylene chloride by vol-
l Represent the composition of the N factors by a ume in hexane.
simplex in N- 1 dimensional space, e.g., for the In spite of the variety of other mixture designs,
three solvents considered above, this is a triangle, they have not been exploited extensively in chem-
as shown in Fig. 6. istry [6].
l Select the experimental points to be studied as
2N-‘. These experimental points correspond to
N permutations of N pure components, N!/ References
[ 2!(N - 2) !] permutation of full binary
mixtures, N!l[3!(N -3)!] permutation of all
III W. Spendley, G.R. Hext and F.R. Himsworth,
ternary mixtures, and so on, e.g., for 3 compo- Technometrics, 4 ( 1962) 441.
nents, 7 experiments have to be performed as [21 J.A. Nelder and R. Mead, Comput. J., 7 ( 1965)
follows: 3 with just one component ( 100% pure 308.
solvent), 3 with a binary mixture of 50% of each [31 K.W.C. Burton and G. Nickless, Chemom. Intell.
component and 1 with an equimolar mixture of Lab. Syst., 1 ( 1987) 135.
33.33% of each component. These experimental [41 W.G. Cochran and G.M. Cox, Disefios
points are represented in Fig. 6 by the symbol the Experimentales, Trillas, Mkxico, 2nd ed., 1990.
shaded circles. [51 G.E. P Box, Appl. Statist., 6 ( 1957) 81.
Mixture designs were used in the optimization [61 R.G. Brereton, Chemometrics, Applications of
Mathematics and Statistics to Laboratory
of solvent selectivity in liquid-solid chromatogra-
Systems, Ellis Horwood, Chichester, 1993.
phy for separating a mixture of 13 substituted naph-
171 S.L. Morgan and S.N. Deming, Anal. Chem., 46
thalene [23]. The seven experimental points (1974) 1170.
studied are shown in Fig. 7 and they correspond to [81 E.R. Aberg and A.G.T. Gustavsson, Anal. Chim.
changes in selectivity as a result of variation of the Acta, 144 (1982) 39.
mobile phase composition. Each experimental [91 E. Morgan and K. Burton, Chemom. Intell. Lab.
point is a quaternary mixture of four standard sol- Syst., 7 ( 1990) 209.
70 trends in analytical chemistry, vol. 15, no. 2, 1996
[lOI G.E.P. Box and K.B. Wilson, J. R. Statist. Sot. [ 181 R. Carlson, A. Nilsson and M. Stromqvist, Acta
Ser. B, 13 (1951) 1. Chem. Stand. Ser. B, 37 (1983) 7.
[ill G.E. P Box, W.G. Hunter and J.S. Hunter, [ 191 D.H. Doehlert, Appl. Statit., 19 (1970) 231.
Statistics for Experimenters, Wiley, Chichester, [ 201 Z. Benzo, P. Araujo, A. Sierraalta and F. Ruette,
1978. Anal. Chem., 65 ( 1993) 1107.
[I21E. Foster and R.V. Bathe, Chemom. Intell. Lab. [ 211 H. Scheffe, J. R. Statist. Sot. Ser. B, 2 ( 1958)
Syst., 9 (1990) 207.
[I31 G.E.P. Box and D.W. Behnken, Technometrics, 2 [22] I??Scheffe, J. R. Statist. Sot. Ser. B, 2 (1963)
( 1960) 445. 235.
iI41 P.W. Araujo, K. Kavianpour and R.G. Brereton, [ 231 J.L. Glajch, J.J. Kirkland and L.R. Snyder, J.
Analyst, 120 (1995) 295. Chromatogr., 238 (1982) 269.
[I51 C. Altesor, P. Corbi, I. Do1 and M. Knochen,
Analyst, 118 ( 1993) 1549.
1161 S.N. Deming and S.L. Morgan, Experimental Pedro W. Araujo and Richard G. Brereton are at
Design: A Chemometric Approach, Elsevier, the School of Chemistry, University of Bristol,
Amsterdam, 1987. Cantock’s Close, Bristol BS8 I TS, UK. For
[I71 N.R. Draper, Technometrics, 4 ( 1962) 219. bibliographical details see Part I of this article.