Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Cost-Cumulants and
Risk-Sensitive Control
Chang-HeeWon 5.1 Introduction ....................................................................................... 1061
Department of Electrical Engineering, 5.2 Linear-Quadratic-Gaussian Control ........................................................ 1061
University of North Dakota, 5.3 Cost-Cumulant Control ........................................................................ 1062
Grand Forks,
5.3.1 M i n i m a l Cost Variance C o n t r o l
North Dakota, USA
5.4 Risk-Sensitive Control .......................................................................... 1063
5.5 Relationship Between Risk-Sensitive and Cost-Cumulant Control ................ 1064
5.6 Applications ........................................................................................ 1065
5.6.1 Risk-Sensitive C o n t r o l Applied to Satellite Attitude M a n e u v e r
5.6.2 M C V Control Applied to Seismic Protection of Structures
5.7 Conclusions ........................................................................................ 1067
References .......................................................................................... 1068
while satisfying a constraint: The equation also has the cost criterion:
Ek{l} = M , (5.13) tF
A Riccati solution to JMv minimization is developed for the for all t E [0, tF] and x E R n.
open-loop case: A minimal mean cost-control law k~ satisfies V1(t, x; k~) =
V~(t,x) <_ V l ( t , x ; k ) for t E r , x E R ~ and for k, an ad-
u(t) = k(t, x(0)). (5.16)
missible control law. An MCV control law kvl M satisfies
The solution is based on the differential equations: V2(t, x; kvlM) = Vf(t, x) <_ V2(t, x; k) for t E T, x E N~
whenever k is admissible. The corresponding minimal cost
variance is given by V * ( t , x ) = V 2 * ( t , x ) - M 2(t,x) for
~(t) = A(t)z(t) - ~B(t)R 1B'(t)~(t). (5.17)
t E T, x E N~. Here the full-state feedback solution of the
~(t) = -A'(t)(fft) - 2Qz(t) - 8txQv(t). (5.18) MCV control problem is presented for a linear system and a
quadratic cost criterion.
i,(t) = A ( t ) v ( t ) + E(t)WE'(t)y(t). (5.19) Then the linear optimal MCV controller is given by (Sain et
al., 2000):
~(t) = - A ' ( t ) y ( t ) - Qz(t). (5.20)
These equations have the boundary conditions: KviM(t, X) = - R - l ( t ) B ' ( t ) [ . M ( t ) + y(t) V(t)]x,
z(O) = x(0). (5.21) where .M and V are the solutions of the coupled Riccati-type
equations (suppressing the time argument):
(fftf) = 2QF z(tf) + 8DQFV(tF). (5.22)
v(0) = 0. (5.23) 0 = A)t + A ' M + M A + Q - MBR-1B'Ad + ~2dkdBR-1Blid.
y(tF) = QFZ(tF). (5.24) (5.28)
The equations also have the control action relationship: 0 = ~ / + 4;k4EWffAd + Alid + IdA - dkdBR-lffId
where ] is the classical quadratic cost criterion: 5.5 Relationship Between Risk-Sensitive
and Cost-Cumulant Control
l = I (x'Qx + k'Rk)dt. (5.32)
To see the relationship between RS and cost-cumulant control,
consider a cost criterion:
The RS control problem then becomes a minimization of
tF
the cost IRs(O) over feedback controller k:
J= j [x(t)'Qx(t) + ld(t,x)R(t)k(t,x)]ds + x'(tE)QFX(tF). (5.42)
]~s(O)=min]Rs(O). (5.33) 0
where (y(t) + (A - H(B'R-1B ' + 0W))' cr(t) = 0 is a back- qb(s) = E e x p ( - sl). (5.44)
ward linear equation. The matrix P satisfies the forward
Riccati-type equation: The cumulant generating function tb(s) is defined by:
010
.............. i ................ i ................ ! ................ ! .................
0 ' ~AZ'!;:
~.'.:;.'~ : : : :
0 0 0 1 0.1 ,- ..... ~ ................ ~ ................. : ................. : .................
0 hwoac hw 0
I22 I22
+
r ° 0
m(t)dt + dw(t)
(5.48) -0.2-0"1 ...... ; ............. -.-i ................ i ................. !i.................
cos (c~)
-0'30 5 10 15 20 25
L sin (oO L/~2J Time [hours]
dy(t) = I4x4X(t)dt + dr(t) (5.49) FIGURE 5.2 Roll (Dark) and Yaw (Light) Versus Time, RS Control
1066 Chang-Hee Won
0 I ] x(t)dt
dx(t) = _MT1Ks _MT1C,
-o.1 . . . . . . . . . . . . . ~. . . . . . . . . . . . . . . . . ~. . . . . . . . . . . . . . . . . ::. . . . . . . . . . . . . . . . . ::. . . . . . . . . . . . . . . . .
+ E] 0
MT1B,
u(t)dt +
E°I -Fs
dw(t),
-0"30 5
i 10
............. 15
. . . . . . . . . . . . . . . .
20
. . . . . . . . . . . . . . . . . .
o]
Cs = , r$ ~
mance. 0 -c3 C3
To compare the results with the well-known LQG controller,
k~ + k2 -k2
the system was simulated with an LQG controller. The 0
approached infinity in equation 5.36. Note that equation 5.36 K, = -k2 k2+k3 -k 3 .
x3(t)
m3
160
k3 , c 3
150
140
x2( t) 130
m2
120
k2,c2 q-
110
>
100
90
80
k l , Cl
70
i i i i i i i i
60 0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2
Gamma
FIGURE 5.4 Schematic Diagram for Three Degree-of-Freedom
Structure FIGURE 5.5 Optimal Variance: Full-State Feedback, MCV, 3DOF
5 Cost-Cumulant and Risk-Sensitive Control 1067
0.02 i i i i i i i ~ i
t~
0.019
E
o~ 0.018
0.017
0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2
0'03cI~ '. ! ! ! ! ! ! !
I
oo15
t ........................................
i ; i ; i . . . .
:o+
0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2
o.o~| ! ! ! ! ! ! ! ! ! /
................. : ........................ 1
°.°3
oo21
r , , , , , , :, , , I
' 0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2
E
co
o.~
t oi
0
o .........
i
0.2
i
0.4
i
0.6
i
0.8
i
1 1.2
i
1.4
'
1.6 1.8 2
Lf')
X
¢.O
X
E
0.7 ............. : : ........ :
09
match modal frequencies and dampings of an experimental Figure 5.6 shows the RMS displacement responses of first
structure. The cost criterion is given by: (~xl), second (~x2), and third ((Yx3)floor and the RMS velocity
responses of first (Crx4), second (Crx~), and third (Crx6) floor,
respectively, versus the MCV parameter % It is important to
l = J(z'(t)K~(t) + kcu2(t))dt, note that both third floor RMS displacement and velocity
responses can be decreased by choosing large %
terms of the cost cumulants. Cost cumulant control, which is Kwakernaak, H., and Sivan, R. (1972). Linear optimal control systems.
also called statistical control, views the optimization criterion New York: John Wiley & Sons.
as a random variable and minimizes any cumulant of the Rhee, I., and Speyer, J. (1992). Application of a game theoretic
optimization criterion. Then LQG, MCV, and RS are all special controller to a benchmark problem. Journal of Guidance, Control,
and Dynamics 15(5), 1076-1081.
cases of cost cumulant control where in LQG the mean, in
Runolfsson, T. (1994). The equivalence between infinite-horizon op-
MCV the variance, and in RS all cumulants of the cost func-
timal control of stochastic systems with exponential-of-integral
tion are optimized. This chapter provides the optimal control- performance index and stochastic differential games. IEEE Transac-
lers for the LQG, MCV, and RS methods. Finally, satellite tions on Automatic Control 39(8), 1551-1563.
attitude control application using RS controller and building Sain, M.K., and Liberty, S.R. (1971). Performance measure densities
control application using MCV controller are described. for a class of LQG control systems. IEEE Transactions on Automatic
Control AC-16 (5), 431-439.
Sain, M.K., Won, C.H., and Spencer, Jr., B.E (1992). Cumulant mini-
References mization and robust control. In Duncan, T.E., and Pasik-Duncan,
Anderson, B.D.O., and Moore, J.B. (1989). Optimal control, linear B. (Eds.) Stochastic Theory and Adaptive Control Lecture Notes in
quadratic methods. Englewood Cliffs, NJ: Prentice Hall. Control and Information Services 184 Berlin: Springer-Veflag, pp.
Basar, T., and Bernhard, P. (1991). H~-optimal control and related 411-425.
minimax design problems. Boston: Birkhauser. Sain, M.K., Won, C.H., Spencer, Jr., B.F., and Liberty, S.R. (2000).
Bensoussan, A. (1992). Stochastic control of partially observable sys- Cumulants and risk-sensitive control: A cost mean and variance
tems. Cambridge: Cambridge University Press. theory with application to seismic protection of structures. In l.A.
Davis, M.H.A. (1977). Linear estimation and stochastic control. Filar, V. Gaitsgory, and K. Mizukami (Eds.), Advances in Dynamic
London: Halsted Press. Games and Applications, Annals of the International Society of
Fleming, W.H., and Rishel, R.W. (1975). Deterministic and stochastic Dynamic Games, Vol. 5. Boston: Birkhauser.
optimal control. New York: Springer-Verlag. Uchida, K., and Fujita, M. (1989). On the central controller: Charac-
Glover, K., and Doyle, J.C. (1988). State-space formulae for all stabil- terizations via differential games and LEQG control problems.
izing controllers that satisfy Ho~-norm bound and relations to risk Systems and Control Letters 15(1), 9-13.
sensitivity. Systems and Control Letters 11, 167-172. Whittle, P. (1996). Optimal control, basics and beyond. New York: John
Jacobson, D.H. (1973). Optimal stochastic linear systems with expo- Wiley & Sons.
nential performance criteria and their relationship to deterministic Won, C.H. (1995). Cost cumulants in risk-sensitive and minimal cost
differential games, IEEE Transactions on Automatic Control, AC-18, variance control, Ph.D. Dissertation. University of Notre Dame.
124-131.