Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Expectation-Maximization derivation
AR(1) model is defined as follows
A ∼ p(A) = N (A|0, P )
R ∼ p(R) = IG(R|α, β)
1 r−(a+1) b
IG(r|a, b) = exp(− )
Γ(a) b−a r
In the last lecture, we derived equations to implement Expectation-Maximization
to maximize A parameter. Below is the derivation:
We want to find
p(R|x, Aold )
Z
= log dR p(A, R, x)
p(R|x, Aold )
≥ hlogp(A, R, x)ip(R|x,Aold ) − log p(R|x, Aold ) p(R|x,Aold )
1
1 1 (x1 − Ax0 )2 β
log p(R|x, Aold ) =+ − logR − (α + 1)logR − −
2 2 R R
3 1 2 1
= −(α + )logR − ( (x1 − Ax0 ) + β)
2 2 R
+ 1 1 2
= log IG(R; α + , (x1 − Ax0 ) )
2 2
We found that p(R|x, Aold ) is distributed according to an inverse gamma
density with known parameters. Now we return to the bounding function. We
only choose terms of φ that depend on A:
A2 x20 A2
old + x1 Ax0
B(A|A )= − −
R 2R 2P p(R|x,Aold )
2 2
A2
A x0 1
= (x1 Ax0 − ) −
2 R p(R|x,Aold ) 2P
A2 x20 A2
B(A|Aold ) =+ (x1 Ax0 − )z −
2 2P
We take derivative with respect to A and equate to zero to find Anew :
δB(A|Aold )
0=
δA
Anew
0 = x1 x0 z − Anew x20 z −
P
zx1 x0
Anew =
x20 z
+ P −1
Question: