
Chapter 7

Maximum Likelihood Estimation (MLE)

Motivation for MLE

Problems:

1. The MVUE often does not exist or can't be found
   <See Ex. 7.1 in the textbook for such a case>
2. The BLUE may not be applicable (e.g., when the data model is not linear: x ≠ Hθ + w)

Solution: If the PDF is known, then the MLE can always be used!!!


This makes the MLE one of the most popular practical methods.

Advantages:

1. It is a "turn-the-crank" method
2. Optimal for large enough data records (asymptotically optimal)

Disadvantages:

1. Not necessarily optimal for small data records
2. Can be computationally complex
   - may require numerical methods

Rationale for MLE

Choose the parameter value that makes the data you did observe
the most likely data to have been observed!!!

Consider 2 possible parameter values: θ1 & θ2
Ask the following: If θi were really the true value, what is the
probability that I would get the data set I actually got?
Let this probability be pi.
So if pi is small, it says you actually got a data set that was
unlikely to occur! Then θi is not a good guess!!!

But p1 ≈ p(x; θ1) dx
    p2 ≈ p(x; θ2) dx

⇒ pick θ̂_ML so that p(x; θ̂_ML) is largest


Definition of the MLE

θ̂_ML is the value of θ that maximizes the likelihood
function p(x; θ) for the specific measured data x

[Figure: plot of the likelihood function p(x; θ) vs. θ; θ̂_ML is the location of its peak]

Note: Because ln(z) is a monotonically increasing function,

θ̂_ML also maximizes the log-likelihood function ln{p(x; θ)}

General Analytical Procedure to Find the MLE

1. Find the log-likelihood function: ln p(x; θ)
2. Differentiate w.r.t. θ and set to 0: ∂ln p(x; θ)/∂θ = 0
3. Solve for the value θ̂_ML that satisfies the equation
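
As a quick worked instance of these three steps, consider the classic DC level in WGN, x[n] = A + w[n] with w[n] ~ N(0, σ²) and σ² known (a standard example stated here for reference, not one of this chapter's numbered examples):

  \ln p(\mathbf{x}; A) = -\tfrac{N}{2}\ln(2\pi\sigma^2)
    - \tfrac{1}{2\sigma^2}\sum_{n=0}^{N-1}\bigl(x[n]-A\bigr)^2   \quad\text{(step 1)}

  \frac{\partial \ln p(\mathbf{x}; A)}{\partial A}
    = \tfrac{1}{\sigma^2}\sum_{n=0}^{N-1}\bigl(x[n]-A\bigr) = 0   \quad\text{(step 2)}

  \hat{A}_{ML} = \frac{1}{N}\sum_{n=0}^{N-1} x[n]   \quad\text{(step 3: the sample mean)}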

Ex. 7.3: Example of MLE When the MVUE Is Non-Existent

x[n] = A + w[n],   A > 0,   w[n] = WGN ~ N(0, A)

so x[n] ~ N(A, A)  (the single parameter A sets both the mean and the variance)

Likelihood Function:

p(x; A) = [1 / (2πA)^(N/2)] exp{ -[1/(2A)] Σ_{n=0}^{N-1} (x[n] - A)² }

To take ln of this, use log properties:

ln p(x; A) = -(N/2) ln(2πA) - [1/(2A)] Σ_{n=0}^{N-1} (x[n] - A)²

Take ∂/∂A, set = 0, and change A to Â:

-N/(2Â) + (1/Â) Σ_{n=0}^{N-1} (x[n] - Â) + [1/(2Â²)] Σ_{n=0}^{N-1} (x[n] - Â)² = 0

Expand this:

-N/(2Â) + (1/Â) Σ x[n] - N + [1/(2Â²)] Σ x²[n] - (1/Â) Σ x[n] + N/2 = 0

Cancel the ±(1/Â) Σ x[n] terms, then manipulate (multiply by -2Â²/N) to get:

Â² + Â - (1/N) Σ_{n=0}^{N-1} x²[n] = 0

Solve this quadratic equation (taking the positive root, since A > 0) to get the MLE:

Â_ML = -1/2 + sqrt{ (1/N) Σ_{n=0}^{N-1} x²[n] + 1/4 }
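
A minimal MATLAB sketch of this estimator (my own illustration; it just evaluates the closed form above on simulated data):

  % Ex. 7.3 numerical sketch: x[n] = A + w[n], w[n] ~ N(0, A), A > 0
  N = 10000;
  A_true = 2.0;                           % hypothetical true value
  x = A_true + sqrt(A_true)*randn(N,1);   % noise variance equals A

  % Closed-form MLE derived above
  A_ml = -1/2 + sqrt( mean(x.^2) + 1/4 );
  % For large N, A_ml should land close to A_true (asymptotically unbiased)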

Can show this estimator is biased (see bottom of p. 160)

But it is asymptotically unbiased.
Use the Law of Large Numbers: Sample Mean → True Mean

(1/N) Σ_{n=0}^{N-1} x²[n] → E{x²[n]}   as N → ∞

So can use this to show:

E[Â_ML] → -1/2 + sqrt{ E{x²[n]} + 1/4 }
        = -1/2 + sqrt{ A² + A + 1/4 }      (since E{x²[n]} = var + mean² = A + A²)
        = -1/2 + (A + 1/2)
        = A

var(Â_ML) → A² / [N(A + 1/2)] = CRLB

⇒ Asymptotically Unbiased & Efficient

7.5 Properties of the MLE
(or Why We Love the MLE)

The MLE is asymptotically:

1. unbiased
2. efficient (i.e., achieves the CRLB)
3. Gaussian (in PDF)

Also, if a truly efficient estimator exists, then the ML procedure finds it!

The asymptotic properties are captured in Theorem 7.1:
If p(x; θ) satisfies some regularity conditions, then the
MLE is asymptotically distributed according to

θ̂_ML ~ N( θ, I⁻¹(θ) )    (asymptotically)

where I(θ) = Fisher Information Matrix
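
For a scalar parameter this reads as follows (a standard statement written out here for reference, not reproduced verbatim from the slides):

  \hat{\theta}_{ML} \;\overset{a}{\sim}\; \mathcal{N}\!\left(\theta,\; \frac{1}{I(\theta)}\right),
  \qquad
  I(\theta) = -E\!\left[\frac{\partial^2 \ln p(\mathbf{x};\theta)}{\partial \theta^2}\right]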

Size of N Needed to Achieve the Asymptotic Properties

This theorem only states what happens asymptotically;
when N is small there is no guarantee how the MLE behaves.
Q: How large must N be to achieve the asymptotic properties?
A: In practice: use Monte Carlo simulations to answer this

Monte Carlo Simulations: see Appendix 7A


A methodology for doing computer simulations to evaluate the
performance of any estimation method (not just for the MLE!!!)
Illustrate for a deterministic signal s[n; θ] in AWGN.
Monte Carlo Simulation:
Data Collection:
1. Select a particular true parameter value, θ_true
- you are often interested in doing this for a variety of values of θ,
so you would run one MC simulation for each value of interest
2. Generate the signal having the true θ: s[n; θ_true] (call it s in matlab)
3. Generate WGN having unit variance:
w = randn( size(s) );
4. Form the measured data: x = s + sigma*w;
- choose sigma to get the desired SNR
- usually want to run at many SNR values
(do one MC simulation for each SNR value)

Data Collection (Continued):

5. Compute the estimate θ̂ from the data x
6. Repeat steps 3-5 M times
- (call M the # of MC runs, or just # of runs)
7. Store all M estimates in a vector EST (assumes scalar θ)

Statistical Evaluation:

1. Compute the bias:           b = (1/M) Σ_{i=1}^{M} ( θ̂_i - θ_true )

2. Compute the error RMS:      RMS = sqrt{ (1/M) Σ_{i=1}^{M} ( θ̂_i - θ_true )² }

3. Compute the error variance: VAR = (1/M) Σ_{i=1}^{M} ( θ̂_i - (1/M) Σ_{j=1}^{M} θ̂_j )²

4. Plot a histogram or scatter plot (if desired)

Now explore (via plots) how bias, RMS, and VAR vary with:
θ value, SNR value, N value, etc.
Is b → 0 ?
Is RMS → sqrt(CRLB) ?
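
Here is a minimal MATLAB sketch of this whole recipe for one (θ_true, SNR, N) combination; the signal model and the estimator function `estimate_theta` are placeholders of my own that you would replace with your problem (the slides do not prescribe a specific implementation):

  % Monte Carlo evaluation at one (theta_true, SNR, N) point
  N = 100;  M = 1000;                   % data length and # of MC runs
  theta_true = 0.7;                     % hypothetical true parameter
  n = (0:N-1)';
  s = cos(2*pi*0.1*n + theta_true);     % placeholder signal s[n; theta]
  sigma = 0.5;                          % sets the desired SNR

  EST = zeros(M,1);
  for i = 1:M
      w = randn(size(s));               % step 3: unit-variance WGN
      x = s + sigma*w;                  % step 4: measured data
      EST(i) = estimate_theta(x);       % step 5: your estimator (placeholder)
  end

  b   = mean(EST - theta_true);              % bias
  RMS = sqrt(mean((EST - theta_true).^2));   % error RMS
  VAR = mean((EST - mean(EST)).^2);          % error variance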

Ex. 7.6: Phase Estimation for a Sinusoid

Some Applications:
1. Demodulation of phase-coherent modulations
(e.g., DSB, SSB, PSK, QAM, etc.)
2. Phase-Based Bearing Estimation

Signal Model: x[n] = A cos(2πf_o n + φ) + w[n],   n = 0, 1, ..., N-1
A and f_o known, φ unknown; w[n] white, ~N(0, σ²)

Recall the CRLB:

var(φ̂) ≥ 2σ² / (N A²) = 1 / (N · SNR)    (where SNR = A²/(2σ²))

For this problem all methods for finding the MVUE will fail!!
So try the MLE!!


So first we write the likelihood function:

p(x; φ) = [1 / (2πσ²)^(N/2)] exp{ -[1/(2σ²)] Σ_{n=0}^{N-1} [x[n] - A cos(2πf_o n + φ)]² }

GOAL: Find the φ that maximizes this.

(We end up in the same place if we maximize the LLF, which is
equivalent to minimizing the sum in the exponent.)

So, minimize:

J(φ) = Σ_{n=0}^{N-1} [x[n] - A cos(2πf_o n + φ)]²

Setting ∂J(φ)/∂φ = 0 gives:

Σ_{n=0}^{N-1} x[n] sin(2πf_o n + φ) = A Σ_{n=0}^{N-1} sin(2πf_o n + φ) cos(2πf_o n + φ) ≈ 0

since sin and cos are orthogonal when summed over full cycles
(sin(·)cos(·) = (1/2) sin(2·), and a sinusoid sums to ≈ 0 over full cycles).

So the MLE phase estimate satisfies:

Σ_{n=0}^{N-1} x[n] sin(2πf_o n + φ̂_ML) ≈ 0

(Interpret via inner product or correlation.)


Now using a trig identity [sin(2πf_o n + φ) = sin(2πf_o n)cos(φ) + cos(2πf_o n)sin(φ)] and then re-arranging gives:

cos(φ̂) Σ_n x[n] sin(2πf_o n) = -sin(φ̂) Σ_n x[n] cos(2πf_o n)

Or

φ̂_ML = -tan⁻¹{ [Σ_n x[n] sin(2πf_o n)] / [Σ_n x[n] cos(2πf_o n)] }
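
A minimal MATLAB sketch of this estimator (my own illustration; using atan2 instead of a plain arctan resolves the quadrant ambiguity that the closed form above leaves open):

  % ML phase estimate for x[n] = A*cos(2*pi*fo*n + phi) + w[n]
  N = 200;  fo = 0.08;                  % hypothetical values (0 < fo < 1/2)
  A = 1;  phi_true = 0.6;  sigma = 0.3;
  n = (0:N-1)';
  x = A*cos(2*pi*fo*n + phi_true) + sigma*randn(N,1);

  S = sum( x .* sin(2*pi*fo*n) );       % Q-like correlation
  C = sum( x .* cos(2*pi*fo*n) );       % I-like correlation
  phi_ml = atan2(-S, C);                % = -atan(S/C), but quadrant-safe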

Recall: I-Q Signal Generation

[Figure: x(t) is multiplied by cos(2πf_o t) and low-pass filtered to give y_i(t),
and multiplied by -sin(2πf_o t) and low-pass filtered to give y_q(t)]

Don't need to know A or σ², but do need to know f_o.

Recall: this is the approximate MLE.

The sums in the above equation play the role of the LPFs in the figure (why?).
Thus, the ML phase estimator can be viewed as: atan of the ratio Q/I.

Monte Carlo Results for ML Phase Estimation

See Figures 7.3 & 7.4 in the textbook.

