Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Bernoulli distribution
Bernoulli Distribution
Discrete distribution.
The outcome in a Bernoulli experiment is one of two mutually
exclusive events, classified as either success (x = 1) or failure
(x = 0).
Denote the probability of success (i.e., x = 1) by p,
0 < p 1. Let X denote the Bernoulli random variable so
that:
1, w.p. p;
X =
0, w.p. 1 p.
Notation: X Bernoulli (p) .
Formulae & Tables (F&T) book page 7.
202/220
=0 (1 p) + 1 p = p.
variance:
Var (X ) =E X 2 2X
X
= pX (x) x 2 p 2
all x
2
=0 (1 p) + 12 p p 2 = p (1 p) .
m.g.f.:
h i
MX (t) =E e Xt
X
= pX (x) e xt
all x
203/220
=(1 p) e 0t + pe 1t = p e t + (1 p) .
Bernoulli distribution
204/220
n = 20, p = 0.5
0.2
0.1
0
0
10
20
x
Binomial(200,0.5) p.m.f.
n = 200, p = 0.5
0.2
0.1
205/220
0
0
100
x
200
Binomial(20,0.5) p.m.f.
0.3
Binomial(20,0.1) p.m.f.
0.3
n = 20, p = 0.1
0.2
0.1
0
0
10
20
x
Binomial(200,0.1) p.m.f.
n = 200, p = 0.1
0.2
0.1
0
0
100
x
200
Binomial(20,0.5) c.d.f.
1
0.5
n = 20, p = 0.5
0
10
20
x
Binomial(200,0.5) c.d.f.
0.5
206/220
n = 200, p = 0.5
0
0
100
x
200
Binomial(20,0.1) c.d.f.
1
0.5
n = 20, p = 0.1
0
10
20
x
Binomial(200,0.1) c.d.f.
0.5
n = 200, p = 0.1
0
0
100
x
200
n
p x (1 p)nx , for x = 0, 1, . . . , n,
x
and zero otherwise.
p.m.f.: pX (x) =
mean: (alternative
" n proof:
# seen slide 211)
X
X
E [X ] =E
Yi =
E [Yi ] = n E [Yi ] = n p.
i=1
i=1
variance: (alternative
see slide 211)
! proof:
n
n
X
X
Yi =
Var (X ) =Var
Var (Yi ) = n Var (Yi ) = n p (1 p) .
i=1
i=1
m.g.f.: (alternative
slide
h i proof:
h Psee
i 209)h
n i
n
Xt
Y
t
E e
=E e i=1 i = E e Yi t
n
= p e t + (1 p)
207/220
p x (1 p)nx = (p + 1 p)n = 1.
(a + b) =
n
X
n
k=0
ank b k ,
= p e t + (1 p)
* again, applying the binomial expansion with a = p e t and
b = (1 p).
209/220
MX00 (t) =n (n 1) p e t + (1 p)
p et
n1
+ n p e t + (1 p)
p et
n2
=npe t pe t + (1 p)
(n 1) pe t + pe t + (1 p) .
* using product rule:
210/220
f (x)g (x)
x
f (x)
x
g (x) +
g (x)
x
f (x).
212/220
Bernoulli distribution
mean: X
X
E [X ] = x p (1 p)x1 = p
x (1 p)x1
all x
=p
X
x=1
x=1
(1 p)x = p
(1 p)x
(1 p)
(1 p)
x=1
(1 p)
1
1
=p
=p 2 = .
(1 p) 1 (1 p)
p
p
1
1
a
x
x=0 a = 1a
x=1 a = 1a 1 = 1a .
f (x)
g (x)
g (x) x f (x)
f (x)
x
** Using quotient rule: x
.
g (x) =
(g (x))2
214/220
h
i X
m.g.f.:
X t
MX (t) =E e
=
e xt p (1 p)x1
x=1
X
t
=p e
e tz (1 p)z
z=0
=p e t
215/220
z
e t (1 p)
z=0
p et
1 (1 p) e t
1
Note: (1 p) e t < 1 t < log 1p
.
P z
1
* using geometric series:
z=0 a = 1a .
Geo(0.5) p.m.f.
p = 0.5
0.4
0.3
0.2
0.1
0
0
10
15
x
Geo(0.5) c.d.f.
1
0.5
216/220
p = 0.5
0
10
x
15
Geo(0.1) p.m.f.
p = 0.1
0.4
0.3
0.2
0.1
0
0
20
40
x
Geo(0.1) c.d.f.
0.5
p = 0.1
0
20
40
x
Bernoulli distribution
217/220
0
10
20
0
50
100
x
x
Negative Binomial(10,0.5) p.m.f.
Negative Binomial(10,0.1) p.m.f.
0.2
0.2
r = 10, p = 0.5
r = 10, p = 0.1
0.15
0.15
0.1
0.05
219/220
0
0
20
x
40
0.1
0.05
0
0
100
x
200
0.5
r = 3, p = 0.5
0
10
20
x
Negative Binomial(10,0.5) c.d.f.
1
0.5
220/220
r = 10, p = 0.5
0
20
x
40
0.5
r = 3, p = 0.1
0
50
100
x
Negative Binomial(10,0.1) c.d.f.
1
0.5
r = 10, p = 0.1
0
0
100
x
200
Week 2
Week 3
Week 4
Probability:
Week 6
Review
Estimation: Week 5
Week
7
Week
8
Week 9
Hypothesis testing:
Week
10
Week
11
Week
12
Linear regression:
Week 2 VL
Week 3 VL
Week 4 VL
Video lectures: Week 1 VL
Week 1
Week 5 VL
Last week
Introduction to probability;
Definition of probability measure, events;
Calculating with probabilities; Multiplication rule,
permutation, combination & multinomial;
Distribution function;
Moments: (non)-central moments, mean, variance (standard
deviation), skewness & kurtosis;
Generating functions;
301/354
This week
302/354
Poisson p.m.f.
0.4
= 1
= 2
= 4
1
0.9
0.35
Poisson c.d.f.
0.3
0.25
0.2
0.15
0.1
0.8
0.7
0.6
0.5
0.4
0.3
0.2
0.05
0
304/354
= 1
= 2
= 4
0.1
0
5
x
10
5
x
10
e x
x! ,
for x = 0, 1, . . .;
X
X
x e
E [X ] = x pX (x) =
x
x!
x=0
all x
x1
X
x=1
z
X
=
=
(x 1)!
z=0
e
= ;
z!
P
all x
305/354
pX (x) = 1.
Var (X ) =E X 2 (E [X ])2 = 2 + 2 = ,
* using:
X
X
x e
E X 2 = x 2 pX (x) =
(x (x 1) + x)
x!
all x
x=0
= + 2
x!
x2
X
x=2
x=0
(x (x 1))
x=0
e
(x 2)!
= + 2
=+ ;
** z = x 2 *** using
306/354
P
all z
pX (z) = 1.
x e
x!
z
X
e
z!
z=0
X
e x
x!
x=0
= e e = 1,
* using the
Pseriesxexpansion for the exponential function (i.e.,
exp() =
x=0 x! ).
The m.g.f. is derived as (note: e tx = (e t )x ):
h i X
X
e x
e ( e t )x
e xt
MX (t) =E e tX =
=
x!
x!
x=0
x=0
=e e e
=
X
e e ( e t )x
x!
|x=0
{z
}
pY (y ) = 1, with Y POI( e t )
all y
307/354
= exp e t 1
Exercise
The expected number of claims for personal accident
insurance is once every ten year.
a. Question: Which distribution would you use to model the
number of claims for an insured in a year?
b. Question: What is the probability of two or more claims
using a Poisson distribution?
c. Question: What are the parameters of the Binomial
distribution?
d. Question: What are the differences between the Poisson
distribution and the Binomial distribution, especially in the
tail?
308/354
Solution
a. Solution: X POI( = 0.1) distribution.
b. Solution: Using a. and the p.m.f. we have:
Pr(X 2) =1 Pr(X < 2) = 1 Pr(X 1)
=1 (Pr(X = 0) + Pr(X = 1))
=
309/354
Exponential p.d.f.
1
= 1
= 2
= 4
0.9
0.9
0.8
0.8
Exponential c.d.f.
1
0.7
0.6
0.5
0.4
0.3
0.7
0.6
0.5
0.4
0.3
0.2
0.2
0.1
0.1
0
0
4
x
311/354
0
0
= 1
= 2
= 4
2
4
x
Pr (X > a + b |X > a ) =
e (a+b)
= e b
e a
= Pr (X > b) .
=
312/354
1
,
2
0.9
0.8
0.8
Gamma c.d.f.
1
= 1, = 1
= 2, = 1
= 4, = 1
0.7
0.6
0.5
0.4
0.3
0.7
0.6
0.5
0.4
0.3
0.2
0.2
0.1
0.1
0
0
4
x
0
0
= 1, = 1
= 2, = 1
= 4, = 1
2
4
x
Gamma p.d.f.
0.45
0.4
Gamma c.d.f.
1
= 1, = 0.5
= 2, = 0.4
= 4, = 0.4
0.9
0.8
0.5
0.35
0.3
0.25
0.2
0.15
0.7
0.6
0.5
0.4
0.3
0.1
0.2
0.05
0.1
0
0
314/354
10
x
20
0
0
= 1, = 0.5
= 2, = 0.4
= 4, = 0.4
10
x
20
Gamma Distribution
density: fX (x) =
;
2
m.g.f.: MX (t) = E e Xt = t
,
provided t < (prove: see slide 318);
Proof of mean and variance: Use the moments (see slide 319).
315/354
Gamma function
The Gamma function (see F&T page 5) is defined by:
Z
() =
x 1 e x dx, > 0.
0
2 0 e
dt = 2 /2 = ,
* using x = t 2 /2, dx = tdt (see slides 328-329).
316/354
Gamma density
To show that the Gamma is a proper density, note that:
Z
Z
x 1 e x dx =
x 1 e x dx
()
()
0
0
and re-parameterising with z = x so that dx = 1 dz, we have:
()
Z
0
(z/)1 e z dz
() 0
=
z 1 e z dz
()
0
|
{z
}
x 1 e x dx =
=()
=1.
317/354
Gamma m.g.f.
To prove the formula for the m.g.f., we have:1
Z
e xt
x 1 e x dx
MX (t) =
()
0
Z
=
x 1 e x(t) dx
()
0
Z
( t) 1 x(t)
x
e
dx
=
( t)
()
0
|
{z
}
R
=1, because
0 fY (y )dy
= 1, with Y Gamma(, t)
318/354
Gamma moments
There is a useful formula for higher (non-central or raw) moments
of the Gamma distribution.
Z
n
x 1 e x dx
E [X ] =
xn
()
0
Z
=
x (n+1) e x dx
()
0
Z n+ (n+1) x
x
e
n
=
(n + )
dx
()
(n + )
0
|
{z
}
R
=1, because
319/354
1 (n + )
.
n
()
0 fY (y )dy
= 1, with Y Gamma(n + , )
Exercise
The Gamma(,) distribution models the time required for
events to occur, given that the events occur randomly in a
Poisson process with a mean time between events of . We
know that major flooding occurs in Queensland on average
every six years. You have to valuate a reinsurance contract
that pays:
- Zero, if there are less than two major floods in the next ten
year;
- $100 million, if there are two major floods in the next ten year;
- $150 million, if there are three or four major floods in the next
ten year;
- $200 million, if there are more than four major floods in the
next ten year.
Exercises
A life insurance company offers an annuity. In order to reduce
longevity risk, the payout after age 90 depends on the number
of survival in the pool. Each pool has 20 insured.
The probability of surviving to 90, conditional on reaching 65 is:
Female Male
Both One Only
Prob surviving to 90 41.2% 29.8% 12.3%
58.7%
You want to buy a joint and survivor annuity, which pay out
$100 if both spouses are alive and $70 if only one is alive.
- The payout is reduced with 20% if for more than 17 contracts
at least one of the spouses is alive.
- The payout is increased with 20% if for less than 10 contracts
at least one of the spouses is alive.
322/354
Exercises
a. What is the probability that the payout increases when you
reach 90?
b. What is the probability that the payout decreases when you
reach 90?
You want to reduce your risk of a reduction in your payout.
c. How many insurance contract can you buy in order to have a
probability of 5% that none of the contracts will reduce your
payments? (Hint: use Geometric distribution)
d. How many insurance contract do you need to buy in order to
have a probability of 95% that at least 3 the contracts will
increase your payments?
323/354
Solution
a. Pr(X 9) = 0.0132.
b. Let X Bin(20, 0.71) be the number of contracts with at
least one of the spouses is alive at age 90.
1 Pr(X 17) = 1 0.9567 = 0.0433.
c. Let Y Geo(0.0433) be number of contracts which do not
get a reduction in the payment.
We have: Pr(Y 1) = 0.0433, Pr(Y 2) = 0.0847,
Pr(Y 3) = 0.1244, and Pr(Y 4) = 0.1623.
Thus, you can buy one contracts.
d. Let Z NBin(3, 0.0132) be number of contracts needed to
get a at least 3 contracts with an increase in the payment.
Pr(Z 62) = 0.0489, Pr(Z 63) = 0.0509,
Pr(Z 474) = 0.9496, and Pr(Z 475) = 0.9501.
324/354
325/354
X
N (0, 1) ,
Var (Z ) =
X
Var X
E [Z ] = E
and
=
=
X = Z + .
E[X ]
Var (X )
2
= 0.
= 1.
2 !
, for
1
2
2 t 2
x 2
1
1
showing:
e 2 ( ) dx = 1.
2
x
and consider the case of the standard
Transform z =
e 2 (y +z ) dydz.
2
2
328/354
and
z = r sin()
so that
y 2 + z 2 = r 2 sin2 () + cos2 () = r 2
{z
}
|
=1
and
dydz = det
y
r
z
r
drd = det
cos() r sin()
sin() r cos()
drd = rdrd.
=0(1)=1
Normal p.d.f.
0.45
0.4
Normal c.d.f.
1
= 2, = 1
= 0, = 1
= 2, = 1
0.9
0.8
0.5
0.35
0.3
0.25
0.2
0.15
0.7
0.6
0.5
0.4
0.3
0.1
0.2
0.05
0.1
0
5
330/354
0
x
0
5
= 2, = 1
= 0, = 1
= 2, = 1
0
x
Normal p.d.f.
0.7
Normal c.d.f.
1
= 0, = 0.5
= 0, = 1
= 0, = 2
0.9
0.8
0.8
0.6
0.5
0.4
0.3
0.2
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0
5
0.1
0
x
0
5
= 0, = 0.5
= 0, = 1
= 0, = 2
0
x
We can easily verify the following (check from F&T page 160-161):
Pr (1 Z 1) =0.6826
Pr (2 Z 2) =0.9544
Pr (3 Z 3) =0.9974
331/354
f(x)
0
x
z1
f(x)
Pr(Z>z)=1
Pr(Z<z)=
332/354
0
x
z1
X
N (0, 1) .
|
{z
}
=fX (x)
1
1
exp x t
2
2
2 !
dx.
1
=x t 2 x 2 2x + 2
2
=
1
1 2
|{z}
x 2 + 2 t x 2 2
2
2
2
|
{z
}
2
a
2ba
2
2
1
1
= 2 x + 2 t
2 2 + 2 t
2
2
1
1 2 2
2
2
= 2 x + t
+ t + t .
2
2
336/354
dx.
exp x t
MX (t) =
2
Using the previous slide, replace the blue part by the red part:
Z
2
1
1 2 2
1
2
MX (t) =
+ t + t
dx
exp 2 x + t
2
2
2
=e (t+ 2
2 t 2
=1, because
1 2 2
=e (t+ 2 t ) .
337/354
1
e
2
R
0
21
{z
x + 2 t
!2
dx
}
fY (y )dy = 1, with Y N( + 2 t, 2 )
Lognormal Distribution
F&T book, page 14.
Consider X Lognormal , 2 .
Density is:
1
1
exp
fX (x) =
2
x 2
log(x)
2 !
,
338/354
x > 0.
Lognormal Distribution
If Y N , 2 and X = e Y then X is lognormal; or
log(X ) N(, 2 ), i.e., log is normally distributed.
X = exp (Y ) Lognormal , 2 is said to have a lognormal
distribution with parameters and 2 .
LogNormal p.d.f.
0.7
LogNormal c.d.f.
1
= 0, = 1
= 0, = 0.5
= 1, = 0.5
0.9
0.8
0.8
0.6
0.5
0.4
0.3
0.2
0.7
0.6
0.5
0.4
0.3
0.2
0.1
339/354
0
0
0.1
5
10
0
0
= 0, = 1
= 0, = 0.5
= 1, = 0.5
5
10
340/354
= Pr
log(a)
= Pr Z
.
log(1.16)
.
2
Pr(X >
log (40))
40) = 1 Pr (Y log(1.16)
=
log(40)log(25)+ 2
1Pr Z
= 1(1.4126) = 0.0789.
341/354
log(1.16)
1
(ba) ,
Uniform p.d.f.
Uniform c.d.f.
1
a= 0, b= 0.5
a= 0, b= 2
a= 1, b= 2
0.9
0.8
1.5
0.5
0.7
0.6
0.5
0.4
0.3
0.2
a= 0, b= 0.5
a= 0, b= 2
a= 1, b= 2
0.1
0
342/354
1
x
1
x
Uniform Distribution
cumulative distribution:
Rx
0dx
R
x
1
fX (x)dx =
FX (x) =
a (ba) dx
R
1
dx
Z
a (ba)
= 0,
if x < a;
xa
= ba , if a x b;
= 1,
if x > b.
Z
=
a
b
1 x2
a+b
1
dx =
=
.
x
(b a)
2ba a
2
variance:
Var (X ) =E X 2 (E [X ])2
b
Z b
1
a + b 2 1 x3
a + b 2 (b a)2
=
x2
dx
=
=
.
(b a)
2
3ba a
2
12
343/354
a
344/354
Beta function:
R 1 1
(1 x)1 dx =
B(, ) = ()()
(+) = 0 x
* using density of Beta function (next slide),
** if and are integers.
(1)!(1)!
(+1)! .
Beta Distribution
density: fX (x) =
(a + b) a1
x
(1 x)b1 , for 0 x 1.
(a) (b)
| {z }
1/B(a,b)
0
Z 1
1
B (a + 1, b)
=
x a (1 x)b1 dx =
B (a, b) 0
B (a, b)
(a + b) (a + 1) (b)
=
(a) (b) (a + b + 1)
(a + b)
a (a) (b)
a
=
=
(a) (b) (a + b) (a + b)
a+b
variance: Var (X ) =
345/354
ab
.
(a+b)2 (a+b+1)
Beta p.d.f.
Beta c.d.f.
1
a=
a=
a=
a=
2, b= 4
0.2, b= 1
4, b= 2
0.5, b= 0.5
0.9
0.8
2.5
1.5
0.7
0.6
0.5
0.4
0.3
0.2
0.5
0.1
0
0
346/354
0.5
x
0
0
a=
a=
a=
a=
0.5
x
2, b= 4
0.2, b= 1
4, b= 2
0.5, b= 0.5
Application
An insurance company offers nuclear incident insurance for
n = 10 years.
In these ten years there have been x = 3 years with a claim.
The insurer does not price idiosyncratic risk, but does price
systematic risk (i.e., uncertainty in p).
The insurer has only limited information of the true value p.
Assume claims are $1 billion each.
a. Question: What is the price of the contract when the price is
the mean half the standard deviation?
b. Question: What is the price of the contract when the price is
the 75% quantile?
a. Solution: (See Excel file) $0.398705 billion.
347/354
e u ( u)1 du.
Pr (u < < u + du) =
()
348/354
LTP
Pr (X = x) =
e u
e u ( u)1 du
x!
()
0
x
1
+x 1
=
x
1+
1+
i.e., NB(p =
, k = ), using:
1+
R +x1 u(+1)
e
du =
0 u
x+1
R z x+1
1
1
e
dz =
1+
1+
0
(+x)
(+1)+x .
dz
= +1
)
.
x
x! ( 1)!
x!
()
349/354
Weibull Distribution
F&T book, page 15.
Application: Survival function: used for modelling lifetimes
(time to failure).
Let X Weibull(c, ) is Weibullly distributed variable.
probability density: fX (x) = c x 1 exp (c x ), for
x > 0, and zero otherwise.
cumulative distribution function:
FX (x) = 1 exp (c x ).
moments: E [X r ] = 1 + r c r1/ .
Parameters: < 1 ( = 1/ > 1): decreasing
(constant/increasing) failure rate over time.
350/354
Pareto Distribution
Heavy tailed distribution (an extreme value distribution).
Often used for reinsurance purposes. The Pareto distribution
tapers away to zero much more slowly than LogNormal.
Hence, it is more appropriate for estimating reinsurance
premium in respect of very large claims.
F&T book page 1415. Used to model r.v. with very large
values with very low probabilities (e.g. incomes).
Cumulative distribution function ( > 0, > 0):
, x > 0.
FX (x) = 1
+x
351/354
fX (x) =
=
.
+1
(1 + x/)+1
( + x)
Pareto p.d.f.
=
=
=
=
1.8
1.6
3,
3,
3,
1,
Pareto c.d.f.
=
=
=
=
1.4
1.2
1
0.8
0.6
1
1
2
4
4
0.9
0.8
0.7
0.6
0.5
0.4
0.3
0.4
0.2
0.2
0.1
0
0
4
x
352/354
0
0
=
=
=
=
2
3,
3,
3,
1,
=
=
=
=
4
1
2
4
4
Pareto Distribution
Moments do not always exist:
E [X r ] =
( r ) (1 + r ) r
,
()
For example:
Var (X ) =
and only exists if > 2.
Prove: see exercise 8.
353/354
2
( 1)2 ( 2)