Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Coursework
Walter Peralta
ID 201074957
Dr John Paul Gosling
(
)
1
1
l(; X) = (Xi |) exp 2 (x )2
(1)
Answer 2. Elicitation of our beliefs about the parameters and a prior specification:
Let start by defining the parameter over the range (15,56). We consider that the value
that splits the mode into two parts with equal probability is 35. A reasonable explanation for
accepting this value is that most lectures tend to finish after half an hour. We assume that
lecturers tend to approximate as far as possible up to the half hour.
1
Then, we want to know which value splits the mode into two over the range (15, 30), such
that there is an equal chance of being above or below this value. There is no chance that this
value is close to 15, because it is unlikely that a lecture would start if only records around 15
minutes. Moreover, when a lecture start, whatever the topic might be, there is an introduction
and a conclusion of the main topic. Thus, we will give an answer of 25. Finally, the value which
splits the mode over the upper range (30, 56) is going to be 40, considering that many lectures
tend to finish their recordings as close as the time is up.
Our judgements for the parameters are expressed below:
Range (15, 56)
Median = 35
LQ = 25
UQ = 40
Using the MATCH elicitation software tool, a Beta distribution is fitted according to our
judgements: ( 15)/41 Be(1.49407, 1.773119) in Figure 1-left.
Figure 1:
Regarding the spread value , we are uncertain about it. As far as we know, every lecture
takes one hour long, but we assume that all of them has different timing, due to the difference
in topics and lecturers. Therefore, we will define a range not too small, but not too big, being
likely to be around 15 and 5 minutes.
We are now able to build the preposterior distribution for the data in order to check our
beliefs. Figure 1-right reflects more or less our judgements about the data on video lengths.
L(; x) =
1
(x )2 + C
2
2
(2)
(, |Xi ) (, )(Xi |, )
!0.49407
!0.773119
15
15
1
41
41
!
! Z56
(
)
1
1
1
1
1
exp 2 (Xi )2
9
9
2
15
(3)
Looking at our data, we can know start numerical integration approach using the MetropolisHastings algorithm. We use logs over the values to ease the analysis. Using a sample chain of
100000, we get the following plot with respect to in Figure 2-left.
Figure 2:
The plot shows a pattern that moves around the value = 31.43583. This pattern may be seen
as a a white noise process. However, in Figure 2-right, there is a some autocorrelation between
the chains with memory between them. A similar pattern is observed in Figure 3 with respect
to .
3
Figure 3:
Looking at the chain after a burn-in of 5000 in Figure (4), we obtain the following statistics:
Post-mean = 31.43639
95%CI : (29.17826, 33.41413)
Post-variance = 1.24441
95%CI : (1.035652, 1.676541)
Figure 4:
Answer 6. Networks:
We add the following explanatory variables to the DAG in Figure 6:
Z9 = late bus
Z10 = preparation of the lecture
Figure 6: DAG
a) The lecturers bedtime as well as his condition of being tired affects the start of the lecture.
Being tired can also be affected by his late bedtime. Commuting to the university has another
influence over being on time and thus over the lecture starting time. Moreover, if the lecturer is
tired, his condition has influence over the preparation of the lecture, which can also be affected
5