Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
0 Introductions
1.1 Background Project
Mathematics, numbers and calculations have become important parts in our
lives. It is being used by almost everyone every day especially for statistic engineers. In
the last centuries and decades ago, people used different methods to solve calculations.
Nevertheless, this has become huge different in this era of science and technology.
Since, many software was developed and introduced, people start using it in their daily
lives. There is a special yet powerful software that we can be used to calculate for
statistical calculation. This kind of software is very popular among statistic engineers
around the globe. SPSS software is easy to obtain and it is user friendly.
1.2 Objectives
1. To study the type of distribution that will be used in various sampling technique.
2. To find the solution to the problem involving independent variable (x) and
dependent variable (y).
3. To analyze all the data by using SPSS software.
1 53.3
2 52.05
3 50.75
4 50.14
5 49.79
6 49.4
7 48.81
8 48.64
9 47.97
10 46.73
11 46.26
12 45.96
13 45.13
14 44.77
15 44.51
16 44.47
17 44.36
18 44.01
19 43.76
20 43.4
21 43.1
22 42.89
23 42.6
24 42.39
25 42.16
26 41.91
27 41.59
28 41.39
29 41.19
30 40.9
31 40.57
32 40.37
33 40.16
34 39.96
35 39.89
36 39.55
37 39.2
38 38.96
39 38.82
40 38.61
41 38.35
42 38.21
43 37.82
44 37.54
45 37.29
46 37.16
47 36.95
48 36.54
49 36.31
50 36.12
51 36.05
52 35.64
53 35.4
54 35.18
55 34.88
56 34.49
57 34.26
58 33.89
59 33.38
60 33.26
61 33.08
62 32.73
63 32.42
64 32.09
65 31.78
66 31.66
67 31.6
68 31.29
69 31.11
70 30.76
71 30.51
72 30.31
73 29.99
74 29.77
75 29.54
76 29.26
77 29.1
78 28.84
79 28.44
80 28.12
81 27.82
82 22.76
83 22.37
84 22.34
85 17.53
86 17.23
87 16.99
88 16.66
89 16.27
90 15.88
91 15.63
92 15.46
93 15.08
94 14.57
95 14.27
96 14.03
97 13.81
98 13.41
99 13.29
100 12.78
2.2 Analysis of data
2.2.1 Question A
State the mean, standard deviation, minimum and maximum value for the
independent and dependent variables.
Descriptive Statistics
Mean for Y:
∑ 𝑌𝑌
𝑌𝑌 =
n
3429.722
𝑌𝑌� 2 = = 117629.7928
100
2
∑ 𝑌𝑌 2 − 𝑌𝑌� 2
𝑆𝑆 =
𝑛𝑛 − 1
128775.3 − 117629.7928
𝑆𝑆 2 =
100 − 1
𝑆𝑆 2 = 112.580881
Mean for X:
∑ 𝑋𝑋
𝑋𝑋 =
n
1 + 2 + 3 + ⋯ + 98 + 99 + 100
𝑋𝑋 = = 50.50
100
50502
𝑋𝑋� 2 = = 255025
100
∑ 𝑋𝑋 2 −𝑋𝑋� 2
𝑆𝑆 2 = 𝑛𝑛−1
338350 − 255025
𝑆𝑆 2 =
100 − 1
𝑆𝑆 2 = 841.6667
Minimum value X = 1
2.2.2 Question B
Sketch a scatter diagram and regression line for the data.
2.2.3 Question C
Assuming a linear relationship, use the least squares method to find the
simple linear regression model.
(∑ x)2
Sxx= ∑ x 2 − n
(5050)2
= 338350 – 100
= 83325
(∑ y)2
Syy= ∑ y2 − n
(3429.72)2
= 128775.3 – 100
= 11145.50722
To find β/β1 :
Sxy
β=
Sxx
−29538.96
= 83325
= -0.3545
To find α/β0 :
α = 𝑦𝑦� − β𝑥𝑥̅
= 34.2972 − (-0.3545)( 50.50 )
= 52.19945
y = α + βx
y = 52.19945 − 0.3545x
2.2.4 Question D
Check the significance (α=0.05) of the simple linear regression model in
question C; y = 𝟓𝟓𝟓𝟓. 𝟏𝟏𝟏𝟏𝟏𝟏𝟏𝟏𝟏𝟏 − 𝟎𝟎. 𝟑𝟑𝟑𝟑𝟑𝟑𝟑𝟑𝟑𝟑 .
Coefficientsa
95.0% Confidence
Unstandardized Standardized Interval for
Coefficients Coefficients B
Upper
Model B Std. Error Beta t Sig. Lower Bound Bound
1 (Constant) 52.200 .528 98.782 .000 51.151 53.248
% -.355 .009 -.969 -39.022 .000 -.373 -.336
a. Dependent Variable: Celsius
b. t= -39.022, and p-value = 0.000
For significant P-value = 0 that is less than α=0.05 (0 ≤ 0.05), so the simple
linear regression model is significant.
2.2.5 Question E
Interpret the meaning of the intercept and slope in this problem. Please
comment any impossible event.
y = 52.19945
y = 16.74945
Intercept, 𝛼𝛼 : When the number of humidity is 0, the temperature will be 52.19945°C.
2.2.6 Question F
Test the hypothesis 𝑯𝑯𝟎𝟎 : 𝜷𝜷𝟏𝟏 = 0 and 𝑯𝑯𝟏𝟏 : 𝜷𝜷𝟏𝟏 ≠ 0 by taking 5% level of
significance.
𝐻𝐻0 ∶ 𝛽𝛽1 = 0
𝐻𝐻1 ∶ 𝛽𝛽1 ≠ 0
𝑇𝑇𝑐𝑐 = 𝑇𝑇𝛼𝛼�2,𝑣𝑣 = 𝑇𝑇0.025,99 = 1.98, reject 𝐻𝐻0 when 𝑇𝑇𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 is more than 1.98.
Table 2.5: T-test with Test Value = 0 for Humidity (%) and Temperature
(Celsius)
One-Sample Test
Test Value = 0
5% Confidence Interval of the
Difference
t df Sig. (2-tailed) Mean Difference Lower Upper
% 17.407 99 .000 50.500 44.74 56.26
Celsius 32.324 99 .000 34.29720 32.1919 36.4025
= 29.01149
𝑋𝑋� − 𝜇𝜇 50.50 − 0
𝑇𝑇𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 𝑠𝑠 = 29.01149 = 17.406898
√𝑛𝑛 √100
For 𝑌𝑌� = 34.2972;
= 10.6104
𝑌𝑌� − 𝜇𝜇 34.2972 − 0
𝑇𝑇𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 𝑠𝑠 = 10.6104 = 32.3241
√𝑛𝑛 √100
Decision: The test value is 17.406898 and 32.3241 which is higher than the
critical value, 𝑇𝑇𝑐𝑐 > 1.98. The decision is to reject the null hypothesis.
2.2.7 Question G
Compute the coefficient of Pearson correlation and interpret the meaning.
% Celsius
% Pearson Correlation 1 -.969**
Sig. (2-tailed) .000
N 100 100
Celsius Pearson Correlation -.969** 1
Sig. (2-tailed) .000
N 100 100
**. Correlation is significant at the 0.01 level (2-tailed).
To find r:
Sxy
r=
�Sxx Syy
−29538.96
=
�(83325)(11145.50722)
= -0.9693
2.2.8 Question H
= 0.93954×100%
= 93.954%
3.0 Conclusion
As a conclusion, we managed to analyses 100 data of temperature against humidity.
From the analysis we are able to identify the type of distribution that will be used in various
sampling technique. We also manage to find the solution to the problem involving independent
variable (x) and dependent variable (y). Finally, based on what we learnt in the class, we are
able to analyze all the data by using SPSS software and then compare the value with the manual
calculations.