
1.0 Introduction
1.1 Project Background
Mathematics, numbers and calculations have become an important part of our
lives. They are used by almost everyone every day, especially by engineers who work
with statistics. In past decades and centuries, people used a variety of manual methods
to carry out calculations. This has changed greatly in the present era of science and
technology. As software has been developed and introduced, people have started using
it in their daily lives. One powerful package for statistical calculation is SPSS, which
is very popular among engineers who work with statistics around the globe. SPSS
software is easy to obtain and user friendly.

This software is widely used by engineers for statistical work because it
performs calculations quickly and supports a wide variety of statistical procedures.
With SPSS, you can predict future values from tabulated data, which helps the user
to plan more systematically and avoid unwanted events. According to mathematical
experts, this software can be used for managing data, and it is built around the SPSS
programming language. Moreover, it is easy to learn: a beginner can learn to use the
software correctly in a short period of time.
For this course, each group is required to collect 100 data points in any field of
Electrical and Electronics Engineering, consisting of an independent variable (x) and a
dependent variable (y). We then use SPSS to analyze the data collected. Our group
chose the topic Factors That Affect the Resistors. An analysis is conducted to make
sure the aim of this project is achieved.

1.2 Objectives
1. To study the type of distribution that will be used in various sampling techniques.
2. To find the solution to the problem involving independent variable (x) and
dependent variable (y).
3. To analyze all the data by using SPSS software.

2.0 Factors That Affect the Resistors


2.1 Question
An investigation was conducted to study the relationship between the humidity
affected by the resistor's heat and the heat produced by the resistor. The resistor was
placed in a box measuring 5 cm x 5 cm x 5 cm at room temperature. In the analysis
that follows, humidity (%) is treated as the independent variable (x) and temperature
(Celsius) as the dependent variable (y). A total of 100 measurements were taken, and
the tabulated data are given below.

Table 2.1: Data On Humidity and Temperature


Humidity (%) Temperature (Celsius)

1 53.3
2 52.05
3 50.75
4 50.14
5 49.79
6 49.4
7 48.81
8 48.64
9 47.97
10 46.73
11 46.26
12 45.96
13 45.13
14 44.77
15 44.51
16 44.47
17 44.36
18 44.01
19 43.76
20 43.4
21 43.1
22 42.89
23 42.6
24 42.39
25 42.16
26 41.91
27 41.59
28 41.39
29 41.19
30 40.9
31 40.57
32 40.37
33 40.16
34 39.96
35 39.89
36 39.55
37 39.2
38 38.96
39 38.82
40 38.61
41 38.35
42 38.21
43 37.82
44 37.54
45 37.29
46 37.16
47 36.95
48 36.54
49 36.31
50 36.12
51 36.05
52 35.64
53 35.4
54 35.18
55 34.88
56 34.49
57 34.26
58 33.89
59 33.38
60 33.26
61 33.08
62 32.73
63 32.42
64 32.09
65 31.78
66 31.66
67 31.6
68 31.29
69 31.11
70 30.76
71 30.51
72 30.31
73 29.99
74 29.77
75 29.54
76 29.26
77 29.1
78 28.84
79 28.44
80 28.12
81 27.82
82 22.76
83 22.37
84 22.34
85 17.53
86 17.23
87 16.99
88 16.66
89 16.27
90 15.88
91 15.63
92 15.46
93 15.08
94 14.57
95 14.27
96 14.03
97 13.81
98 13.41
99 13.29
100 12.78
2.2 Analysis of data

2.2.1 Question A

State the mean, standard deviation, minimum and maximum value for the
independent and dependent variables.

Table 2.2: Mean, Standard Deviation, Minimum and Maximum Value of the Independent and Dependent Variables

Descriptive Statistics

                     N     Minimum   Maximum   Mean      Std. Deviation
%                    100   1         100       50.50     29.011
Celsius              100   12.78     53.30     34.2972   10.61044
Valid N (listwise)   100

Mean for Y:

$$\bar{Y} = \frac{\sum Y}{n} = \frac{53.30 + 52.05 + 50.75 + \cdots + 13.41 + 13.29 + 12.78}{100} = \frac{3429.72}{100} = 34.2972$$

Variance for Y:

$$\sum Y^2 = 53.30^2 + 52.05^2 + 50.75^2 + \cdots + 13.41^2 + 13.29^2 + 12.78^2 = 128775.3$$

$$\frac{(\sum Y)^2}{n} = \frac{3429.72^2}{100} = 117629.7928$$

$$S^2 = \frac{\sum Y^2 - (\sum Y)^2 / n}{n - 1} = \frac{128775.3 - 117629.7928}{100 - 1} = 112.580881$$

Standard Deviation for Y (Celsius):

$$S = \sqrt{S^2} = \sqrt{112.580881} = 10.6104$$

Minimum value Y = 12.78

Maximum value Y = 53.30

Mean for X:

$$\bar{X} = \frac{\sum X}{n} = \frac{1 + 2 + 3 + \cdots + 98 + 99 + 100}{100} = \frac{5050}{100} = 50.50$$

Variance for X:

$$\sum X^2 = 1^2 + 2^2 + 3^2 + \cdots + 98^2 + 99^2 + 100^2 = 338350$$

$$\frac{(\sum X)^2}{n} = \frac{5050^2}{100} = 255025$$

$$S^2 = \frac{\sum X^2 - (\sum X)^2 / n}{n - 1} = \frac{338350 - 255025}{100 - 1} = 841.6667$$

Standard Deviation for X (% humidity):

$$S = \sqrt{S^2} = \sqrt{841.6667} = 29.0115$$

Minimum value X = 1

Maximum value X = 100
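
The manual calculations above can be cross-checked with a short script. This is a minimal sketch in Python (not part of the SPSS workflow used in this report); the function name `sample_std_from_sums` is illustrative, and the temperature sums are the rounded values quoted in the text rather than values recomputed from the raw data.

```python
import math


def sample_std_from_sums(n, total, total_sq):
    """Sample standard deviation from n, the sum of values and the sum of squares."""
    variance = (total_sq - total ** 2 / n) / (n - 1)
    return math.sqrt(variance)


# Humidity (x) is simply 1, 2, ..., 100, so its sums can be regenerated directly.
x = list(range(1, 101))
n = len(x)
sum_x = sum(x)
sum_x2 = sum(v * v for v in x)
mean_x = sum_x / n
std_x = sample_std_from_sums(n, sum_x, sum_x2)

# Temperature (y): sums as quoted in the text (rounded by the authors).
sum_y = 3429.72
sum_y2 = 128775.3
mean_y = sum_y / n
std_y = sample_std_from_sums(n, sum_y, sum_y2)

print(f"x: mean = {mean_x:.2f}, s = {std_x:.4f}")
print(f"y: mean = {mean_y:.4f}, s = {std_y:.4f}")
```

The results should agree with the SPSS values in Table 2.2 to about three decimal places; small differences are expected because the sum of squares for y is rounded.
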

2.2.2 Question B
Sketch a scatter diagram and regression line for the data.

Figure 2.1: Scatter diagram


Figure 2.2: Regression line

2.2.3 Question C
Assuming a linear relationship, use the least squares method to find the
simple linear regression model.

Table 2.3: Standardized and Unstandardized Coefficients on Dependent Variable

Coefficients(a)

                 Unstandardized Coefficients   Standardized Coefficients
Model            B          Std. Error         Beta                        t         Sig.
1  (Constant)    52.200     .528                                           98.782    .000
   %             -.355      .009               -.969                       -39.022   .000

a. Dependent Variable: Celsius

Verified by using the least squares method as shown below.

Based on the collected data, using a calculator and Excel, we get:

$$n = 100,\quad \sum x = 5050,\quad \sum x^2 = 338350,\quad \sum y = 3429.72,\quad \sum y^2 = 128775.3,\quad \sum xy = 143661.9$$

To find $S_{xy}$:

$$S_{xy} = \sum xy - \frac{\sum x \sum y}{n} = 143661.9 - \frac{(5050)(3429.72)}{100} = -29538.96$$

To find $S_{xx}$ and $S_{yy}$:

$$S_{xx} = \sum x^2 - \frac{(\sum x)^2}{n} = 338350 - \frac{(5050)^2}{100} = 83325$$

$$S_{yy} = \sum y^2 - \frac{(\sum y)^2}{n} = 128775.3 - \frac{(3429.72)^2}{100} = 11145.50722$$

To find $\beta$ (also written $\beta_1$):

$$\beta = \frac{S_{xy}}{S_{xx}} = \frac{-29538.96}{83325} = -0.3545$$

Let $\bar{x} = 50.50$ and $\bar{y} = 34.2972$ from question (A).

To find $\alpha$ (also written $\beta_0$):

$$\alpha = \bar{y} - \beta\bar{x} = 34.2972 - (-0.3545)(50.50) = 52.19945$$

Thus, the equation of the least squares line is:

$$y = \alpha + \beta x$$

$$y = 52.19945 - 0.3545x$$
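
The least squares steps above can be sketched in Python as a cross-check. The function name `least_squares_from_sums` is hypothetical, and the inputs are the sums quoted in the text; the script keeps the slope unrounded, so the intercept may differ from the hand value in the fourth decimal place.

```python
def least_squares_from_sums(n, sum_x, sum_y, sum_x2, sum_xy):
    """Return (intercept, slope) of the least squares line from summary sums."""
    s_xx = sum_x2 - sum_x ** 2 / n
    s_xy = sum_xy - sum_x * sum_y / n
    slope = s_xy / s_xx
    intercept = sum_y / n - slope * (sum_x / n)
    return intercept, slope


# Sums as quoted in the text for the 100 humidity/temperature pairs.
intercept, slope = least_squares_from_sums(
    n=100, sum_x=5050, sum_y=3429.72, sum_x2=338350, sum_xy=143661.9
)

print(f"y = {intercept:.4f} + ({slope:.4f})x")
```
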
2.2.4 Question D
Check the significance (α = 0.05) of the simple linear regression model in
question C; y = 52.19945 − 0.3545x.

The SPSS output is shown below:

Test Statistic (T-test):

Table 2.4: Standardized and Unstandardized Coefficients on Dependent Variable with 95.0% Confidence Interval for B

Coefficients(a)

                 Unstandardized         Standardized                       95.0% Confidence
                 Coefficients           Coefficients                       Interval for B
Model            B        Std. Error    Beta           t         Sig.     Lower Bound   Upper Bound
1  (Constant)    52.200   .528                         98.782    .000     51.151        53.248
   %             -.355    .009          -.969          -39.022   .000     -.373         -.336

a. Dependent Variable: Celsius
b. t = -39.022, and p-value = 0.000

The p-value is reported as .000 (that is, p < 0.001), which is less than
α = 0.05, so the simple linear regression model is significant.
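
The t statistic for the slope can be reproduced approximately from the sums in question C. The sketch below is in Python rather than SPSS and uses only the rounded values printed in this report. The critical value 1.98 is the figure quoted in question F; treating it as adequate for n − 2 = 98 degrees of freedom is an assumption, and the variable names are illustrative.

```python
import math

n = 100
s_xx = 83325.0        # from question C
s_yy = 11145.50722    # from question C
s_xy = -29538.96      # from question C

slope = s_xy / s_xx
sse = s_yy - slope * s_xy               # residual sum of squares
residual_var = sse / (n - 2)            # two parameters estimated
std_error_estimate = math.sqrt(residual_var)
se_slope = math.sqrt(residual_var / s_xx)
t_stat = slope / se_slope

T_CRITICAL = 1.98  # assumed two-sided 5% critical value (quoted in question F)
significant = abs(t_stat) > T_CRITICAL

print(f"SE(slope) = {se_slope:.4f}, t = {t_stat:.3f}, significant = {significant}")
```

The standard error and t value should land close to the .009 and -39.022 reported in Table 2.4; exact agreement is not expected because the input sums are rounded.
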

2.2.5 Question E
Interpret the meaning of the intercept and slope in this problem. Please
comment on any impossible event.

Equation: y = 52.19945 − 0.3545x

Let x = 0:      y = 52.19945 − 0.3545(0) = 52.19945

Let x = 100:    y = 52.19945 − 0.3545(100) = 16.74945

Intercept, α: When the humidity is 0%, the predicted temperature is 52.19945°C.
When the humidity is 100%, the predicted temperature is 16.74945°C. Note that
x = 0 lies just outside the measured range of 1 to 100, so the intercept is an
extrapolation.

Slope, β: For each 1% increase in humidity, the temperature decreases by
0.3545°C on average.

2.2.6 Question F

Test the hypothesis H0: β1 = 0 against H1: β1 ≠ 0 at the 5% level of
significance.

$$H_0 : \beta_1 = 0$$

$$H_1 : \beta_1 \neq 0$$

$$\alpha = 0.05,\quad \alpha/2 = 0.025,\quad v = n - 1 = 100 - 1 = 99$$

$$T_c = T_{\alpha/2,\,v} = T_{0.025,\,99} = 1.98$$

Reject $H_0$ when $|T_{test}|$ is more than 1.98.

Table 2.5: T-test with Test Value = 0 for Humidity (%) and Temperature (Celsius)

One-Sample Test (Test Value = 0)

                                                                 95% Confidence Interval
                                                                 of the Difference
           t        df   Sig. (2-tailed)   Mean Difference      Lower      Upper
%          17.407   99   .000              50.500               44.74      56.26
Celsius    32.324   99   .000              34.29720             32.1919    36.4025

For $\bar{X} = 50.50$:

$$s = \sqrt{\frac{\sum (X_i - \bar{X})^2}{n - 1}} = \sqrt{\frac{(1 - 50.50)^2 + (2 - 50.50)^2 + \cdots + (99 - 50.50)^2 + (100 - 50.50)^2}{100 - 1}} = 29.01149$$

$$T_{test} = \frac{\bar{X} - \mu}{s / \sqrt{n}} = \frac{50.50 - 0}{29.01149 / \sqrt{100}} = 17.406898$$

For $\bar{Y} = 34.2972$:

$$s = \sqrt{\frac{\sum (Y_i - \bar{Y})^2}{n - 1}} = \sqrt{\frac{(53.30 - 34.2972)^2 + (52.05 - 34.2972)^2 + \cdots + (12.78 - 34.2972)^2}{100 - 1}} = 10.6104$$

$$T_{test} = \frac{\bar{Y} - \mu}{s / \sqrt{n}} = \frac{34.2972 - 0}{10.6104 / \sqrt{100}} = 32.3241$$

Decision: The test values, 17.406898 and 32.3241, are both higher than the
critical value, Tc = 1.98. The slope statistic reported in Table 2.4,
|t| = 39.022, also exceeds it. The decision is to reject the null hypothesis.

Hypothesis H1: β1 ≠ 0 is accepted.
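
The one-sample test statistics above follow a single formula, which can be sketched in Python as below. The helper name `one_sample_t` is hypothetical; the means and standard deviations are the values quoted in this report, and the critical value 1.98 is taken from the text rather than computed.

```python
import math


def one_sample_t(mean, std, n, mu0=0.0):
    """t statistic for testing a sample mean against the value mu0."""
    return (mean - mu0) / (std / math.sqrt(n))


T_CRITICAL = 1.98  # two-sided 5% critical value quoted in the text

t_x = one_sample_t(mean=50.50, std=29.01149, n=100)    # humidity (%)
t_y = one_sample_t(mean=34.2972, std=10.6104, n=100)   # temperature (Celsius)

for name, t in (("%", t_x), ("Celsius", t_y)):
    print(f"{name}: t = {t:.4f}, reject H0 = {abs(t) > T_CRITICAL}")
```
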

2.2.7 Question G
Compute the coefficient of Pearson correlation and interpret the meaning.

Table 2.6: Pearson Correlation

Correlations

                                    %          Celsius
%         Pearson Correlation       1          -.969**
          Sig. (2-tailed)                      .000
          N                         100        100
Celsius   Pearson Correlation       -.969**    1
          Sig. (2-tailed)           .000
          N                         100        100

**. Correlation is significant at the 0.01 level (2-tailed).

Verified by manual calculation as shown below.

From question (C):

$$\sum y^2 = 128775.3,\quad \sum y = 3429.72,\quad S_{xy} = -29538.96,\quad S_{yy} = 11145.50722,\quad S_{xx} = 83325$$

To find r:

$$r = \frac{S_{xy}}{\sqrt{S_{xx} S_{yy}}} = \frac{-29538.96}{\sqrt{(83325)(11145.50722)}} = -0.9693$$

The Pearson correlation coefficient lies in the range −1 ≤ r ≤ 1. A value in the
range 0.5 ≤ r ≤ 1 indicates a strong positive linear relationship, and a value in
the range −1 ≤ r ≤ −0.5 indicates a strong negative linear relationship. Thus,
r = −0.9693 indicates a strong negative linear relationship between humidity and
temperature.
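
As a quick check, the correlation and its interpretation can be sketched in Python using the quantities from question (C). The function names are illustrative, and the label returned for values between −0.5 and 0.5 is an assumption, since the report only defines the two "strong" ranges.

```python
import math


def pearson_r(s_xy, s_xx, s_yy):
    """Pearson correlation coefficient from the corrected sums of squares."""
    return s_xy / math.sqrt(s_xx * s_yy)


def describe(r):
    """Classify r using the ranges stated in the report."""
    if r >= 0.5:
        return "strong positive linear relationship"
    if r <= -0.5:
        return "strong negative linear relationship"
    return "no strong linear relationship"  # assumed label, not defined in the report


r = pearson_r(s_xy=-29538.96, s_xx=83325, s_yy=11145.50722)
print(f"r = {r:.4f}: {describe(r)}")
```
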

2.2.8 Question H

Compute the coefficient of determination and interpret the meaning.

Table 2.7: Determination

Model Summary

Model   R         R Square   Adjusted R Square   Std. Error of the Estimate
1       .969(a)   .940       .939                2.62238

a. Predictors: (Constant), %

The coefficient of determination, r², is equal to 0.940 from the model summary
of the SPSS output shown in Table 2.7.

Verified by manual calculation as shown below.

Let r = −0.9693 from question (G),

r² = (−0.9693)² = 0.93954

   = 0.93954 × 100%

   = 93.954%

Thus, about 93.954% of the variation in y (temperature, Celsius) is explained by
x (humidity, %) through the regression line.
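
The R Square and Adjusted R Square entries of the model summary can also be approximated from the sums in question (C). This Python sketch assumes the standard adjusted R² formula with one predictor; that formula is not stated in the report, and the variable names are illustrative.

```python
n = 100
num_predictors = 1    # humidity (%) is the only predictor
s_xx = 83325.0        # from question C
s_yy = 11145.50722    # from question C
s_xy = -29538.96      # from question C

# Explained share of the total variation in y.
r_squared = s_xy ** 2 / (s_xx * s_yy)

# Adjusted R^2 penalises the number of predictors (standard formula, assumed here).
adjusted_r_squared = 1 - (1 - r_squared) * (n - 1) / (n - num_predictors - 1)

print(f"R^2 = {r_squared:.4f} ({r_squared * 100:.2f}% of variation explained)")
print(f"Adjusted R^2 = {adjusted_r_squared:.4f}")
```

Both values should round to the .940 and .939 shown in Table 2.7.
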

3.0 Conclusion
In conclusion, we managed to analyze 100 data points of temperature against humidity.
From the analysis we were able to identify the type of distribution used in various
sampling techniques. We also managed to find the solution to the problem involving the
independent variable (x) and the dependent variable (y). Finally, based on what we learnt in
class, we were able to analyze all the data using SPSS software and compare the results with
the manual calculations.
