Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
13U00324
Case 1
10)
175
150
Clients
125
100
75
50
25000 27500 30000 32500 35000
Stamps
Th plot above displays that there is a somewhat moderate positive relationship between the two
variables.
175
150
Clients
125
100
75
50
100 105 110 115 120 125
Index
Urooj Qaiser
13U00324
Th plot above displays that there is a somewhat moderate positive relationship between the two
variables.
11)
Pearson correlation of Clients and Index = 0.604
P-Value = 0.000
There is a significant relationship between number of clients and index as the p value is less than
0.05. The relationship is moderately positive as depicted by the Pearson correlation which shows a
value of 0.604.
There is a significant relationship between number of clients and stamps as the p value is less than
0.05. The relationship is moderately positive as depicted by the Pearson correlation which shows a
value of 0.431.
12)
The normal probability plot displays that errors are normally distributed since blue points lie on the
red line. This result is also reinforced by the shape of the histogram. The versus fit diagram depicts
Urooj Qaiser
13U00324
that there is a problem of heteroscedasticity since the errors points are not randomly distributed
and are clustered. The versus order graph explains that since errors are evenly distributed around
the zero line so the errors are independently distributed. The Durbin Watson statistic of 1.75062 is
also close to 2 indicating that there is no problem of serial correlation.
Analysis of Variance
Ho: B1= 0
Since the p value is less than 0.05 so we fail to accept H0 and conclude that at least one of the
regression coefficient is significant. We can further explore it by the individual t-test of regression
coefficients.
Regression Equation
S R-sq R-sq(adj)
20.9152 36.49% 35.11%
The coefficient index is a significant predictor of the number of clients seen, since the p value is less
than 0.05. There is also no problem of multicollinearity since value of VIF is less than 10.
The index explains 36.49 percent of variation of the number of new clients seen.
FORECAST
The normal probability plot displays that errors are normally distributed since blue points lie on the
red line. This result is also reinforced by the shape of the histogram. The versus fit diagram depicts
that there is no problem of heteroscedasticity since the errors points are randomly distributed and
are not clustered. The versus order graph explains that since errors are evenly distributed around
the zero line so the errors are independently distributed. The Durbin Watson statistic of 1.84859 is
also close to 2 indicating that there is no problem of serial correlation
Analysis of Variance
Ho: B1= 0
Since the p value is less than 0.05 so we fail to accept H0 and conclude that at least one of the
regression coefficient is significant. We can further explore it by the individual t-test of regression
coefficients.
Regression Equation
The coefficient stamps is a significant predictor of the number of clients seen, since the p value is
less than 0.05. There is also no problem of multicollinearity since value of VIF is less than 10.
The stamps explain 18.60 percent of variation of the number of new clients seen.
14)
15) Business activity index is a good predictor of number of clients since it explains 36. 49
percent of variation of the number of new clients seen. Whereas, stamps only explain 18.60 percent
of variation of the number of new clients seen.
Q16)
Analysis of Variance
Source DF SS MS F P
Regression 4 17435.7 4358.9 13.16 0.000
Residual Error 43 14247.5 331.3
Total 47 31683.2
Urooj Qaiser
13U00324
Ho: B1=B2=B3=B4=0
Since the p value is less than 0.05 so we fail to accept H0 and conclude that at least one of the
regression coefficient is significant. We can further explore it by the individual t-test of regression
coefficients.
Since the variables stamps and permits do not impact the number of clients seen so we drop these
coefficients and re run the model.
50 0
10 -20
1 -40
-50 -25 0 25 50 100 120 140 160 180
Residual Fitted Value
20
Frequency
9
Residual
6 0
3 -20
0 -40
-40 -20 0 20 40 1 5 10 15 20 25 30 35 40 45
Residual Observation Order
Urooj Qaiser
13U00324
The normal probability plot displays that errors are normally distributed since red points lie on the
blue line. This result is also reinforced by the shape of the histogram. The versus fit diagram depicts
that there is no problem of heteroscedasticity since the errors points are randomly distributed and
are not clustered. The versus order graph explains that since errors are evenly distributed around
the zero line so the errors are independently distributed. The Durbin Watson statistic is also close to
2 indicating that there is no problem of serial correlation.
Analysis of Variance
Source DF SS MS F P
Regression 2 15123.7 7561.8 20.55 0.000
Residual Error 45 16559.6 368.0
Total 47 31683.2
Ho: B1=B2= 0
Since the p value is less than 0.05 so we fail to accept H0 and conclude that at least one of the
regression coefficient is significant. We can further explore it by the individual t-test of regression
coefficients.
Both the coefficient index and bankruptcies are significant predictors of the number of clients seen,
since the p value is less than 0.05. There is also no problem of multicollinearity since value of VIF for
Urooj Qaiser
13U00324
both variables is less than 10. However the BAI is a better significant predictor of number of clients
seen compared to bankruptcies.
The two significant predictors explain 45.4 percent of variation of the number of new clients seen.
Q17)
Analysis of Variance
Source DF SS MS F P
Regression 12 61203.6 5100.3 16.10 0.000
Residual Error 86 27249.2 316.9
Total 98 88452.7
Ho: B1=B2=B3=B4=B5=B6=B7=B8=B9=B10=B11=B12=0
Since the p value is less than 0.05 so we fail to accept H0 and conclude that at least one of the
regression coefficient is significant. We can further explore it by the individual t-test of regression
coefficients.
90 20
Residual
Percent
50 0
10
-20
1
0.1 -40
-50 -25 0 25 50 100 125 150 175 200
Residual Fitted Value
12 20
Frequency
Residual
8 0
4 -20
0 -40
-30 -20 -10 0 10 20 30 40 1 10 20 30 40 50 60 70 80 90
Residual Observation Order
The normal probability plot displays that errors are normally distributed since red points lie on the
blue line. This result is also reinforced by the shape of the histogram. The versus fit diagram depicts
that there is no problem of heteroscedasticity since the errors points are randomly distributed and
are not clustered. The versus order graph explains that since errors are evenly distributed around
the zero line so the errors are independently distributed. The Durbin Watson statistic is also close to
2 indicating that there is no problem of serial correlation.
Analysis of Variance
Source DF SS MS F P
Regression 5 59989 11998 39.20 0.000
Urooj Qaiser
13U00324
Residual Error 93 28464 306
Total 98 88453
Ho: B1=B2=B3=B4=B5= 0
Since the p value is less than 0.05 so we fail to accept H0 and conclude that at least one of the
regression coefficient is significant. We can further explore it by the individual t-test of regression
coefficients.
The final regression equation is
Clients = - 179 + 2.70 Index + 25.2 S1 + 17.5 S2 + 25.2 S3 + 13.0 S10
The coefficient Index, S1, S2, S3 and S10 are significant predictors of the number of clients seen,
since the p value is less than 0.05. There is also no problem of multicollinearity since value of VIF for
all variables is less than 10. However, Index, S1 and S4 are better significant predictor of number of
clients seen compared to bankruptcies.
The significant predictors explain 66.1 percent of variation of the number of new clients seen. This
indicates that seasonal variation does exist and explains the variation of the dependent variable.
Urooj Qaiser
13U00324
18)
Since the actual values are very different from the forecasted values the model is not very accurate.