Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Assigned By:
PhD, Professor
Dept. of BBA, North South University
Prepared By:
Letter of Transmittal
M.A. Matin
PhD, Professor
Department of Management
Dear sir,
We are pleased to submit the report on data analysis of applied statistics course.
We have tried our level best to come up with the expectation of you, therefore we pray and hope
that you would consider the mistakes if it has.
Sincerely,
Salina Akter
Rifat Mahmud
Sadia Afrin Mim
Md. Imran Khan
Introduction
A simple regression model includes one independent and one dependent variable. But usually a
dependent variable is affected by more than one independent variable. So when a regression
model that includes two or more independent variables it will call a multiple regression model.
Our project is about to monthly auto insurance, driving experience, number of driving violations,
and age of drivers in a company. Here driving experience, number of violation and age are
independent variable and monthly auto insurance is the dependent variable.
As there is more than one independent variables in this project, we will use multiple regression
model for the project.
Multiple Regression
The research conducted in this paper focuses mainly on the schemes of multiple regressions
where the validity and variability of the independent variables are driving experience. The test
aims to figure out driver’s preference towards a particular violation and age where the dependent
variable .
Dependent Variable
Independent Variables
Driving Experience
Number of Driving violations
Age
Where,
Model Summary
The values of the standard deviation of errors, the coefficient of multiple determinations, are
given in the MINITAB solution which is given below:
Interpretation of Se
Standard Deviation of Errors, Se =14.4263 tells us that the average difference of the sample
mean from the population mean is about 14.4263.
Interpretation of R2
We are 87.35% confident that the total variation of beginning salary can be explained by the use
of regression model.
ANOVA
Findings:
The ANOVA table gives the result of F-test. F-test measures the joint
significance of the independent variables whether the independent variables
taken together would influence the dependent variable or not.
Our SSR (Regression Sum of Squares) is 20124 while SSE (Error Sum of
Squares) is 2914, giving a SST (Total Sum of Squares) of 23038. The F-ratio
calculated by Minitab is 32.23.
If the calculated F-statistic is greater than the critical F-statistic, then the test is
significant.
From the t distribution table, the value of t for .025 t distribution curve is 1.985
95% Confidence Interval for B1: (b1 ± t × Sb1) = -8.50 ± 1.985 × 2.04
= -8.50 ± 4.0494
= -4.4506 to -12.5494
We are 95% confident that for driving experience, the monthly auto insurance changes by an
amount between, $-4.4506 to $-12.5494.
= 12.41 ± 5.5183
= 17.9283 to 6.8917
We are 95% confidence that for a change in the num. of violation, the monthly auto insurance
changes by an amount between $17.9283 to $6.8917
95% Confidence Interval for B3: (b3 ± t ×Sb3) = 3.80 ± 1.985 × 1.17
= 3.80 ± 2.32245
= 6.12245 to 1.47755
We are 95% confidence that for a change the age , the monthly auto insurance changes by an
amount between $6.12245 to $1.47755
The important factors for determining the monthly auto insurance of the drivers are:
1. Driving experience
2. Num. of violation
3. Age
By using this equation, we found out the monthly auto insurance of drivers of different violation,
groups, experience, and age based on their insurance.
We know, the p-value for each term tests the null hypothesis that the coefficient is equal to zero
(no effect). A low p-value (< 0.05) indicates that you can reject the null hypothesis.
The p-value for the driving experience is 0.001. So, we reject the null hypothesis.
The p-value for the num. of violations is 0.001. So, we reject the null hypothesis.
The p-value for the age is 0.006. So, we do not reject the null hypothesis.
We can conclude that regression has no impact in on the monthly auto insurance.
Regression Analysis: Monthly auto ins versus Driving experience, Num. of driving ,
Age
Analysis of Variance
Model Summary
Coefficients
Regression Equation
Monthly
auto
Obs insurance Fit Resid Std Resid
14 120.00 145.43 -25.43 -2.04 R
R Large residual