Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Assignment #1
Name:
SCORE: 50 points
General Instructions
Although I do encourage you to work together both in and outside of class,
remember that collaboration on homework problems should be minimal and
everyone should create their own set of solutions.
For all assignments in this class, please remember that neatness matters! Except
for problems that require lots of hand-calculations, you should generally make your
answers clear and readable. Please do the problems in order, and provide plots /
graphs / equations as needed and within the solution to the problem (DO NOT
include appendices or spam me with Minitab/R output, if either is the case you will
be given a 0 for that portion). Please use complete sentences and/or show work as
necessary. Answers that are not supported by appropriate reasoning will not be
graded.
Additional Important Notes
(a)
(b)
(c)
(d)
(e)
(f)
(g)
(h)
I acknowledge that I am the one whose name listed above, who did this HW
assignment and
( ) no additional help received from other students.
( ) I did receive additional help from the following students (list them
below)
1
2(16),0.84
Mean
17.3
26.8
Standard Deviation
9.1
10.1
Question Three: Suppose a professor gave a 9-point quiz to a small class of five
students. The results of the quiz were 5, 4, 9, 6.5 and 8. For the sake of discussion,
assume that the five students constitute the population.
(a) Find the mean and the standard deviation of the population.
(b) Present the data graphically (use bar graph with attached bars)
(c) Take a sample of size 2 repeatedly and for each sample find the mean.
Question Seven: Let Y1 and Y2 be two random variables with means 1 and 2 and
variances 12 and 22 . Define W 3Y1 2Y2 and V 3Y1 2Y2
(a) Find E(W) and E(V)
(b) Find Var(W) and Var(V)
(c) Show that cov(W, V) = 912 4 22
(d) Now, define G (W V ) , where is a constant, show that the standard
1
2
deviation of ( G ) is 2 2 .
Question Eight: In this small exercise, we asked each ten students from a Stat1001
to collect a random sample of times on how long it took students to get to class from
their homes. All the sample sizes were 16, this means that each students collected
16 samples. The data are summarized in the following table.
Student 1
2
Mean
21 26
Std. Dev. 2.3 1.8
3
23
2.7
4
29
2.4
5
14
3.1
6
24
2.2
7
27
1.9
8
17
2.8
9
24
2.2
10
29
2.1
(a) The students noticed that everyone had different answers. If you randomly
(b)
(c)
(d)
(e)
(f)
sample over and over from any population, with the same sample size, will
the results ever be the same? Explain.
The students wondered whose results were right. How can they find out
what the population mean and standard deviation are?
Input the means into the R and check to see if the distribution is normal.
(Draw a histogram)
Is the distribution of the means a sampling distribution?
Check the sampling error for students 3, 7, and 10.
Compare the standard deviation of the sample of the 10 means. Is that equal
to the standard deviation from student 3 divided by the square of the sample
size? How about for student 7, or 10?
Question Nine: Suppose you want to determine whether the brand of laundry
detergent used and the temperature affects the amount of dirt removed from your
laundry. To do this end, you buy two different brand of detergent (Super and
Best) and choose three different temperature levels (cold, warm, and hot).
Then you divide your laundry randomly into 6n piles of equal size and assign each
n piles into the combination of (Super and Best) and (cold, warm, and hot).
The data are given below
(a) Use R to draw a well-nice side-by-side box plots and clearly explain what you
see in the graph.
(b) Ignore the variable brand of detergent, use an appropriate statistical method
to test if the mean amount of dirt removed is the same between the three
levels of temperature. Be sure you show all of your work. Use alpha = 0.05.
(c) Now ignore the variable temperature, use an appropriate statistical method
to test if the mean amount of dirt removed is the same between the two
levels of brand of detergent. Be sure you show all of your work. Use alpha =
0.05.
(d) Now, you need to explain why you chose the statistical methods to do the
analyses in parts b and c.
Question Ten: Use the following data taken from a sample of 5 people.
minutes exercise/day:
15
80
85
45
50
percent body fat:
14
8
10
22
21
i)
ii)
iii)
Using the data set, identify the response and explanatory variables.
Calculate Pearsons correlation by hand and interpret the result. In
doing so, show your calculations. Use Google to find the formula for the
Pearson correlation.
Use R to draw a scatterplot of the data (with Y on the vertical axis), AND
to obtain Pearson correlation. Turn in the resulting printouts.
17
1 19.5
Y
, X
18.1
1 21.9
17.1
1 18.8
19.8
1 23.4
i. What are the dimensions of Y and of X?
ii. Find the following:
T
(a) X (X-transpose)
T
(b) X X
T
(c) X Y
T
1
(d) ( X X ) (this is just finding the inverse of 2 by 2 matrix found in part
b)
T
T
1
(e) ( X X ) X Y . Call this matrix B matrix.
*Hint: Go back to your linear algebra course or use Google.
iii. Now, use R to find the equation of the least-squares line (Regression
equation). Hint you need to enter the data in R as follows
X=c(15.3, 16.4, 19.5, 21.9, 18.8, 23.4)
Y=c(14.9, 15.8, 17, 18.1, 17.1, 19.8)
Compare this result with the one in part (ii/e)..what can you see?.
Submit both the R code and the output.
The End
6