Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
*All questions are taken from the 2011 Youth Risk Behavior High School Survey, with the exception of
the Paired Sample T-Test in which the Keeping it Real dataset was used.*
Research Question: What percentage of students has never used marijuana?
Steps for Analysis: Use the 2011 Youth Risk Behavior High School survey. The variable of interest is
Q42
Statistics
During your life, how many times have
you used marijuana
N
Valid
Missing
2318
53
Mean
2.09
Median
1.00
Mode
Std. Deviation
1.890
Skewness
1.619
.051
Minimum
Maximum
Interpretation: Looking at the Statistics output we would need to determine which level of
measurement to use. This variable is an Ordinal level of measurement because it cannot be broken
down into even increments. Therefore, the most beneficial level of measurement to use would be the
median. The mode would be used to determine the most frequently occurred response of how many
times an individual has used marijuana.
During your life, how many times have you used marijuana
Cumulative
Frequency
Valid
0 times
Percent
65.1
66.6
66.6
1 or 2 times
185
7.8
8.0
74.5
3 to 9 times
173
7.3
7.5
82.0
10 to 19 times
84
3.5
3.6
85.6
20 to 39 times
75
3.2
3.2
88.9
40 to 99 times
98
4.1
4.2
93.1
160
6.7
6.9
100.0
2318
97.8
100.0
999
39
1.6
System
14
.6
Total
53
2.2
2371
100.0
Total
Total
Valid Percent
1543
Missing
Percent
Interpretation: Observing the table below just tells us the frequency and percents in which tells us how
many students answered the question for each response. However, for this question we would like to
know the percentage of students who have never used marijuana. The answer for this question is
66.6%. The valid percent should be used because it depicts the number of people who actually
answered the question. Whereas, the percent includes those who did and did not answer the question.
Correlations
Research question: Is there a relationship between how old were you when you had your first drink of
alcohol (Q37), and tried marijuana for the first time (Q43)?
Steps for Analysis: Use the 2011 Risk Behavior High School Survey. The variable of interests is Q37 and
Q43.
Correlations
How old were
you when you
drink of alcohol
tried marijuana
sips
Pearson Correlation
.499
**
.000
2339
2300
**
Pearson Correlation
.499
Sig. (2-tailed)
.000
time
2300
2320
Interpretation: Pearsons Correlation value is .499, which tells us that there is a slight or moderate
correlation between the two variables. The closer a variable is to (1) or (-1) depending on if the
relationship is positive or negative will determine if there is a linear relationship. The closer it a
correlation is to zero the shows no relationship. We can also observe the p-value which is displayed as
Sig. (2-tailed) and tell there is a relationship between the two because the p-value is less than .05
(P<.05=a significant relationship.)
Click, Okay
Group 1: 1
Group 2: 2
Interpretation: The first thing to determine is if the variance is the same. To do this we look at Levenes
test and look at the p-value, this will also tell you if there is a difference in the standard deviation or if
they are the same. The p-value for Levenes in this test is .789, which means there is no difference in the
variation and variance is equal. We recall, if the p-value is > or equal to .05 there is no difference. Now,
this means we will look at the top line for the T-Test Equality of Means (because equal variances were
assumed.) Looking at the Sig (2-tailed) the p-value is .000, which is less than .05, which means there is a
significant difference. You can also look at The 95% Confidence Interval of Difference to tell where the
true difference mean is. Because there is no zero between the confidence interval, there is a significant
difference.
Finally, we can go back up and observe the Group Statistics Table. Observing the statistics here, we
can see that on average males are more physically active during the day than females. We know this
because the mean for males was 5.43, whereas the mean for females were 4.30. Even though we
already observed there was a difference from the t-test, this is just another way to observe the
difference in means.
Interpretation: The Paired Sample Statistic box displays what we refer to as descriptive statistics for the
childs age that was reported for time one and two. The first table that will be observed is the Paired
Samples Test, the first thing we observe is the p-value, which is .000, this p-value, is less than .05 which
means there is a statistically significant difference between the two. We could also have used the
confidence interval to determine if there was a statistically significant difference. After observing this
table, when we go up and look at the statistics of the Paired Samples table and we can see that there is
an overall difference between the two means in time one and time two. The childs self-reported age
increased from time one to time two.
Chi-Square
Research Question: Is there a relationship between the sexes (Q2) of students and if they have ever
gotten in trouble with family or friends, missed school, or got into fights, while using alcohol or drugs
(Q52).
Steps for Analysis: Use the 2011 Risk Behavior High School Survey. The variable of interests is Q2 and
Q52. Both variables are nominal.
Missing
Percent
Total
Percent
Percent
2330
98.3%
41
1.7%
2371
100.0%
What is your sex * Have you ever got into trouble with your family or friends, missed school, or got into
fights, while using alcohol or drugs Crosstabulation
Have you ever got into trouble with
your family or friends, missed
school, or got into fights, while
using alcohol or drugs
yes
What is your sex
Female
Count
no
Total
136
1121
1257
44.9%
55.3%
53.9%
167
906
1073
55.1%
44.7%
46.1%
303
2027
2330
100.0%
100.0%
100.0%
Count
% within Have you ever got
into trouble with your family
or friends, missed school, or
got into fights, while using
alcohol or drugs
Total
Count
% within Have you ever got
into trouble with your family
or friends, missed school, or
got into fights, while using
alcohol or drugs
Interpretation: This cross-tabulation table is used to tell us the differences in the groups. If we look at
the column percentages, we can see the response to whether an individual has gotten in trouble male or
female. Whereas, if we observe the row percentages we can see the responses amongst the sex of the
students. We can gather from this chart that 55.1% of males have gotten in trouble within their sex,
whereas only 44.9% of females have gotten in trouble within their sex.
Chi-Square Tests
Asymptotic
Value
Pearson Chi-Square
Continuity Correction
Likelihood Ratio
sided)
sided)
sided)
.001
11.102
.001
11.479
.001
11.518
b
df
Significance (2-
.001
11.513
.000
.001
2330
a. 0 cells (0.0%) have expected count less than 5. The minimum expected count is 139.54.
Interpretation: The Pearson Chi-Square is 11.518. The p-value is .001, which is less than .05. There
is a significant relationship between the the sex of the students and if they have gotten in trouble.
A-NOVA
*An ordinal variable was used for the dependent variable to run this test, whereas you would typically
use a ratio/interval for this question*
Research Question: Are there differences amongst students who take steroid pills or shots without a
doctor's prescription (Q49), and race (rcat).
Steps for Analysis: Use the 2011 Risk Behavior High School Survey. The variable of interests is Q49 and
RCAT. Independent variables are the races. In addition, the dependent variable is the race.
Interpretation: The descriptive table gives all of the descriptive statistics within the race. The first step
of looking at this set of tables would be to observe the p-value of the A-NOVA test. Observing the pvalue of the A-NOVA, we see that it is .209, which means that it is more than .05, meaning there is no
statistical difference between races and if a student has taken steroid pills or shots without a doctors
prescription. This is where we would stop for this particular test.
Interpretation: Due to their being no difference within the variables that I chose, I cannot go any further
on the test. However, lets say my p-value was less than .05. The next step would have been to look at
the Levenes test and to look at the p-value to see if the variances were equal or not equal. If the p-value
is less than .05 then this would mean the variance is not equal and we would use Tamhanes to observe
the differences within race. If the p-value was more than .05 then we could assume the variances are
equal and use Tukeys to observe the differences with race. Once it is decided to use Tukeys or
Tamhanes, we would look at the multiple comparisons table within the mean difference and standard
of error to tell us where differences lie. Alternatively, we could observe the 95% Confidence Interval to
tell us where the difference lies, and we would compare each race to one another. Following this step,
we would look at the p-value (sig.) and see which groups had statistical significance. After noting the
differences within this table, we would go back up to descriptives and determine which category has the
lower mean and which has the higher mean which would help determine where the big difference is
within race.