Introduction:
This term project demonstrates many of the concepts I have learned
over the course of the semester. These include organizing and
representing data in graphs such as pie charts and histograms, and
using confidence intervals and hypothesis testing to draw conclusions
about the data collected. I collected my own data from a 2.17-ounce
bag of Skittles, and 20 other students in my class did the same.
Below is my individual data:
Number of candies   RED   ORANGE   YELLOW   GREEN   PURPLE
My bag              11    13       10       12      10
Class data:
RED   ORANGE   YELLOW   GREEN   PURPLE
18    15       11       9       7
7     8        13       19      16
10    8        18       11      12
12    15       8        12      11
9     16       17       12      5
9     12       21       6       9
12    14       6        13      14
11    13       10       12      10
15    17       8        8       15
10    15       14       10      12
10    13       10       8       17
18    5        19       11      7
7     13       10       18      13
8     10       5        10      16
8     15       12       12      11
16    8        11       19      8
13    13       8        18      8
10    10       12       17      15
7     22       11       9       9
14    16       10       15      7
15    19       4        9       11
The pie charts show that each color occurs with a similar frequency.
In a few bags a color appeared less often than expected, and in
others a color appeared more often than was typical. Overall, though,
most of the bags held roughly equal numbers of each color. My own
data differs slightly from the rest of the class, because some of the
colors in my bag occurred less frequently than the others.
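The comparison between one bag and the whole class can be sketched as a proportion calculation, which is what a pie chart displays. This is only an illustration: the class totals are an assumption, tallied from the class data read as 21 bags with one column per color, and the names `my_bag`, `class_totals`, and `proportions` are hypothetical.

```python
# Counts for the author's own bag, as listed in the individual data.
my_bag = {"red": 11, "orange": 13, "yellow": 10, "green": 12, "purple": 10}

# Class totals per color: an assumption, tallied from the class data
# read column-wise (21 bags).
class_totals = {"red": 239, "orange": 277, "yellow": 238, "green": 258, "purple": 233}


def proportions(counts):
    """Convert raw counts into the fractions a pie chart would display."""
    total = sum(counts.values())
    return {color: n / total for color, n in counts.items()}


my_share = proportions(my_bag)
class_share = proportions(class_totals)

# Print the share of each color side by side for the bag and the class.
for color in my_bag:
    print(f"{color:<7} my bag {my_share[color]:6.1%}   class {class_share[color]:6.1%}")
```

With five colors, perfectly equal shares would be 20% each, so the printed percentages show how far each color sits from that benchmark.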
Summary statistics:
Column          Mean     Std.dev.   Min   Max   Q1   Q3   Median
RedCandies      11.38    3.49       7     18    9    14   10
OrangeCandies   13.234   3.94       6     22    10   15   13
YellowCandies   11.38    4.42       4     21    8    13   11
GreenCandies    12.29    3.93       6     19    9    15   12
PurpleCandies   11.09    3.49       5     17    8    14   11
5 Number Summaries:
Red Candies: MIN: 7; Q1: 9; MEDIAN: 10; Q3: 14; MAX: 18
Orange Candies: MIN: 6; Q1: 10; MEDIAN: 13; Q3: 15; MAX: 22
Yellow Candies: MIN: 4; Q1: 8; MEDIAN: 11; Q3: 13; MAX: 21
Green Candies: MIN: 6; Q1: 9; MEDIAN: 12; Q3: 15; MAX: 19
Purple Candies: MIN: 5; Q1: 8; MEDIAN: 11; Q3: 14; MAX: 17
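As a rough check on the summary statistics above, the green-candy figures can be recomputed with Python's standard library. This is a sketch under assumptions: the list `green` is the green column of the class data read column-wise, and the "inclusive" quartile method is chosen here only because it agrees with the Q1 and Q3 reported above; the report does not say which software or quartile rule was used.

```python
import statistics

# Green-candy counts for the 21 bags (assumed column-wise reading of the class data).
green = [9, 19, 11, 12, 12, 6, 13, 12, 8, 10, 8,
         11, 18, 10, 12, 19, 18, 17, 9, 15, 9]


def summarize(counts):
    """Return the mean, sample standard deviation, and five-number summary."""
    q1, median, q3 = statistics.quantiles(counts, n=4, method="inclusive")
    return {
        "mean": statistics.mean(counts),
        "stdev": statistics.stdev(counts),  # sample (n - 1) standard deviation
        "min": min(counts),
        "q1": q1,
        "median": median,
        "q3": q3,
        "max": max(counts),
    }


summary = summarize(green)
for name, value in summary.items():
    print(f"{name:>6}: {value:.2f}")
```

Other quartile conventions (for example `method="exclusive"`) can give slightly different Q1 and Q3 values for the same data, which is one reason two tools may disagree on a five-number summary.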
The distribution for each color varies, but overall each one is
approximately normal. For every color, some bags held fewer candies
than that color's mean and others held more. I was expecting the
distributions to be less normal, because my own individual data
didn't suggest that the compiled class data would look normal.
Categorical data sorts observations into categories rather than
measuring them with numbers. Skittles grouped by color are an example
of categorical data, since each color can be considered a category.
Quantitative data consists of numerical values, such as counts or
measurements. When the graphs above showed how frequently each color
occurred, they used quantitative data. It would not make sense to
treat the colors themselves as quantitative, because they don't have
a numerical value attached to them.
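The distinction can be illustrated with a small sketch. The data here is hypothetical and chosen only for illustration: color labels can be tallied but not averaged, while counts per bag support arithmetic such as a mean.

```python
from collections import Counter
from statistics import mean

# Categorical: each candy carries a color label; labels can only be tallied.
# Hypothetical handful of candies, for illustration only.
handful = ["red", "green", "red", "purple", "yellow", "green", "red"]
color_counts = Counter(handful)

# Quantitative: counts per bag are numbers, so averaging them is meaningful.
# Hypothetical red-candy counts for three bags.
red_per_bag = [11, 18, 7]
average_red = mean(red_per_bag)

print(color_counts.most_common())  # tally of each color label
print(average_red)                 # mean number of red candies per bag
```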
Hypothesis Tests
The general purpose of hypothesis testing is to test an original claim
against the data that we have collected. If the sample data provide
sufficient evidence against the claim, we reject it; otherwise, we
fail to reject it.
Use a 0.05 significance level to test the hypothesis that 20% of all
Skittles candies are red.
Alternative Hypothesis: p not equal to p(hyp)
Critical z: 1.9600
P-Value: 1.0000
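For reference, a two-tailed one-proportion z test can be sketched as follows. The inputs are assumptions: 239 red candies out of 1245 in total, tallied from the class data read as 21 bags with one column per color. The function name `one_proportion_z_test` is hypothetical. What the sketch prints follows from those assumed totals and is not meant to reproduce the software output quoted above.

```python
from math import sqrt
from statistics import NormalDist


def one_proportion_z_test(successes, n, p0):
    """Two-tailed z test of H0: p = p0 using the normal approximation."""
    p_hat = successes / n
    standard_error = sqrt(p0 * (1 - p0) / n)  # standard error under H0
    z = (p_hat - p0) / standard_error
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value


# Assumed inputs: 239 red candies out of 1245, tallied from the class data.
z, p_value = one_proportion_z_test(successes=239, n=1245, p0=0.20)

# Two-tailed critical value for a 0.05 significance level (about 1.96).
critical_z = NormalDist().inv_cdf(1 - 0.05 / 2)

print(f"z = {z:.4f}, critical z = {critical_z:.4f}, p-value = {p_value:.4f}")
print("reject H0" if abs(z) > critical_z else "fail to reject H0")
```

The decision rule is the same either way: reject the claim when the absolute value of z exceeds the critical value, or equivalently when the P-value falls below the significance level.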
Use a 0.01 significance level to test the claim that the mean number of
candies in a bag of Skittles is 55.
Alternative Hypothesis: µ not equal to µ(hyp)
t Test
Test Statistic, t: 0.0000
Critical t: 2.8453
P-Value: 1.0000
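The t statistic for a claim about a mean can be sketched from the sample directly. The bag totals below are an assumption: they are row sums of the class data read as 21 bags with five colors each. The function name `one_sample_t` is hypothetical. The critical value 2.8453 is the one quoted in the output above. The result depends on those assumed totals and is not meant to reproduce the quoted output.

```python
from math import sqrt
from statistics import mean, stdev


def one_sample_t(sample, mu0):
    """Return the t statistic and degrees of freedom for H0: mean = mu0."""
    n = len(sample)
    standard_error = stdev(sample) / sqrt(n)  # sample std. dev. over sqrt(n)
    return (mean(sample) - mu0) / standard_error, n - 1


# Assumed bag totals: row sums of the class data (21 bags).
bag_totals = [60, 63, 59, 58, 59, 57, 59, 56, 63, 61, 58,
              60, 61, 49, 58, 62, 60, 64, 58, 62, 58]

t, df = one_sample_t(bag_totals, mu0=55)

# Two-tailed critical value for a 0.01 significance level with df = 20,
# taken from the output quoted in the report.
critical_t = 2.8453

print(f"t = {t:.4f}, df = {df}, critical t = {critical_t}")
print("reject H0" if abs(t) > critical_t else "fail to reject H0")
```

A P-value would need the t distribution, which Python's standard library does not provide, so this sketch compares the statistic with the critical value instead.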
This project was challenging, and the short amount of time allowed
for it made it especially difficult to finish. It did help me figure
out which concepts I still struggle with and need to study more, and
it showed me how much I have learned about statistics thus far.
Overall, this project has shown me how to apply statistics to
real-world situations and has given me a better understanding of how
to represent samples, populations, and calculations on data using
various types of graphs.