The statistic project displayed below gave us the opportunity to apply the concepts we have learned during the course such as collecting samples, organizing and analyzing data, and drawing conclusions of the Exhale Study Data. First I started by sorting the Exhale Study Data using the website called StatCrunch using a gender categorical variable of the entire population. Results: Female- 48.55% Male-51.45%
2
Secondly, I took the next four graphs displayed by using two different sampling methods: Systematic and Random Sampling. Each sample displays the results with a pie chart and a pareto chart. 3
The random sample had a sample size of 36/654 and the systematic method had 45/654 while using the online website Random.org. 4 Part three of the assignment by using StatCrunch I was able to compute the population mean, population standard deviation and the five- number summary. A box plot and frequency histogram were used to display the data results.
5 Using two sampling methods: Systematic and Random sampling I was able to compute the sample statistics for each method.
6
The quantitative variable used in the study was the age of the population. The first method used was random sampling, using Random.org sample population 41/654. The second sampling method was systematic sampling with sample population 42/654. The entire population and the systematic graphs are both skewed right, while the simple random shape is skewed left. 7 Here are the total numbers generated of each sample by using StatCrunch: Entire Population Population Sample: 654 Mean: 9.933 Median: 10 Standard Deviation: 2.957 IQR: 4 Min: 3, Q1: 7, Med: 10, Q3: 14, Max: 19 Random Sampling N=41 Mean: 9.756 Median: 10 Standard Deviation: 2.764 IQR: 4 Min: 4, Q1, 8, Med: 10, Q3: 12, Max: 15 Systematic Sampling N=42 Mean: 9.81 Median: 10 Standard Deviation: 3.046 IQR: 3 Min: 5, Q1: 8, Med: 10, Q3: 11, Max: 19
Random Sampling Technique: Random.Org- Using this number generating website I started with 1 to 654 and clicked generate 41 times, receiving the 41 random numbers that I recorded in StatCrunch.
Systematic Sampling Technique: I divided 654/42 and calculated 15.57 so I rounded up to 16. Using random.org I generated a random number from 1 to 16 and from that number I added 16 to every number to generate my data. For the fourth part of the assignment I selected a level of confidence first for the categorical variable for the entire population using each sample. I also calculated the population mean of the quantitative sample as well as the standard deviation. Here is my work shown: 8
9
The meaning of the confidence intervals shows that the population of the gender for the systematic sampling is between .0299<p< .0621 and for the random sampling its between .0327<p< .0728. By moving the decimal over twice to the right the numbers become a percentage i.e. 29.9% < 62.1% / 32.7% < 72.8%. The same goes for the results 10 involving the quantitative samples shown above. These results captured the population parameter as a whole.
Part five I selected a level of confidence by completing a hypothesis test for the population proportion for my categorical variables using gender. Also using samples from the quantitative data I completed a hypothesis test for the population mean using age. Here is my work shown: 11
12 After performing the hypothesis tests, it was interesting to see the different outcomes between the tests for the population proportion and the population mean. My results told me to fail to reject the population proportion while rejecting the null of the population mean. Type l error is to reject the null hypothesis when it is actually true. In this case, type 1 has occurred with the population mean because the mean of the age is around 9 years old, while supporting the claim of the true actual average age. Extra Credit: I took the age and height of the exhale study data and entered into StatCrunch to see if there was a correlation. Result- No correlation because there was no linear correlation and the linear coefficient is smaller than the critical value. Equation of regression: y=.547x + 52.695 Linear Coefficient: r=.1638 Critical Value= .444 r=.1645>C.V. .444
Reflection-The exhale study data project and overall math class has taught me many valuable lessons learned involving statistics. After completing this project it was very satisfying to take a certain data set or subject and be able to fully dissect every possible aspect involving the categorical and quantitative variables involved. The math skills learned in this class will help in other mathematical classes as well as other classes involving research and realistic data sets in any situation. Statistics is a subject that is very useful in everyday life and is a helpful 13 tool for recognizing and analyzing important information. Ill now know and better understand different concepts involving certain statistical information or other interesting surveys.