Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
1 / 19
2 / 19
Density Curves
A density curve may be used to display the distribution of the data in addition to or instead of a histogram. We can consider a density curve as a smooth approximation to the histogram computed from the data. For continuous response variables, the histogram computed from the data (sample), approximates the (unknown) population density of the response variable.
3 / 19
= 0.25
x = 0.2556
= 0.144338
s = 0.144446
5 / 19
where e = 2.71828...
(1)
6 / 19
7 / 19
Normal Approximation:
8 / 19
THE 68-95-99.7 RULE for the NORMAL CURVE Approximately 68% of observations fall within 1 standard unit of 0 (1 < z < 1). Approximately 95% of observations fall within 2 standard units of 0 (2 < z < 2). Approximately 99.7% of observations fall within 3 standard unit of 0 (3 < z < 3).
The Empirical Rule, which is applicable to bell-shaped normal-like histograms, is the direct consequence of the above property of the normal curve. The range 1 < z < 1 in standard units correspond to x s < x < x + s in the original, nonstandard units.
Dr. Joseph Brennan (Math 148, BU) Chapter 5 - The Normal Curve 9 / 19
Figure : Normal curve and percentage of observations under it. Horizontal scale uses the standard units z.
Dr. Joseph Brennan (Math 148, BU) Chapter 5 - The Normal Curve 10 / 19
z-Scores z-Score:
The transformation of data into standard units, normal approximation: observation mean z= standard deviation Thus, any data point x may be recomputed in standard units as x x zx = . s We call the z which corresponds to x the z-score zx . Note that zx < 0 if x < x ; zx = 0 if x = x ; zx > 0 if x > x . We may reverse the transformation; if zx is known, x can be found by x = x + s zx .
Dr. Joseph Brennan (Math 148, BU) Chapter 5 - The Normal Curve
(2)
(3)
11 / 19
z-Scores
zx =
x x s
The z - score indicates the number of standard deviations away a data point falls above or below the average x . If the histogram plotted against the z - scores follows the normal curve well, we say that the normal distribution provides a good approximation for the distribution of the data. The normal curve is well studied and many of its values have been stored in normal tables. Data that is found to have a good normal approximation can be correlated with the normal curve.
12 / 19
Normal Table
A normal table which we will use gives the area between z and z:
14 / 19
Example 8, p.85
The heights of the men age 18 and over in HANES5 averaged 69 inches; the SD was 3 inches. Use the normal curve to estimate the percentage of these men with heights between 63 inches and 72 inches. Solution: The exact percentage is equal to the area under the height histogram
between 63 inches and 72 inches. We assume that the histogram can be well approximated by the normal curve. We will estimate the percentage of men between 63 and 72 inches by nding the area of the corresponding region under the standard normal curve.
15 / 19
Example 8, p.85
Step 2: Mark the mean on the line and convert to standard units. The z - score for the left endpoint is z63 = 63 69 x x = = 2. s 3
Step 3: Sketch the normal curve and nd the area under the curve above the shaded interval by using normal tables.
16 / 19
Example 8, p.85
Conclusion: From our table of z-scores, z63 = 2 is the 2.28% and z72 = 1 is the 84.13%. Therefore, about 82% of the heights were between 63 inches and 72 inches. This is only an approximation, though, in truth, 81% of the men were in that range.
17 / 19
Example (S.A.T.)
The SAT is a test for readiness of students for college. The average SAT score (on a 1600 point scale) is 1025 points and the standard deviation is 200 points. How well must Jessica do on the SAT in order to place in the top 10% of all students?
Solution: The problem does not say that the histogram of the SAT scores is bell-shaped, but it is reasonable to assume so. We will use the normal approximation to the distribution of the SAT scores to solve the problem.
18 / 19
Example (S.A.T.)
Using the normal table provided in the textbook, Jessica is hoping for a score that translates to z 1.3. We know x = 1025 and s = 200. x x x = x + s z= 1025 + 200 1.3 = 1285 z= s So Jessica should score 1285 points to expect to be among the top 10% of students.
Dr. Joseph Brennan (Math 148, BU) Chapter 5 - The Normal Curve 19 / 19