Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Year 1
Matt Kean
Manchester Medical School
matthew.kean@manchester.ac.uk
Displaying data
Summarising data
The normal distribution
Categorical Variables
No. of births
Percentage
Normal
478
79.7
Forceps
65
10.8
Caesarean section
57
9.5
Total
600
100.0
Categorical Variables
often shown as bar graphs
or pie charts
No. of births
600
500
400
300
200
100
0
Ca
se esa
ct ria
io
n n
(5
7)
)
65
s(
p
rce
Fo
Normal
Forceps
Caesarean section
Normal
delivery (478)
Numerical Variables
To form a frequency distribution, the data may be
grouped.
Haemoglobinlevels(g/100ml)for70women:
rawdatawiththehighestandlowestvaluesunderlined
10.2
13.7
10.4
14.9
11.5
12.0
11.0
13.3
12.9
10.6
10.5
12.1
9.4
13.2
10.8
11.7
13.7
11.8
14.1
10.3
13.6
12.1
9.3
12.9
11.4
12.7
10.6
11.4
11.9
13.5
14.6
11.2
11.7
10.9
10.4
12.0
12.9
11.1
8.8
10.2
11.6
12.5
13.4
12.1
10.9
11.3
14.7
10.8
13.3
11.9
11.4
12.5
13.0
11.6
13.1
9.7
11.2
15.1
10.7
12.9
13.4
12.3
11.0
14.6
11.1
13.5
10.9
13.1
11.8
12.2
Haemoglobinlevels(g/100ml)for70women:
frequencydistribution
Haemoglobin
(g/100ml)
No.of
women
Percentage
1.4
4.3
10
14
20.0
11
19
27.1
12
14
20.0
13
13
18.6
14
7.1
1515.9
1.4
Frequencies: Numerical
Variables
often shown as histograms
polygons
or frequency
Shapes of
Distributions
Normal or Gaussian:
Symmetrical and
bell-shaped, e.g.,
height
Positively skewed, or
skewed to the right,
e.g., triceps skinfold
measurement
Negatively skewed,
or skewed to the left,
e.g., period of
gestation
Quantile
s
Equal-sized divisions of a distribution.
Examples of quantiles:
Summarising numerical
data
Numerical variables are often summarised in two
measurements:
- the sum of
n - number of observations
x
- the mean
Simplest measure
16
14 13
12 11 10
15
9
8
Normal distribution
The normal (or Gaussian) distribution is a
frequency distribution with a symmetrical, bellshaped curve.
Important because:
Many observed variables are
normally distributed
0.08
Mean = 70 SD = 5
0.07
0.06
Density
Bell-shaped curve
0.05
0.04
Mean = 70 SD = 10
0.03
0.02
0.01
0.00
40
50
60
70
Grades
80
90
100
Normal distribution
Outliers
Extreme values in a distribution
Task
Work in pairs
Open Task questions
Open StatsDirect data file