Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Bowers D. Medical Statistics from scratch. 2nd Ed. Chichester, John Wiley & Sons Ltd. 2008
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 3
Types of variables
Bowers D. Medical Statistics from scratch. 2nd Ed. Chichester, John Wiley & Sons Ltd. 2008
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 4
Types of data
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 5
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 6
1. Bowers D. Medical Statistics from scratch. 2nd Ed. Chichester, John Wiley & Sons Ltd. 2008
2. Machin D, Campbell MJ, Walters SJ. Medical Statistics. 4th Ed. Chichester, John Wiley & Sons. 2007
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 8
1. Bowers D. Medical Statistics from scratch. 2nd Ed. Chichester, John Wiley & Sons Ltd. 2008
2. Machin D, Campbell MJ, Walters SJ. Medical Statistics. 4th Ed. Chichester, John Wiley & Sons. 2007
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 9
Ranks
In some studies it may be appropriate to assign
ranks:
• Patients with rheumatoid arthritis may be asked to order
their preference for four dressing aids.
• although numerical values from 1 to 4 may be assigned to
each aid, one cannot treat them as numerical values.
• They are in fact only codes for
1. best,
2. second best,
3. third choice and
4. worst.
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 10
1. Bowers D. Medical Statistics from scratch. 2nd Ed. Chichester, John Wiley & Sons Ltd. 2008
2. Machin D, Campbell MJ, Walters SJ. Medical Statistics. 4th Ed. Chichester, John Wiley & Sons. 2007
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 11
Machin D, Campbell MJ, Walters SJ. Medical Statistics. 4th Ed. Chichester, John Wiley & Sons. 2007
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 12
Bowers D. Medical Statistics from scratch. 2nd Ed. Chichester, John Wiley & Sons Ltd. 2008
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 13
Machin D, Campbell MJ, Walters SJ. Medical Statistics. 4th Ed. Chichester, John Wiley & Sons. 2007
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 15
Bowers D. Medical Statistics from scratch. 2nd Ed. Chichester, John Wiley & Sons Ltd. 2008
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 16
Distributions of variables –
Categorical variables
Cook A, Netuveli G, Sheikh A. Basic skills in statistics. London, Class Publishing Ltd. 2004
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 21
Distributions of variables –
Categorical variables
Distributions of variables –
Numerical variables
• The strategy of representing distributions is
not possible.
• Group data into intervals on the measuring
scale – bins or class intervals
• The bins need not be of equal size
Cook A, Netuveli G, Sheikh A. Basic skills in statistics. London, Class Publishing Ltd. 2004
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 23
Distributions of variables –
Numerical variables
• Important differences between a histogram
and a bar chart
• bar chart, the height of the column represents
the frequency,
• in a histogram the frequency is represented by
the area of the column.
Cook A, Netuveli G, Sheikh A. Basic skills in statistics. London, Class Publishing Ltd. 2004
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 24
1. Measures of location
Cook A, Netuveli G, Sheikh A. Basic skills in statistics. London, Class Publishing Ltd. 2004
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 25
Cook A, Netuveli G, Sheikh A. Basic skills in statistics. London, Class Publishing Ltd. 2004
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 26
Cook A, Netuveli G, Sheikh A. Basic skills in statistics. London, Class Publishing Ltd. 2004
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 27
Key Messages
Categorical variables can be summarised using
counts and percentages.
Discrete numerical variables can be
summarised using the mode and median as
measures of location, and ranges and
percentiles as measures of dispersion.
Normally distributed numerical variables
should be summarised using the mean and
standard deviation.
Non-normally distributed numerical variables
should usually be summarised with the median
and a measure of range.
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 28
Shapes of Distributions
SYMMETRIC:
histogram in which the right half is a
mirror image of the left half.
SKEWED TO THE RIGHT:
histogram in which the right tail is
more stretched out than the left.(long
tail to the right)
SKEWED TO THE LEFT:
histogram the left tail is more
stretched out than the right.(long tail
to the left)
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 29
Shapes of Distributions
NUMBER OF MODAL CLASSES:
the number of distinct peaks in a
histogram
BELL-SHAPED:
A histogram looks like a bell.
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 30
Skewness
• Skewness measures the degree of
asymmetry exhibited by the data
– Positive skewness – More
observations below the mean than
above it
– Negative skewness – A small number
of low observations and a large
number of high ones
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 31
Skewness
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 32
Kurtosis
• Kurtosis measures how peaked the
histogram is
• The kurtosis of a normal
distribution is 0
• Kurtosis characterizes the relative
peakedness or flatness of a
distribution compared to the normal
distribution
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 33
Kurtosis
• Platykurtic– When the kurtosis < 0,
the frequencies throughout the curve
are closer to be equal (i.e., the
curve is more flat and wide)
– Thus, negative kurtosis indicates a
relatively flat distribution
TABLES
• the best way of showing structured numeric
information
GRAPHS / CHARTS
• better for showing relationships
• making comparisons
• indicating trends
• it is usual to include a table to show the data from which it
was drawn
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 35
Tables
Where there is no
natural ordering of the
rows (or indeed
columns), they
should be ordered
by size (category with
the highest frequency
first, lowest frequency
last) as this helps the
reader to scan for
patterns and
exceptions in the data.
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 54
Presenting Graph
• “a picture is worth a thousand
words.”
• Graphs tell a story in “pictures”
rather than in words or numbers.
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 55
Good Graph
• a clear title (with
the sample size),
• labeled axes,
• no gridlines and
• the marital status
categories are
ordered by their
frequency.
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 57
Table or Graph ?
• A table a display of numbers in a
rectangular grid,
• A graph or chart a picture in which the
numbers are represented by points or lines.
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 58
Type of Graphs
Categorical Numerical graphs
graphs (nominal – Stemplot (stem-
or ordinal) and-leaf plot)
– Bar graph – Histograms
– Pie chart – Frequency
polygon
– Boxplots
– Scatter plot
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 59
Bar Chart
• Give a clear display of simple results.
Bar Chart
• Bar Graph uses bars to represent the
frequencies (or relative frequencies)
the height of each bar equals the
frequency or relative frequency of each
category.
• Bar Graph: height indicates count or
percent
• Frequencies: counts
• Relative frequencies: percent
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 61
Data 1
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 62
Bar Graph
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 63
Pie Chart
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 65
Pie Chart
• Generally pie charts are to be avoided, as
they can be difficult to interpret
particularly when the number of
categories is greater than five.
This should never be done as they are especially difficult to read and interpret. When
the charts are displayed as three dimensional this relationship is lost as what is
displayed becomes a volume. Only the front face is proportional to the numbers in the
categories and so only these should be displayed
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 67
Data 2
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 68
Data 3
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 71
Count Data
• Count data can
only take whole
numbers and the
best method to
display them is
using a bar chart.
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 73
Count Data
• On the horizontal
axis are the number
of deaths per day,
going from a
minimum of 0 deaths
per day to a
maximum of 16
deaths per day,
• On the vertical axis
is the frequency with
which these occur
during this 5-year
period.
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 74
Histograms
In order to construct a histogram
• the data range is divided into several
non-overlapping equally sized bins
(categories)
• the number of observations falling into
each bin counted.
• the categories are then displayed on the
horizontal axis (X-axis) and the
frequencies displayed on the vertical axis
(Y-axis)
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 76
Histogram
• Breaks the range of the values of a
quantitative variable into intervals and
displays only the count or percent of the
observations that fall into each interval.
• You can choose any convenient number of
intervals.
• Intervals must be of equal width.
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 77
Histogram
Box–Whisker Plots
• Box plots can be particularly useful for
comparing the distribution of the data
across several groups.
Box–Whisker Plots
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 80
Boxplot
• Boxplots graphically represent
the scores in a distribution
• Made using 5 number
summary
• Within the box are all scores that
fall between the 25th and 75th
percentile
• The whiskers capture all scores
within 1.5 IQRs of the box
boundary
• Outliers are between 1.5 and 3
IQRs
• Extreme outliers are beyond 3
IQRs
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 81
Scatter Diagram
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 84
Scatter Diagram
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 85
Scatter Diagram
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 86
Line Graph
• A graph showing the differences in frequencies or
percentages among categories of an interval-
ratio variable.
• Points representing the frequencies of each
category are placed above the midpoint of the
category and are joined by a straight line.
• Appropriate when the horizontal axis is
continuous rather than categories.
• They could be used to show progress over
time
– e.g. development of a measured skill each week over a
ten-week course
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 87
Line Graph
Biostat_3 MDU-PPDS FKUI Wresti Indriatmi 88
References
1. Peat J, Barton B. Medical statistics. A
guide to data analysis and critical
appraisal. Oxford, Blackwell Publishing
Ltd., 2005
2. Freeman JV, Walters SJ, Campbell MJ.
How to display data. Oxford, Blackwell
Publishing Ltd., 2008
3. Hall GM. How to write a paper. 5th Ed.
Oxford, John Wiley & Sons, Ltd., 2013