Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Basic Concepts in
Biostatistics
1
BIOSTATISTICS and EPIDEMIOLOGY
BIOSTATISTICS
• It is the science that deals with the
COLLECTION, ORGANIZATION, ANALYSIS and
INTERPRETATION of numerical data
• “Life”
• It is the application of statistical methods to life
science like biology, medicines and public
health
BIOSTATISTICS and EPIDEMIOLOGY
Scope of Biostatistics
• Biostatistics covers applications and
contributions
• health
• medicines
• Nutrition
• genetics, biology, epidemiology, and many
others.
3
BIOSTATISTICS
Why do we needand EPIDEMIOLOGY
to study Statistics?
“Medical
students may
not like statistics,
but as doctors
they will.”
Martin
Bland
BIOSTATISTICS and EPIDEMIOLOGY
• Variation
• Tendency of a measurable characteristics to
change with respect to person, place & time
• E.g. weight, age, height, etc.
BIOSTATISTICS and EPIDEMIOLOGY
BIOSTATISTICS and EPIDEMIOLOGY
BIOSTATISTICS and EPIDEMIOLOGY
BIOSTATISTICS and EPIDEMIOLOGY
BIOSTATISTICS and EPIDEMIOLOGY
12
BIOSTATISTICS and EPIDEMIOLOGY
Scales of Measurements
• Nominal
• Ordinal
• Interval
• Ratio
BIOSTATISTICS and EPIDEMIOLOGY
Nominal
• A classificatory scale where the categories are
used as labels only (does not represent quantity)
• Number or names which represent a set of
mutually exclusive and exhaustive classes to which
individuals or objects (attributes) may be assigned
• E.g. Sex (Male and Female), Race, Blood Groups,
seatbelts in car, psychological diagnosis, patient ID
no.
BIOSTATISTICS and EPIDEMIOLOGY
Ordinal
• Same characteristics as the nominal scale
• Categories can be ordered or ranked; however the
distance between the two categories cannot be
clearly quantified
• E.g. Likert scales, Age groups (Infant, childe,
teenager, adult), psychosocial scales (strongly
disagree, disagree, agree, strongly agree)
BIOSTATISTICS and EPIDEMIOLOGY
Interval
• Distances between all adjacent classes are equal
• Conceptually, these scales are infinite, in that they
have neither beginning nor ending
• Zero point is arbitrary and does not mean absence
of the characteristic
• E.g. Temperature, IQ
BIOSTATISTICS and EPIDEMIOLOGY
Ratio
• A meaningful zero point exists
• Ratio of two numbers can be meaningfully
computed and interpreted
• E.g. Weight, Blood Pressure, Height, Doctor visits,
number of DMF teeth
BIOSTATISTICS and EPIDEMIOLOGY
18
BIOSTATISTICS and EPIDEMIOLOGY
Types of Data
• Primary data
• Obtained first hand by the investigator to help
him answer specifically the purpose of his study
• Secondary data
• Those which are already existing and which have
been obtained by some other people for
purposes
BIOSTATISTICS and EPIDEMIOLOGY
• Census
• Vital Registry
• Reports of occurrence of notifiable
disease
• Health records
BIOSTATISTICS and EPIDEMIOLOGY
MEASURES OF DATA
BIOSTATISTICS and EPIDEMIOLOGY
At the end of the unit, students must be able to:
1.summarize data by constructing a frequency distribution
or relative frequency distribution;
2.be able to find lower class limits, upper class limits, class
boundaries, class widths, and class marks for a given
frequency table;
3.calculate measures of center by finding the mean,
median, mode, and midrange;
4. calculate measures of variation by finding the standard
deviation, variance, and range;
BIOSTATISTICS and EPIDEMIOLOGY
Frequency Distribution
FREQUENCY
DISTRIBUTION TABLE
BIOSTATISTICS and EPIDEMIOLOGY
FREQUENCY DISTRIBUTION
(Frequency Table)
Class Limits
BIOSTATISTICS and EPIDEMIOLOGY
Class Limits
• Class Boundaries
• Lower class
boundary = (lower
class limit – 0.5
units)
• Upper class
boundary = (upper
class limit + 0.5
units)
BIOSTATISTICS and EPIDEMIOLOGY
Class Midpoints
• (Lower Class Limit + Upper Class Limit)/2
• (Lower Class Boundary + Upper Class
Boundary)/2
BIOSTATISTICS and EPIDEMIOLOGY
Assignment
Describe how each of the following variables can be
measured using the nominal, ordinal, and ratio scale
of measurements.
A.Nutritional Status
B.Educational attainment
C.Anemia status
BIOSTATISTICS and EPIDEMIOLOGY
Assignment
Differentiate the following:
a.Qualitative data in nominal scale and
qualitative data in the ordinal scale
MEASURES OF CENTRAL
TENDENCY
BIOSTATISTICS and EPIDEMIOLOGY
BIOSTATISTICS and EPIDEMIOLOGY
Example #1
For what value of x will 8 and x have the same mean (average)
as 27 and 5?
27 + 5 = 16 x + 8 = 16 X = 24
2 2
Example #2
On his first 5 biology tests, Bob received the following scores: 72,
86, 92, 63, and 77. What test score must Bob earn on his sixth test
so that his average (mean score) for all six tests will be 80?
72 + 86 + 92 + 63 + 77 + x = 80 X = 90
6
BIOSTATISTICS
Example #3 and EPIDEMIOLOGY
Seven students taking a course in Literature read, 16, 10, 24, 13,
20, 11, and 17 sections of the assigned books. Find the median
of the number of books that they are reading.
10 11 13 16 17 20 24
Ten students spent 90, 104, 83, 75, 115, 92, 83, 79, 88, & 111
minutes studying for an examination. Find the median time.
88 + 90
~X = -------------- = 89 minutes
2 39
Mode
BIOSTATISTICS andof Ungrouped
EPIDEMIOLOGY Data
1 2 3 4 5 5 5 6 8 9
4 5 6 7 8 9 10
10 10 11 11 12 12 13 13 14 14
4 5 6 7 8 8 9 9 10 10 11 12
5 6 6 7 8 9 9 10
3 4 4 5 6 6 7 8 8 9
40
BIOSTATISTICS and EPIDEMIOLOGY
Assignment
1. The weekly salaries of six nurses at UST Hospital are
P5140, P6220, P4090, P5800, P5140, P6200. For these six
salaries, find: (a) the mean (b) the median (c) the mode
MEASURES OF VARIATION
BIOSTATISTICS and EPIDEMIOLOGY
Example RANGE
R = 136.5 – 86.5
= 50
BIOSTATISTICS and EPIDEMIOLOGY
Sample
Interpretation of Standard Deviation
BIOSTATISTICS and EPIDEMIOLOGY
∑ (Xi - X) 2 ∑ (Xi - µ) 2
S2 = ----------------- σ2 = -----------------
n-1 N
Where:
X – observation value N - population size
X - sample mean s - sample standard deviation
µ - population mean σ - population standard deviation
n - sample size 46
BIOSTATISTICS and EPIDEMIOLOGY
(100)
s σ
cv = ----- (100) cV = ----- (100)
X µ
CV of Sample Data CV of Population
Remark: The higher the C. V. , the higher the variability and lower the C. V., the
higher is the consistency of the data.
BIOSTATISTICS and EPIDEMIOLOGY
Assignment
A sample of ages of five women enrolled in Slimmers’ World
showed the following ages in years to the nearest birthday:
22, 18, 26, 20, and 24. Their weights in pounds were 115, 159,
141, 137, and 130 respectively. Which of the two sets of data
is relatively more variable?
Assignment
• Following are the weights (in pounds) and transportation expenses
(in pesos) of 12 students. Find out which set of data is more
variable and why?