Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
:
Statistics is the scientific field that deals with collection, classification,
presentation, description, analysis & interpretation of data.
It includes:
1-
Descriptive statistics:
which is concerned with the summary measures of data for a sample of
a population.()
2-
Analytic statistics:
concerns with the use of data from a sample of population. (
)
3-
Vital statistics:
is the ongoing collection by government agencies of data relating to
events such as births, deaths, marriage, divorces, health and disease
related conditions reportable by local health authorities.
:
Biostatistics:
Uses in Medicine:
Physicians should have the basic knowledge for evaluation and criticism of
researches published in medical journals.
Statistics is helpful to assess diagnostic testing and the effects of new drugs and
treatment modalities.
Data:
are the basic building blocks of statistics and refer to the individual values
measured or observed. Data can be derived from a total population or a sample.
Methods of collection of data
1. By conducting survey:
Data collected from the population in the field of the study using a designed
questionnaire. There are two types of surveys
a) Comprehensive surveys:
b) Sample survey:
:
Types of data:
a) Constant data:
These are observations which do not vary from one person to another such as
number of eyes, fingers, ears etc.
b) Variables:
These are observations, which vary from one person to another or from one
group of members to others and are:
1. Quantitative variables:
These may be continuous or discrete.
Which are obtained by enumeration and its value is always integer value.
Examples: Pulse, family size, number of live births, number of abortions.
2. Qualitative variables:
Which are expressed in quality and cannot be enumerated or measured but can
be categorized only.
Can be put in order. e.g. degree of success: excellent, very good, good, fair.
b- Nominal qualitative:
Methods of presentation:
I.
Numerical presentation:
II.
Graphical presentation
1- Numerical presentation:
Types of tables:
) (
N.B.: For comparison the total groups should have the same total frequencies
otherwise calculate the percent of total for each frequency
One group is classified according to two variables e.g. weight and height or age
and blood pressure to find any correlation between these two variables.
simple
II-GRAPHICAL PRESENTATION
3. The histogram
represent data of the two subtypes of qualitative and quantitative discrete type
3-the histogram
suitable for continuous quantitative variable. It is used only when the table is of a
simple frequency distribution type.
Continous data
This type is used when the variable is of continuous quantitative type and the
table is of simple or complex type
Each interval in the table is represented by a single point opposite its frequency
on Y axis and opposite the mid-point of the interval on X axis. Then every two
consecutive points are connected by a straight line.
5- Pie Chart
It can be used for all the four types of variables when represented in a simple
table.
categories or intervals in the table, usually the division of the circle starts from 12
Oclock and it goes in a clockwise direction.
/ total frequency
III.
MATHEMATICAL PRESENTATION
These are computed values around which most of the observations tend to
concentrate or allocate.
The formula for computation of the arithmetic mean for ungrouped data is:
X=X/ n
Advantages of arithmetic mean:
Disadvantages:
1. Find the mid point (X) for each interval given by:
X=lower limit +upper limit /2
2-The median:
The median is the value that lies in the middle of the ordered observations.
Computation of the median from ungrouped data:
A) When n is odd:
The steps are:
3. Refer back to ordered observations and using the obtained ranks, we determine
the two middle values.
patients is: 1, 2,4,3,5. The median is 3 days and the mean is 1+2+3+4+5/5= 3 days
too.
While if the length of stay of a patient is 150 days instead of 5, then the median is
still 3 days ,while the mean will be 1+2+3+4+150/5= 32 days.
Disadvantages:
3-The mode:
The mode is the observation which has the highest frequency, or it is the most
frequent observation
This is done by finding the observation which has the highest frequency.
e.g. Weight of five children as follows: 9, 8, 12, 7, 8 kg.
A similar procedure can be used for finding the mode from qualitative data
Determination of mode from grouped data:
Two methods can be used:
a) The modal interval. This is the interval opposite the highest frequency.
b) The mid-point of modal interval (used only for quantitative data) in this
method the modal interval is determined as before, and the midpoint is obtained
as follows:
B) Measures of dispersion
1. Range:
It is a simple measure of dispersion and by definition range is difference
between the biggest and smallest observation.
From the above two examples range for first group = 36 28 = 8 years and for
second group = 62 8 = 54 years.
The largest possible observation the lowest possible observation i.e. the
upper limit of last interval lower limit of first interval.
Normal Distribution:
Definition:
The normal distribution curve is bell shaped, with lower and upper tails
and is determined by the mean and the standard deviation of the
population.
Standard deviation=1