Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Note: This lesson is very basic and is meant for technicians in agriculture to learn
statistical concepts and methods for use field-level research. For a comprehensive study,
you should refer to textbooks on statistical methods and experimental designs. You can
also visit available on-line statistical tutorials such as:
http://statpages.org/javasta4.html
Simplicity,
High precision,
Absence of systematic error,
Range of validity of conclusions, and
Calculation of degree of uncertainty.
Some Basic Terms
Statistical science helps overcome this difficulty by requiring the collection of data in a
way so as to provide unbiased estimates of treatment effects and the valuation of
treatment differences by tests of significance based on measuring variability.
An experimental unit refers to the unit of material to which a treatment is applied. It can
be a single leaf, a whole plant, an area of ground containing many plants, a pot, or in the
green house. The term plot is synonymous with experimental unit. A treatment may be an
amount of material or a method that is to be tested and compared with other treatments in
the experiment; e.g., cultivar fertilizer doses, etc.
Individual measurements of a variable are data; 150 cm plant height, 45 days to 50%
flowering, 12 cm panicle length, or 3240 kg/ha of grain etc.
For example: All 150 plants in a plot form the population. Ten plants used for recording
the plant height from this population form the sample. We obtain information from the
sample.
Mean: Mean is the simple arithmetic average (of the sample or population).
Standard Deviation (SD): The measure of dispersion of the data around their mean is the
Standard Deviation. SD tells us how scattered the sample observations are around the
mean.
Variance: Square of the Standard Deviation.
Standard Error (SE): The SD of the population of means is its standard error. SE tells us
how scattered the treatment means (for example) will be.
Degrees of freedom (df): Represents the freedom with which the variability in a data set
could be accounted for. Usually, but not necessarily, df is one less than the number of
observations (n-1).
For example, with 24 observations on plant height, the degree of freedom to account for
the variability in plant height is 23 (24 observations - 1).
Probability level: We use a probability level to tell us whether an observed result is likely
to have happened by mere chance.
For example, a probability of 5% (0.05) tells us that a particular result observed may not
happen in 5 trials out of 100 trials purely by chance. In other words, the result would
normally be true in 95 cases out of 100 trials.
Distribution of data
Binomial distribution
Poisson Distribution
This is a distribution, which represents occurrences of rare events. For example, count
data such as the number of infested plants, the number of lesions per leaf, or number of
weeds per unit area.
Normal Distribution
These data represent a continuous distribution. Most biological data, when plotted in a
frequency curve, represent a bell shaped and symmetrical curve. Such data are said to be
normally distributed. For example, grain yield, plant height, etc.
Test of Significance:
Variability is a characteristic feature of nature. Two plants growing side by side are not
alike even under similar conditions. We also know that data on plant growth characters
reveal this variability. For example, from data on five samples plant height such as 115
cm, 95 cm, 82 cm, 108 cm, and 72 cm, ....., can we say the growth was good (115 cm
plant height) or the growth was poor (72 cm plant height)? What is the truth?
't' test, 'F' test, and Chi-square test are commonly used statistical procedures for testing
significance of observed results.
The single-sample 't' is a statistics computed from a sample, which expresses the
difference between the sample mean and a hypothetical population mean in standard error
units.
Thus:
The theoretical 't' values for the degrees of freedom for different sample sizes at different
probability levels are provided in a 't' Table.
Comparing the calculated 't' value with that of the theoretical 't' value from the 't' table,
one could test the significance of the sample mean being different from the hypothetical
population mean.
If the calculated 't' is > 't' table value, then the two mean values that are being tested are
significantly different.
If the calculated 't' is < 't' table value, then the two mean values that are being tested are
not significantly different from each other.
The 'F' test is a ratio between two variances and is used to determine whether two
independent estimates of variance can be assumed to be estimates of the same variance.
In the analysis of variance, the 'F' test is used to test equality of means of two or more
treatments; that is, to answer the question, can it reasonably be assumed that the treatment
means resulted from populations with equal means?
Like the theoretical 't' values, theoretical 'F' ratios are also computed for different sample
sizes at a given probability level in the form of a 'F' Table.
Comparing a calculated 'F' ratio with that of the theoretical 'F' ratio from the 'F' Table, the
significance among a group of treatments could be tested.
If the 'F' calculated is > 'F' Table values, then the treatments are significantly different at
that probability level.
If the 'F' calculated is < 'F' Table value, then the treatments are not significantly different
at that probability level.
Analysis of Variance:
A partitioning of the total variability in the data arising from various sources is presented
in the form of a Table. This Table is called the Analysis of Variance Table and consists
of:
The usage of an analysis of variance could be better understood if you review the lessons
on experimental designs.
We hope that this tutorial helped you to understand some of the basic principles in the
statistical procedures used in agricultural experimentation.