Domingo, Alvin
Exercise 9
Table 1 Summary of Disease Incidence of Backcross Population and Parentals
Source of Variation   SS         df   MS         F          P-value   F crit
Replication           20932.91   40   523.3228   2.206875   0.00134   1.544887
Genotype              3905.317   2    1952.658   8.234446   0.00056   3.110766
Error                 18970.64   80   237.133
Source of Variation   SS         df   MS         F          P-value    F crit
Replication           14989.07   40   374.7268   1.84453    0.028109   2.114232
Genotype              3144.644   1    3144.644   15.47899   0.000324   7.3141
Error                 8126.227   40   203.1557
Total                 26259.94   81
CV      R2
28.45   11.97
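The mean squares and F ratios in the ANOVA table that includes a Total row can be re-derived from its sums of squares and degrees of freedom. The following is a minimal sketch in Python, assuming MS = SS / df and F = MS / error MS; the variable and function names are illustrative and not part of the exercise.

```python
# Values copied from the ANOVA table that includes a Total row.
# Each entry is (sum of squares, degrees of freedom).
table = {
    "Replication": (14989.07, 40),
    "Genotype": (3144.644, 1),
    "Error": (8126.227, 40),
}
# Critical F values exactly as listed in that table.
f_crit = {"Replication": 2.114232, "Genotype": 7.3141}


def mean_square(ss, df):
    """Mean square = sum of squares divided by its degrees of freedom."""
    return ss / df


ms = {name: mean_square(ss, df) for name, (ss, df) in table.items()}
# Each F ratio is the source's mean square over the error mean square.
f_ratio = {name: ms[name] / ms["Error"] for name in f_crit}
# A source is declared significant when its F exceeds the critical F.
significant = {name: f_ratio[name] > f_crit[name] for name in f_crit}

for name in f_crit:
    print(f"{name}: MS={ms[name]:.4f}  F={f_ratio[name]:.4f}  "
          f"significant={significant[name]}")
```

Comparing F with F crit in this way gives the same decision as comparing the p-value with the chosen significance level.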
Table 4 Histogram Summary of Disease Incidence of Backcross Population and Parentals
Bin    Frequency   P1   P2
10     0
20     1           1
30     2
40     6
50     9
60     14
70     6
80     2
90     0                1
100    0
More   0
[Figure: Histogram. x-axis: bin (10 to 100); y-axis: frequency; series: Frequency, P1, P2.]
Answers to questions
3.a. The frequency distribution roughly follows a normal bell curve. The most populated range is
50 to 60, with a frequency of 14. The susceptible parental (P1) falls in the range of 10 to 20 and the
resistant parental (P2) falls in the range of 80 to 90. The distribution is not perfectly symmetrical:
it has a longer tail toward the lower values.
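The shape described in 3.a can be checked against the frequencies in the histogram table. The sketch below is only an approximation: it assumes each bin label is the upper bound of a 10-unit range and replaces every observation with the midpoint of its bin, since the raw values are not listed. The variable names are illustrative.

```python
# Backcross frequencies from the histogram table, keyed by bin label.
# Assumption: each bin covers the 10 units up to and including its label,
# so the bin labelled 60 holds values above 50 and up to 60.
freq = {10: 0, 20: 1, 30: 2, 40: 6, 50: 9, 60: 14, 70: 6, 80: 2, 90: 0, 100: 0}

n = sum(freq.values())
modal_bin = max(freq, key=freq.get)

# Approximate every observation by the midpoint of its bin.
midpoint = {upper: upper - 5 for upper in freq}
mean = sum(midpoint[b] * f for b, f in freq.items()) / n
var = sum(f * (midpoint[b] - mean) ** 2 for b, f in freq.items()) / n
third = sum(f * (midpoint[b] - mean) ** 3 for b, f in freq.items()) / n

# Moment-based skewness; a negative value means a longer lower tail.
skewness = third / var ** 1.5

print("n =", n)
print("modal bin =", modal_bin)
print("approximate mean =", mean)
print("skewness is negative:", skewness < 0)
```

A negative skewness from the grouped data agrees with the observation that more of the population lies below the modal range than above it.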
3.b. The variances are heterogeneous and do not follow a linear pattern. The variance and the mean
are not related: the points deviate too much from any expected pattern between variance and
mean.
3.c. A logarithmic transformation was applied. The variances are still somewhat
heterogeneous, but the trend is now clearly visible: the pattern follows a horizontal line.
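To illustrate what a logarithmic transformation does to a variance-versus-mean pattern, the sketch below compares group variances before and after taking log10. The two groups are hypothetical values chosen so that the spread grows with the mean; they are not the exercise data, and the helper name `summarize` is illustrative.

```python
import math
from statistics import mean, variance

# Hypothetical values, NOT the exercise data: the spread grows with the mean.
groups = {
    "low": [2.0, 4.0, 8.0],
    "high": [20.0, 40.0, 80.0],
}


def summarize(values):
    """Return (mean, sample variance) of a list of numbers."""
    return mean(values), variance(values)


raw = {name: summarize(vals) for name, vals in groups.items()}
logged = {
    name: summarize([math.log10(v) for v in vals])
    for name, vals in groups.items()
}

for name in groups:
    print(name, "raw variance:", raw[name][1],
          "log10 variance:", logged[name][1])
```

In this constructed case the raw variances differ by a large factor while the log10 variances match, which is the flat, horizontal variance-versus-mean pattern that a successful transformation aims for.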
4.a. There is a significant level of variation for every source except replication at the 1%
significance level, which has a p-value of 0.028.
4.b. Yes, it is important. Replication is not significant only in the analysis at the 1% level; in
the other analysis it is significant. This indicates that replication accounts for a meaningful share of
the variation.
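4.c. The coefficient of variation (CV) measures the spread of the data relative to the mean: it is the
standard deviation expressed as a percentage of the mean. It describes how far the data points are from
each other on a scale that does not depend on the units.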
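As a worked illustration of the CV, the sketch below uses the common ANOVA form, CV = sqrt(error MS) / grand mean x 100. The grand mean is not reported in the exercise, so the code only back-calculates the mean that would reproduce the reported CV of 28.45, under the assumption that the CV was computed from the error mean square of 203.1557. The function name `cv_percent` is illustrative.

```python
import math


def cv_percent(error_mean_square, grand_mean):
    """Coefficient of variation as a percentage of the grand mean."""
    return math.sqrt(error_mean_square) / grand_mean * 100


# Error MS from the ANOVA table that includes a Total row.
error_ms = 203.1557
root_mse = math.sqrt(error_ms)

# The grand mean is not reported. This back-calculates the mean that would
# give CV = 28.45, ASSUMING the CV was computed from this error MS.
reported_cv = 28.45
implied_mean = root_mse / reported_cv * 100

print("root MSE:", round(root_mse, 2))
print("implied grand mean:", round(implied_mean, 1))
print("CV check:", round(cv_percent(error_ms, implied_mean), 2))
```

The implied mean is only a consistency check, not a reported result.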
4.d. R2 measures how close the data points are to the regression line, which is the line that best
describes the trend of the data points. It can help describe whether the effect of the gene on
the phenotype follows a trend.
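In an ANOVA setting, R2 is often computed as the share of the total sum of squares explained by a source. The sketch below assumes that definition, applied to genotype, and uses the sums of squares from the ANOVA table that includes a Total row; the result comes out close to the reported R2 of 11.97, which suggests (but does not confirm) that the value is a percentage computed this way.

```python
# Sums of squares from the ANOVA table that includes a Total row.
ss_genotype = 3144.644
ss_total = 26259.94

# Assumed definition: share of total variation attributed to genotype,
# expressed as a percentage.
r2_percent = ss_genotype / ss_total * 100

print(f"R2 = {r2_percent:.2f}%")
```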
4.e. The higher the R2 value, the more predictable the effect of the QTL on the trait of an organism,
and the more clearly a usable trend can be formed. The QTL can therefore be determined more
accurately.