Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Case Study
Lactose tolerance Steve Cross,
Bronwyn Terrill
and colleagues
Wellcome Trust Sanger Institute
Version 1.1 Hinxton
Case Stu
Case Stud
lactose tolerance
Lactose tolerance
Most humans lose the ability to digest lactose and become lactose
intolerant by the time they are about ten years old. In some populations,
however, people still produce lactase, the enzyme that digests lactose,
in adulthood. The production of lactase gives an evolutionary advantage
to people who have access to cow’s milk.
The majority of people living today are unable to digest lactose in adulthood. 81–90% 21–30%
In this activity you’ll analyse DNA data from different human populations
71–80% 11–20%
to work out which variations in the human genome have enabled some
adult humans to consume dairy products without problems.
61–70% 1–10%
You will also discover which variants are associated with lactose tolerance
51–60% 0%
(or lactase persistence) in different populations. A statistical test (Chi-
squared) will be used to work out whether specific genetic changes have
41–50% No data
significant effects on lactase persistence.
LACTASE
Regulating genes
The difference between most lactose-intolerant and lactose-tolerant adults
isn’t that they have different genes — it’s that their genes are controlled
differently. Lactase-persistent people have a lactase gene that is not
switched off after childhood.
The body has three main ways of regulating the effect of a gene:
Exercise 1
Genotypes associated with lactose
tolerance and intolerance
Of the 137 cases showing lactase persistence, none were homozygous with
respect to the alleles C and G, at C/T–13910 and G/A–22018 respectively; 74
were homozygous for alleles T and A, with 63 being heterozygous at both
positions.
C/T –13910
Lactase persistence
Total samples
G/A –22018
Lactase persistence
Total samples
Questions
a. Which of the positions (C/T–13910 or G/A–22018) is 100% associated
with lactase persistence?
b. From this data at this position, which genotype(s) are associated with
lactose tolerance/lactase persistence?
c. From the data at this position, which genotype(s) are associated with
lactose intolerance/lactase non-persistence?
d. How associated is the other variant (C/T–13910 or G/A–22018) with
lactase persistence? (number of non-consistent findings divided by the
total number of samples x 100, subtracted from 100%).
Exercise 2
Other lactose persistence mutations
Although the 100% associated variant you identified in Exercise 1 is thought
to cause lactase persistence in Europeans, this variant is only present at
low levels in other lactase-persistent populations.
A Nature Genetics paper (Tishkoff et al, 2007) sought DNA variations that
could help to explain lactase persistence in East African populations. There,
lactase persistence was most common in Sudanese populations which
raised livestock, and least common in Tanzanian hunter-gatherers. The
researchers sequenced 3.3 kb and 1.8 kb in the same area on chromosome 2
(2q21) that yielded the European variant.
Below are the results from three different African populations at two
different positions in the genome: at 5 bases and 100 bases further
upstream from the LCT gene than the European variant. In this study, a
lactose tolerance blood test (a method of indirectly measuring levels of LCT
actiivty) was used to determine whether a person was LCT persistent, LCT
intermediate or LCT non-persistent.
The Chi-Squared test can be used here to test the following Null hypothesis.
Alternative hypothesis: The genotype (at this position) has a significant effect
on lactase persistence.
TOTALS
* To calculate the Expected values, you will need to count the number of number of samples with a single genotype
found across the entire population. This figure, divided by the total sample size will give you the prevalence of the
genotype. Multiplying the prevalence of each form of the allele by the number of samples in each condition group,
will give you the Expected values.
Size of population
Category Prevalence of allele Expected
(ignoring alleles)
Genotype 1 (CC)
LCT persistent
Genotype 2 (CG)
LCT persistent
Genotype 3 (GG)
LCT persistent
Genotype 1 (CC)
LCT non-persistent
Genotype 2 (CG)
LCT non-persistent
Genotype 3 (GG)
LCT non-persistent
Genotype 1 (CC)
Intermediate
Genotype 2 (CG)
Intermediate
Genotype 3 (GG)
Intermediate
TOTALS
* To calculate the Expected values, you will need to count the number of number of samples with a single genotype
found across the entire population. This figure, divided by the total sample size will give you the prevalence of the
genotype. Multiplying the prevalence of each form of the allele by the number of samples in each condition group,
will give you the Expected values.
Size of population
Category Prevalence of allele Expected
(ignoring alleles)
Genotype 1 (CC)
LCT persistent
Genotype 2 (CG)
LCT persistent
Genotype 3 (GG)
LCT persistent
Genotype 1 (CC)
LCT non-persistent
Genotype 2 (CG)
LCT non-persistent
Genotype 3 (GG)
LCT non-persistent
Genotype 1 (CC)
Intermediate
Genotype 2 (CG)
Intermediate
Genotype 3 (GG)
Intermediate
TOTALS
* To calculate the Expected values, you will need to count the number of number of samples with a single genotype
found across the entire population. This figure, divided by the total sample size will give you the prevalence of the
genotype. Multiplying the prevalence of each form of the allele by the number of samples in each condition group,
will give you the Expected values.
Size of population
Category Prevalence of allele Expected
(ignoring alleles)
Genotype 1 (GG)
LCT persistent
Genotype 2 (GT)
LCT persistent
Genotype 3 (TT)
LCT persistent
Genotype 1 (GG)
LCT non-persistent
Genotype 2 (GT)
LCT non-persistent
Genotype 3 (TT)
LCT non-persistent
Genotype 1 (GG)
Intermediate
Genotype 2 (GT)
Intermediate
Genotype 3 (TT)
Intermediate
(O–E)2
The total of E across all four groups is the chi-squared value ( χ 2).
Using this information, calculate the probability of the Null hypothesis
being true using this lookup table, where p is the probability.
Questions
For each of the three populations: