Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Copyright 2006, The Johns Hopkins University and Karl Broman. All rights reserved. Use of these materials
permitted only in accordance with license rights granted. Materials provided AS IS; no representations or
warranties provided. User assumes all responsibility for use, and all liability related thereto, and must independently
review all materials for accuracy and efficacy. May contain materials owned by others. User is responsible for
obtaining permissions for use from third parties as needed.
Outline
The essentials of
microarray data analysis
Experimental design
Take logs!
Pre-processing: affy chips and 2-color arrays
Clustering
Identifying differentially expressed genes
Thanks to Rafael Irizarry for
the slides!
Biological question
Differentially expressed genes
Sample class prediction etc.
Resources
Experimental design
Microarray experiment
Image analysis
http://www.biostat.jhsph.edu/~ririzarr/Teaching/688
R, G
Estimation
Testing
Clustering
Biological verification
and interpretation
Discrimination
Experimental design
Experimental design
Choice of platform
Array design
Creation of probes
Location on the array
Controls
Target samples
Avoidance of bias
Mouse model
Dissection of
tissue
Biological Replicates
RNA
Isolation
Balance
Randomization
Amplification
Technical replicates
Probe
labelling
Hybridization
7
Two-color layouts
Cy3
sample
Cy5
sample
Data provided by
Dave Lin (Cornell)
9
10
T2
Tn-1
Tn
Time Series
Ref
Experiment for which the common reference design is appropriate
Meaningful biological control (C) Identify genes that responded differently / similarly
across two or more treatments relative to control.
Large scale comparison. To discover tumor subtypes when you have many different tumor
samples.
T1
T2
T3
T4
Ref
T1
T2
T3
T4
T1
T2
T3
T4
Advantages:
Ease of interpretation.
Extensibility - extend current study or to compare the results from current study to other
array projects.
T1
11
T2
T3
T4
12
Pooling
Mouse model
Dissection of
tissue
RNA
Isolation
Pooling
Probe
labelling
Hybridization
13
14
Alternative pooling
strategy
16
Design 1
Pooled vs Individual
samples
Design 2
Taken from
Kendziorski etl al (2003)
17
18
Outline
Experimental design
Take logs!
Pre-processing: affy chips and 2-color arrays
Clustering
Identifying differentially expressed genes
20
Outline
Experimental design
Take logs!
Pre-processing: affy chips and 2-color arrays
Clustering
Identifying differentially expressed genes
21
22
Work flow
Affymetrix GeneChip
Design
Image analysis
Reference sequence
TGTGATGGTGGGGAATGGGTCAGAAGGCCTCCGATGCGCCGATTGAGAAT
CCCTTACCCAGTCTTCCGGAGGCTA Perfectmatch
CCCTTACCCAGTGTTCCGGAGGCTA Mismatch
Statistical test
Rank (list)
Choose filter
Significance level
NSB & SB
NSB
24
Self-self hybridization
log2 R vs. log 2 G
26
Self-self hybridization
Self-self hybridization
M vs. A
M vs. A
28
Boxplots by print-tip-group
Intensity
log-ratio, M
29
Simplest Idea
30
32
33
Quantile normalization
34
35
36
Before and
These are two technical replicates. Only the red are differentially expressed
38
Outline
39
40
10
Clustering is not a
universally good tool
Outline
Experimental design
Take logs!
Pre-processing: affy chips and 2-color arrays
Clustering
Identifying differentially expressed genes
41
Work flow
42
Pre-processing
normalization
Rank (list)
Choose filter
Significance level
Candidate genes (short list)
43
44
11
What's the difference? Mainly the way in which the variation within
population is incorporated
45
46
47
48
12
Useful Plots
Example
49
50
Summary
Experimental design is critical
Biological vs technical replicates
51
52
13