Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Geostatistics
Exploratory Analysis
Carlos Alberto Felgueiras
cfelgueiras@isegi.unl.pt
1
Master of Science in Geoespatial Technologies
Contents
EDA and ESDA
Univariate Description
Bivariate Description
2
Master of Science in Geoespatial Technologies
Structural
Analysis
Realizations Fields
Conditional
Simulation
Exploratory
Analysis
Derived Scenarios
Uncertainty Maps 3
Master of Science in Geoespatial Technologies
4
Master of Science in Geoespatial Technologies
important to be able to link numerical and graphical procedures with the map
need to be able to answer the question: where are those cases on the
map?.
With modern graphical interfaces this is often done by 'brushing' - for
example cases are identified by brushing the relevant part of a boxplot, and
the related regions are identified on the map.
This lecture only covers the case of attribute data attached to irregular areal
units because we want to focus on the range of analytical tools required for
ESDA and their provision using GIS.
5
Master of Science in Geoespatial Technologies
Sampling
Spatial sampling involves determining a limited
number of locations in a geo-space for faithfully
measuring phenomena that are subject to
dependency and heterogeneity.
nA = 8 and nB = 7
9
Master of Science in Geoespatial Technologies
Continuous Variables
Height Frequency Cumulative
Values Frequency
<=3 1 1
<=19 2 3
<=26 3 6
<=39.7 5 11
<=49.82 6 17
<= 57 9 26
<= 65 4 30
A = Basketball Cummulative
Frequency
B = Basketball + Football
C = Basketball + Football + Hokey
C = Basketball + Football + Hokey + Others
12
Master of Science in Geoespatial Technologies
n =1
Median Once the data is ordered: z1z2.zn 0
q.5
40 - <50 14
Master of Science in Geoespatial Technologies
Deciles split the data into tenths, percentiles split the data into
hundredths. Quantiles are the generalization of this idea to any fraction.
We represent as qp the p-quantile of the data. Example: q.1 is the first
decil and q.25 = Q1 is the first quartil, and so on.
15
Master of Science in Geoespatial Technologies
Set A Set B
Sample Sets Minimum 3 3
Set A Set B Obs. The
Maximum 132 20 Mean value is
3 3 sensitive to
10 10
Mean 27 12 erratic or
extreme
11 11 Mode 14 14 values
12 12
Median 13 12
14 14
14 14 Quartil 1 10,75 10,5
20 20 Quartil 2 13 12
132
Quartil 3 15,5 14
nA = 8 and nB = 7
16
Master of Science in Geoespatial Technologies
1 n
2
= ( z ( ) z )2
n 1 =1
17
Master of Science in Geoespatial Technologies
Sample Sets
Set A Set B Set A Set B
3 3 Mean 27 12
Obs. Variance and
10 10 Standard Deviation
Variance 1822,57 26,33
11 11 are also sensitive to
erratic or extreme
12 12 Std Var 42,69 5,13 values
14 14 Quartil 1 10,75 10,5
14 14 Obs. The interquartil
Quartil 3 15,5 14 value does not
20 20
depend on the
132 Q3-Q1 4,75 3,5 mean value.
nA = 8 and nB = 7
18
Master of Science in Geoespatial Technologies
=
(Z z )3
n
3
Sample Sets
Set A Set B
Set A Set B
3 3 Mean 27 12
10 10
Std Var 42,69 5,13
11 11
12 12 Coef. of
Variation 1,58 0,43
14 14
14 14 Skewness 1,81 -0,22
20 20
132 Curtosis 4.60 2,21
20
Master of Science in Geoespatial Technologies
22
Master of Science in Geoespatial Technologies
23
Master of Science in Geoespatial Technologies
24
Master of Science in Geoespatial Technologies
25
Master of Science in Geoespatial Technologies
EDA Univariate Description
An Example with Elevation data
NORMAL PROBABILITY
PLOTS
26
Master of Science in Geoespatial Technologies
Bivariate
Description
The ScatterPlot
Correlation between two variables
The covariance value
The correlation coefficient
Linear Regression
Conditional Expectation
27
Master of Science in Geoespatial Technologies
Objectives
Representation of relations/co-relations between
two variables
Comparing the distributions of the two variables
Use of graphics and statistic measurements that
summarize the main features of the bivariate
relations.
28
Master of Science in Geoespatial Technologies
29
Master of Science in Geoespatial Technologies
SCATTERPLOTS
Y Y Y
X X X
(a) (b) (c)
31
Master of Science in Geoespatial Technologies
1 n
C ij = ij = n (
(zi ( ) mi ) z j ( ) m j )
=1
32
Master of Science in Geoespatial Technologies
(z ( ) m ) (z ( ) m )
n
1
=
i i j j
= =1
ij
ij
i j
n i j
y = ax + b
where the slope, a, and the constant, b, are given by:
a=
y
b = mx a m y
X
34
Master of Science in Geoespatial Technologies
Sample Sets
Set A Set B
Set A Set B ScatterPlot
Mean 14 10
3 22 25
10 18 20
Std Var 7,39 7,01
15
Set B
11 11
10
12 11 5 Covariance -39,63
14 7 0
0 10 20 30
14 5 Set A
Skewness -0,875
20 4 Linear a =-0,83
28 2 Regression (-41,19o) b=22,3
nA = nB = 8
35
Master of Science in Geoespatial Technologies
36
Master of Science in Geoespatial Technologies
GIS are currently only poorly equipped with many of these tools,
despite containing the basic functionality to allow them to be
implemented.
37
Master of Science in Geoespatial Technologies
Geostatistics EDA
END of Presentation
38
Master of Science in Geoespatial Technologies
39