Sei sulla pagina 1di 47

GEOSTATISTIK

Syahrul
16 November 2017
Silabus kuliah

• Introduction to geostatistics Today’s


• Non-spatial statistics lecture

• Spatial statistics
• Estimation
• Simulation
Introduction to Geostatistics
CONTENT
• What is geostatistic?
• Application of spatial statistics
• Basic assumptions in spatial statistics
• Key concepts in geostatistics
• Exploratory data analysis (EDA) for non-spatial
statistics
• Spatial description
Geostatistics
• Geostatistics: branch of statistics that
deal with spatially correlated data
• Basic assumptions:
– Sample values are not independent
– Spatial dependency exists
• Goal of geostatistics:
– Spatial continuity model
– Use the model for estimation and/or
simulation of spatial data distribution
Geostatistics
• Geostatistics Term used by Hart (1952) - Application of
Statistics in a Geographic Context
• Matheron (1962, 1963) Used Term in a Geological
Context for Inferring Ore Reserves from Data Spatially
Distributed Within an Ore Body :
- Developed Theory of Regionalized Variables
– Formal Introduction of New Statistic - the
Semivariogram
– Used Kriging to Obtain Best Estimate of a Property (i.e.
Ore Grade) at Some Location in an Ore Deposit
– Built Theory on Practical Work of Krige (1951, 1960)
Geostatistics
•Mid 1980 to Mid 1990 >> Explosion of 3-D stochastic
(simulation based) reservoir modeling.

•Public domain GSLIB software library by Deutsch and


Journel, 1992

•Adopted by reservoir and production geologist for


reservoir modeling, Dubrule 1998
What is special about spatial data?
• Location of a sample  intrinsic part
of its definition
• All data sets  implicitly related by
their coordinates (models of spatial
structure)
• Data values may be related to their
coordinates  spatial trend
What is special about spatial data?

• Values at sample points can NOT be


assumed to be independent
• That is, there may be a spatial structure
to the data
– Classical statistics  independence
– Implications for sampling design
Key Concepts

• Spatial dependence: the value of a


variable at a point in space is related
to its value at nearby points
• Spatial structure: the nature of the
spatial relation
• Support of a sample: the physical
dimensions it represents
Geostatistic application
Reservoir Property Distribution Using the Available Well
Log, Core, and/or Seismic Data
Geostatistical
Analysis
Raw
Data

Selection of
Model
Appropriate
Estimation or
Stochastic Algorithm
Geostatistic application
• Quantify Uncertainty Using Multiple Geologically and
Statistically Valid Models
Individual
Reservoir
Simulation
Runs Are
Numbered

n
RESERVOIR
1 3
1 FLOW 6
2
SIMULATOR
3
5
4
4 2
5
6 OUTCOME
PROPERTY (PHI, K) (RECOVERY)
DISTRIBUTIONS
Geostatistic application
3D Static Earth Model

Well-log
Geostatistic and Earth Modelling
Limitations of geostatistics
Geostatistics does NOT :
• Create Data or Eliminate the Value of
Obtaining Additional Good Data
• Replace Sound Qualitative
Understanding and Expert Judgment
• Necessarily Save Time, At Least in the
Short Term.
• Work Well as a “Black Box”
Some useful sites
• The central information server for
Geostatistics and Spatial Statistics
http://www.ai-geostats.org/
• gstat: http://www.gstat.org
• ArcGIS Geostatistical Analyst:
http://www.esri.com/software/arcgis/
arcgisxtensions/geostatistical/
• Geostatistical analysis tutor (Colorado
School of Mines) :
http://uncert.mines.edu/tutor/
Non-spatial statistics

Exploratory Data Analysis (EDA)


Univariate description

Measure the characteristic of data population

• Mean
• Variance/standard deviation
• Histograms
• Spread/central tendency
• Skewness
Frequency Table
values:
2, 4, 1, 5, 2, 3, 6.9, 2, 5, 7, 2.1, 3.4, 4.2, 2.2, 2.9, 1.7, 3.5, 6.2

Cumulative
Interval Frequency Frequency
1 - 1.999 2 2
2 - 2.999 6 8
3 - 3.999 3 11
4 - 4.999 2 13
5 - 5.999 1 14
6 - 6.999 2 16
7 - 8.000 1 17
Histograms
6
5

frequency
4
3
2
1

0 1 2 3 4 5 6 7 8
data value
relative frequency

0.36
0.30
0.24
0.18
0.12
0.06
0 1 2 3 4 5 6 7 8

data value
Histograms
• Shape varies with Number of Bins
• Rule Of Thumb

Number of Bins = Number of Samples


Cumulative Distributions
cumulative

cumulative frequency
distribution
1.0
0.8
0.6
0.4
0.2

0 1 2 3 4 5 6 7 8
data value

• number of samples below bin maximum


• relative frequency below bin maximum
• probability of grade below bin maximum
Central Tendency Measurements

• Arithmetic Mean = Sum of values


No of values
• Mode = Highest Probability (i.e. ‘tallest’ bin in
histogram)
• Median = 50 percentile (i.e. 50% of values
are below the median)
Spread Measurements
• How different are values from the central
value?
– Range
• Maximum - Minimum
– Variance or Standard Deviation
– Inter-Quartile Range (IQR)
• 75 percentile - 25 percentile
• 90 percentile - 10 percentile
Skewness Measurements
• How symmetrical is the distribution?
– Skewness
– Kurtosis
Normal Distribution
• Many biological characteristics (e.g. height,
weight) follow a symmetrical distribution with
a predictable shape
• Called a Normal (or "Gaussian") distribution
Frequency
Normal Distribution
• Defined by mean and variance
Same Mean, Same Variance,
Different Variances Different Means

• Examples:
– Grain size, porosity, permeability, etc
Normal Distribution

Frequency

mean grade
mode
median

• Shape has known equation


• Mean = Median = Mode
Normal Distribution

• Where μ = Mean, σ = Standard Deviation


• From equation can calculate proportions
within Standard Deviation(s) of Mean
Probability Plots

• Straight line if data is Normally distributed


Skewed Distribution
• Unfortunately most variables in geology
follow a skewed, non symmetrical shape
Positively Negatively
Skewed Skewed

% %

grade grade
LogNormal Distribution
• Some variables in geology have a
LogNormal Distribution
– Logarithm of values have Normal Distribution
– Sometimes its Logarithm of (value + constant)

f%
f%

grade log-grade

RAW LOG-TRANSFORMED
LogNormal Distribution

• Mean ≠ Median ≠ Mode


• Mean is NOT antilog of Mean of Logs!
– Antilog (Mean + 0.5 x Variance)
Mixed Distributions

Frequency

Grade

• More than one mode


• Could be due to
– mixed domains
– multiple phases of mineralisation
Sample Support
• The characteristics of a sample:
– sample size (core diameter, sample length)
– sampling method (diamond, reverse circ)
– assay method (fire assay)
Volume-Variance
• Variance of data set changes according to
the support of the data (i.e. the volume of
material)
• The larger the sample volume, the lower the
variance of the samples
Volume-Variance
• Variance is inversely proportional to Volume
(size)
• Blasthole samples have a higher variance
than mining blocks
• Small model blocks have a higher variance
than bigger model blocks

Samples
Mining Blocks
Model Blocks
Outliers
• Outlier values may be cut to reduce their
impact on arithmetic mean and estimation
g/t gAu %
2 20,000 1%
4 40,000 3%
3 30,000 2%
7 70,000 4%
90 900,000 58%
15 150,000 10%
10 100,000 6%
20 200,000 13%

• E.g. if Top-Cut = 20 a value > 20 is changed


to 20
• Derive Top-Cut from Probability Plot
De-Clustering
• Statistics relies on samples being random
and un-biased
• In mining, we are more interested in ore
than waste
• Usually in mining, more drillholes are
located in high-grade areas
• So sampling is inherently biased - more
samples in high grade areas
De-Clustering
• Overcome this bias with de-clustering
• Put (3D) grid over data
• Within each cell take sample closest to
centre (or average of all samples)
Use average of Use single
4 samples sample

 Continue using this single value per cell


De-Clustering
• Any geological knowledge to split into
separate domains must be used

Only decluster if no geological separation


possible
Bivariate description
• Comparing two distributions
• Scatterplot
• Correlation
• Linear regression
Non-spatial statistics
Scatterplot

Regression line

  0. 7

x
Non-spatial statistics
different values of correlation coefficient

(Picture taken from Dubrule, 2003)

Potrebbero piacerti anche