10,00 modelling and analysis of geophysical data using geostatistics and machine learning vasily...

51
UNCERTAINTY QUANTIFICATION OF GEOSCIENCE PREDICTION MODELS BASED ON SUPPORT VECTOR REGRESSION V. Demyanov 1 , A. Pozdnoukhov 2 , M. Kanevski 3 , M. Christie 1 1 Institute of Petroleum Engineering, Heriot-Watt University, Edinburgh, UK [email protected] 2 National Centre for Geocomputation, National University of Ireland, Maynooth. 3 Institute of Geomatics and Risk Analysis, University of

Upload: geographical-analysis-urban-modeling-spatial-statistics

Post on 11-May-2015

1.876 views

Category:

Technology


7 download

DESCRIPTION

10,00 Modelling and analysis of geophysical data using geostatistics and machine learningVasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)Intelligent Analysis of Environmental Data (S4 ENVISA Workshop 2009)

TRANSCRIPT

Page 1: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

UNCERTAINTY QUANTIFICATION

OF GEOSCIENCE PREDICTION

MODELS BASED ON SUPPORT

VECTOR REGRESSION

V. Demyanov1, A. Pozdnoukhov2, M. Kanevski3, M. Christie1

1 Institute of Petroleum Engineering, Heriot-Watt University, Edinburgh, UK [email protected]

2 National Centre for Geocomputation, National University of Ireland, Maynooth.3 Institute of Geomatics and Risk Analysis, University of Lausanne

Page 2: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Outline

• Geoscience modelling under uncertainty

• Machine learning based geomodels• Semi-supervised SVR reservoir model

– Case study– Robustness to noise– Predictions with uncertainty

• Conclusions

Page 3: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Outline

• Geoscience modelling under uncertainty

• Machine learning based geomodels• Semi-supervised SVR reservoir model

– Case study– Robustness to noise– Predictions with uncertainty

• Conclusions

Page 4: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Uncertainty Quantification (UQ) Framework

Mathematical Model

(parameters, pde)

Computer Simulation

(discretisation, timestep)

Modelparameters

Observed Data

0

500

1000

1500

2000

2500

0 200 400 600 800 1000 1200 1400

time (days)

0

200

400

600

800

1000

0 200 400 600 800 1000 1200 1400

time (days)

0

200

400

600

800

1000

1200

1400

0 200 400 600 800 1000 1200 1400

time (days)

0

500

1000

1500

2000

2500

3000

3500

0 100 200 300 400 500 600

time (days)

Natural System

Simulated vs Data

0

500

1000

1500

2000

2500

0 200 400 600 800 1000 1200 1400

time (days)

0

200

400

600

800

1000

0 200 400 600 800 1000 1200 1400

time (days)

0

200

400

600

800

1000

1200

1400

0 200 400 600 800 1000 1200 1400

time (days)

0

500

1000

1500

2000

2500

3000

3500

0 100 200 300 400 500 600

time (days)

MISMATCH

Forecast Uncertainty

0

500

1000

1500

2000

2500

0 200 400 600 800 1000 1200 1400

time (days)

0

200

400

600

800

1000

0 200 400 600 800 1000 1200 1400

time (days)

0

200

400

600

800

1000

1200

1400

0 200 400 600 800 1000 1200 1400

time (days)

0

500

1000

1500

2000

2500

3000

3500

0 100 200 300 400 500 600

time (days)

Computationally expensive

Page 5: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Adaptive Stochastic Optimisation for UQ

Model 1Model 2Model 3…………Model n

Sampling prior distribution

ReproductionRanking

Evaluation:

Model simulation Mismatchcalculation

iteration

Inference

New population

Ensemble of Models

Inferred Ensemble of Models for prediction

Sampling algorithms:• Genetic algorithms• Particle swarm optimisation• Ant Colony optimisation• Neighbourhood approximation

Page 6: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Search for Matching Models Challenge• FW simulation of multiple models generated for different combinations of parameter values is computationally expensive • High-dimensional parameter space remains fairly empty and poorly described despite thousands of generated models Region of

computational efficiency 100-10,000 FW runs Number of points per axis

Num

ber

of p

aram

eter

s

Page 7: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

UQ Framework with fast ML approximation

Mathematical Model

(parameters, pde)

Computer Simulation

(discretisation, timestep)

Modelparameters

Observed Data

0

500

1000

1500

2000

2500

0 200 400 600 800 1000 1200 1400

time (days)

0

200

400

600

800

1000

0 200 400 600 800 1000 1200 1400

time (days)

0

200

400

600

800

1000

1200

1400

0 200 400 600 800 1000 1200 1400

time (days)

0

500

1000

1500

2000

2500

3000

3500

0 100 200 300 400 500 600

time (days)

Natural System

Simulated vs Data

0

500

1000

1500

2000

2500

0 200 400 600 800 1000 1200 1400

time (days)

0

200

400

600

800

1000

0 200 400 600 800 1000 1200 1400

time (days)

0

200

400

600

800

1000

1200

1400

0 200 400 600 800 1000 1200 1400

time (days)

0

500

1000

1500

2000

2500

3000

3500

0 100 200 300 400 500 600

time (days)

Forecast Uncertainty

0

500

1000

1500

2000

2500

0 200 400 600 800 1000 1200 1400

time (days)

0

200

400

600

800

1000

0 200 400 600 800 1000 1200 1400

time (days)

0

200

400

600

800

1000

1200

1400

0 200 400 600 800 1000 1200 1400

time (days)

0

500

1000

1500

2000

2500

3000

3500

0 100 200 300 400 500 600

time (days)

Machine Learning

MISMATCH

Page 8: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Challenges in Geomodelling

• Improve representation of the reality with geologically realistic models based on identifiable parameters.

• More effective use of information from various sources by incorporating prior geological and expert knowledge with associate uncertainty

• Uncertainty propagation from data into the model without “freezing” assumptions and predefined model dependencies.

Page 9: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Aims

Uncertainty quantification with a geomodel which is able to improve geological realism by more effective use of prior information

• Model petrophysical properties in a fluvial reservoir using a robust machine learning approach – semi-supervised Support Vector Regression (SVR)

• Reproduce realistic geological structures and inherent uncertainty of the geomodel

• Integrate additional spatial data that are non-linearly correlated with reservoir properties.

Page 10: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Outline

• Geoscience modelling under uncertainty

• Machine learning based geomodels• Semi-supervised SVR reservoir model

– Case study– Robustness to noise– Predictions with uncertainty

• Conclusions

Page 11: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Support Vector Regression (SVR)

bxxKyxf i

L

iii

),()(1

bwxxf )(Kernel trick projects data into sufficiently high dimensional space:

• Linear regression in hyperspace• Complexity control with training errors:

L

ii

wCw

1

2

21min

SVR is formulated in terms of dot products of input data: (x ∙ x') → K (x , x')

where K(x,xi) is a symmetric and positively defined kernel function.

support vectors

Page 12: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Semi-supervised Learning Concept

• Supervised learning with a tutor– Learn from known input and output

(e.g. multi-layer perceptron neural network)

• Unsupervised learning without a tutor– Learn from known inputs only, no outputs are

available (e.g. Kohonen classification maps)

• Semi-supervised learning– Learn from a combination of data:

• Labelled with both known input and output• Unlabelled with only input available (manifold)

Page 13: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Kernel Methods on Geo-manifolds

• Data-driven models incorporate prior knowledge on the domain of the problem using graph models of natural manifolds

• Kernel function enforces continuity along the graph model – manifold – obtained from the prior information

Conventional regression estimate based on labelled data only (●)

Spiral manifold represented by unlabelled points (+)

Semi-supervised regression estimation follows the smoothness along the graph

Page 14: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Semi-supervised Approach

• Manifold assumption: data actually lie on the low-

dimensional manifold in the input space

• Geometry of the manifold can be estimated with

unlabelled data:

– incorporate natural similarities in data

– enforce smoothness on the manifold

• Manifold carries physical information and

incorporates prior physical knowledge

• Geo-manifold can reflect stochastic nature of the

inherent model uncertainty

Page 15: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Sources of Geo-manifold fro Reservoir Models

Geo-manifold for reservoir model can be elicited from prior information:

– on-site spatial data (seismic, well logs)

– other relevant data (outcrops, modern analogues, lab

experiments)

– expert knowledge in a non-parametric form

– parametric geological models (object shapes, process models)

– training image based models

Page 16: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Semi-supervised SVR Geomodel

SVR Learning Machine

poro&perm labelled data

from wells

Seismic data

+ geo-manifold unlabelled data

Stanford VI synthetic case study Semi-supervised (SVR)

Prior information

Page 17: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Outline

• Geoscience modelling under uncertainty

• Machine learning based geomodels• Semi-supervised SVR reservoir model

– Case study– Robustness to noise– Predictions with uncertainty

• Conclusions

Page 18: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Case Study

Stanford VI: a realistic synthetic reservoir data set

S. Castro, J. Caers and T. Mukerji

• Fluvial clastic reservoir:- sinuous channels- meandering channels- delta front

• Geomodel:- multi-points statistics models- sedimentation process model

• “Hard” poro/perm data from wells

•Synthetic seismic data: - 6 attributes:

AI, EI, λ, μ, Sw, Poisson ratio

Page 19: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Variability in Facies ModellingMulti-point simulation realisations

Training Image Hard well data Soft probabilistic data based on seismic

Page 20: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Case Study2D layer slices from different geological section:

• sinuous channels• delta front

porosity truth case

SVR geomodel (tuneable or fixed parameters):• Spatial correlation size

– Gaussian kernel width σ

• Continuity strength– Impact of unlabelled data of the manifold

• Smoothness along the manifold– Number of unlabelled points in the manifold – Number of neighbours in kernel regression

• Prior belief level for seismic data– Weight of additional seismic input (scaling parameter)

• Trade-off between goodness of fit and complexity– Regularisation term C determines balance between training error and

margin max

– Classification error

Page 21: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Stochastic Sampling for Matched Models

Misfit minimisation:

• 640 models generated in 8D parameter space• 40 good fitting models with misfit < 250

Generated models home in the regions of good fit:

170

180

200

220

250

300

500

1000

2000

5000

Misfit

channel porosity

shale porosity

channel permeability

shale permeability

channel permeabilitychannel porosity

Page 22: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Fitted Model: Property Distribution

porosity truth case

Realistic reproduction of geological structures detected from the prior data:– fluvial channels– thin mud channel boundaries– point-bars

Page 23: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Fitted Model Forecast: Fluvial Channels case

Oil and water production from 7 largest producing wells:

● History data (truth case + noise)

○ Validation truth case forecast data

Matched model

Page 24: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Variability of Uncertain Model Properties• Correlation

- kernel size σ

• Smoothness along the manifold - number of unlabelled points N

• Impact of additional data (seismic) on the predicted variables

• Seismic interpretation uncertainty

channel sands shale

channel sands shale

σσ

NN

Amplitude threshold for channel/shale boundary

scaling porosity scaling for permeability

Page 25: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Non-uniqueness of Semi-supervised SVR

Truth caseRealisation 2Realisation 1

Stochastic realisations, based on geo-manifolds generated with

different random seeds, represent inherent non-uniqueness of

the model with the given combination of the parameter values

Page 26: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Impact of Noise in Seismic DataOriginal seismic data with injected noise N(0,σ) ● unlabelled data

Semi-SVM porosityTruth case porosity

Semi-SVM porosity for N(0,2σ) added noise

Page 27: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Realisations of a single fitted model with unique set of parameters

Oil production profiles for 10 stochastic realisationsfor 6 wells:

● History data (truth case + noise)

○ Validation truth case forecast data

Oil production profiles for semi-SVR model realisations

Production: Stochastic Realisations

Page 28: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Multiple matching models vs Truth case porosity

Multiple good fitting models Truth case

The river delta front structure is very similar for different models due to the very clean synthetic seismic with no noise.

Page 29: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Fitted Model Forecast: Delta Front case

Oil and water production from 7 largest producing wells:

● History data (truth case + noise)

Fitted model

Truth case

Page 30: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Fitted Model Forecast: Delta Front case

Oil production from 7 largest producing wells:

● History data (truth case + noise)

Fitted model

Truth case

Page 31: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Confidence P10/P90 interval for production forecast based on multiple models:

Total oil and water production profiles:

● History data (truth case + noise)

○ Validation truth case forecast data

P10/P90 production forecast confidence bounds

Forecast with Uncertainty

Page 32: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Uncertainty of Model Parameters

Posterior probability distribution of the geomodel parameters:

• Kernel width – correlation – for poro & perm in sand or shale

• Continuity in sand and shale bodies – by N unlab

• Impact of seismic data to poro & perm – weight

Page 33: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Outline

• Geoscience modelling under uncertainty

• Machine learning based geomodels• Semi-supervised SVR reservoir model

– Case study– Robustness to noise– Predictions with uncertainty

• Conclusions

Page 34: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Conclusions• A novel learning based model of petroleum reservoir based

on capturing complex dependencies from data.• Semi-supervised SVR geomodel takes into account natural similarities

in space and data relations:

– Reproduction of geological structures and anisotropy of a fluvial systems in a realistic way based on prior information on geo-manifold represented by unlabelled data

– Robustness to noise and flexible control of signal/noise levels in data to detect geologically interpretable information

– Stochastic non-uniqueness inherent to the model is represented by the distribution of unlabelled data

• Multiple fitted models match both production history and the validation data in the forecast

• Uncertainty of the SVR model is quantified by inference of the multiple generated models, which provide uncertainty forecast envelope based on posterior probability

Page 35: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Further work

• Extension to 3D case by adding one more input to the SVR model

• Integrate other relevant data from outcrops and lab experiments

• Apply SVR modelling approach with Bayesian UQ framework to application in different fields: environmental and climate modelling, epidemiolgy, etc.

• 2 PhD positions in the Uncertainty Quantification project:

– Geologist, data integration

– Uncertainty modelling with machine learning

Apply to [email protected]

Page 36: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Acknowledgments

• J. Caers and S. Castro of Stanford University for providing Stanford VI case study

• UK EPSRC grant (GR/T24838/01)

• Swiss National Science Foundation for funding “GeoKernels: kernel-based methods for geo- and environmental sciences”

• Sponsors of Heriot-Watt Uncertainty Quantification project:

Page 37: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Research Summary

• Developed a novel model for petroleum reservoir based on capturing complex dependencies from data with learning methods.

• Novel model provide multiple HM model for different fluvial reservoirs: sinuous channels, delta front– both production history and the validation data in the forecast are

matched

• Benefits of the novel data driven geomodelling approach:– Reproduce realistic geological structure and anisotropy of property

distribution.– Robust to noise in prior data– Relate to identifiable properties: continuity, correlation, prior belief in

data, etc.

• Model uncertainty is described by the inference of multiple models– Posterior confidence interval describe uncertainty forecast – Uncertainty of the model parameters is quantified by posterior

probability distributions

Page 38: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Multiple good fitting models

Seismic dataLabelled (●) & unlabelled (+) data

Learning Machine

(SVR)

Prior information

Page 39: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Next Steps

• Production uncertainty forecasting based on the inference of the generated HM models.

• Extension to 3D case by adding one more input to the SVR model

• Integrate other relevant data from outcrops and lab experiments

Page 40: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Aims

• Explore robustness of semi-supervised SVR geomodel to noisy data

• Develop a way to reproduce inherent uncertainty of the semi-supervised SVR geomodel by stochastic realisations

• Integrate semi-supervised SVR geomodel into the Bayesian uncertainty quantification framework

Uncertainty quantification with a geomodel which is able to improve geological realism by more effective use of prior information

Page 41: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Content

• Motivation and Aims• Semi-supervised learning concept

– Support Vector Machine (SVM) recap

• Machine learning based geomodel– Noise pollution experiment– Inherent non-uniqueness of SVR-based model– SVR geomodel in Bayesian sampling

framework

• Conclusions

Page 42: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Impact of Noise in Seismic Data

In a real case additional data (seismic) are usually noisier than in our synthetic case

Channel geo-manifold defined by unlabelled points

Filtering low frequency component from seismic

Elastic impedance

Seismic is processed through a low pass filter to build a manifold of unlabelled points:

Page 43: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Seismic Data Polluted with NoiseGaussian noise with zero mean and 3 different std.dev σ is added.

Truth case

N(0, σ) N(0, 2σ) N(0, 3σ)

Page 44: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Filtering

Truth case

Only a low frequency component is left after filtering

N(0, σ) N(0, 2σ) N(0, 3σ)

Page 45: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Geo-manifoldUnlabelled points are generated only in the cells below the threshold

Truth case

N(0, σ) N(0, 2σ) N(0, 3σ)

Page 46: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Porosity SVR Estimates for Noisy DataNoise level: 1 σ Noise level: 2 σ Noise level: 3 σ

Geo-manifold becomes less concentrative and the channel “erodes” with increase of the noise level

Truth case

Page 47: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Prediction with a Large Noise LevelNoise level: 3σ

Even with large noise levels the channel continuity can be traced in SVR prediction although it is barely visible in the input data

Truth case

Page 48: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Impact of Inherent Non-uniqueness

Stochastic realisations of water production from 6 largest producing wells

Page 49: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

NA Sampling: Misfit Distribution

Misfit of models generated by NA

Lowest misfit = 188

Page 50: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

NA Sampling: Parameter Distributions

Histogram of parameter values for the generated models

Models generated by NA home in the regions of good fit

Page 51: 10,00 Modelling and analysis of geophysical data using geostatistics and machine learning Vasily Demyanov – Heriot–Watt Institute, Edinburgh (U.K.)

Support Vector Machine (SVM)

0 < i < C Support Vectors (SV)

i = 0 Normal Samples

i = C Support Vectors untypical or noisy

Trade-off between: margin maximisation & training error minimisation

Linear separation problem

Increase space dimension to solve separation problem linearly

L

ii

wCw

1

2

21min Soft margin:

i 0 slack variables to allow noisy samples & outliers to lie inside or on the outer side of the margin

wx 1+b=

1

wx 2+b=-1