
ANNs (Artificial Neural Networks)

THE PERCEPTRON

Perceptron

• X is a (column) vector of inputs.
• W is a (column) vector of weights.
• θ (or w0) is the bias or threshold weight.

Perceptron

Usage:

1. training/learning
– local vs. global minima
– supervised vs. unsupervised

2. feedforward (or testing or usage or application)
– Indicate class i if output g(X) > 0; not class i otherwise.

Perceptron

$g(X) = W \cdot X = \sum_{i=1}^{d} w_i x_i + w_0$

Perceptron

The bias or threshold can be represented as simply another weight (w0) with a constant input of 1 (x0=1).

$g(X) = W \cdot X = \sum_{i=1}^{d} w_i x_i + w_0 = \sum_{i=1}^{d} w_i x_i + w_0 x_0 = \sum_{i=0}^{d} w_i x_i, \quad \text{where } x_0 = 1$
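As a concrete check of this trick, here is a minimal NumPy sketch (the numbers and variable names are illustrative, not from the slides):

import numpy as np

W = np.array([0.2, -0.5, 0.1])   # weights w1..wd (d = 3)
w0 = 0.4                         # bias / threshold weight
X = np.array([1.0, 0.0, 1.0])    # inputs x1..xd

# Explicit bias term: g(X) = sum_i w_i x_i + w0
g_explicit = W @ X + w0

# Bias folded in as weight w0 with constant input x0 = 1
W_aug = np.concatenate(([w0], W))    # (w0, w1, ..., wd)
X_aug = np.concatenate(([1.0], X))   # (1,  x1, ..., xd)
g_folded = W_aug @ X_aug

assert np.isclose(g_explicit, g_folded)   # identical by construction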

Perceptron

• This is the dot product of two vectors, W and X.

$g(X) = W \cdot X = \sum_{i=0}^{d} w_i x_i, \quad \text{where } x_0 = 1$

$W \cdot X = \|W\|\,\|X\|\cos\theta = \sum_{i} w_i x_i$

Activation functions

1. linear: $out = k \, g(X)$

2. threshold: $out = 1$ if $g(X) \ge 0$; $out = 0$ (or $-1$) otherwise

3. sigmoid: $out = \dfrac{1}{1 + e^{-g(X)}}$

4. tanh: $out = \tanh(g(X)) = \dfrac{1 - e^{-2 g(X)}}{1 + e^{-2 g(X)}}$
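In code, each of the four choices is a one-liner; a sketch in NumPy (function names are mine):

import numpy as np

def linear(g, k=1.0):
    return k * g                       # out = k * g(X)

def threshold(g):
    return np.where(g >= 0, 1.0, 0.0)  # 1 if g(X) >= 0; 0 (or -1) otherwise

def sigmoid(g):
    return 1.0 / (1.0 + np.exp(-g))    # out = 1 / (1 + e^{-g(X)})

def tanh_act(g):
    return np.tanh(g)                  # out = (1 - e^{-2g}) / (1 + e^{-2g})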

Relationship between sigmoid and tanh

$\tanh\!\left(\frac{x}{2}\right) = \frac{1 - e^{-x}}{1 + e^{-x}} = \frac{2}{1 + e^{-x}} - 1 = 2\,s(x) - 1$

equivalently, $s(x) = \frac{1}{1 + e^{-x}} = \frac{1 + \tanh(x/2)}{2}$
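The identity is easy to verify numerically; a small sketch:

import numpy as np

x = np.linspace(-5, 5, 101)
s = 1.0 / (1.0 + np.exp(-x))                    # sigmoid s(x)
assert np.allclose(np.tanh(x / 2), 2 * s - 1)   # tanh(x/2) = 2 s(x) - 1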

Perceptron training/learning

1. Initialize weights (including the threshold weight, w0) to random numbers uniformly distributed in [-0.5, +0.5].

2. Select a training input vector, X, and its desired output, d.

3. Calculate the actual output, y.

4. Learn (only perform this step when the output is incorrect):

$w_i^{\text{new}} = w_i^{\text{current}} + \eta \, (d - y) \, x_i$

where $\eta \in [0, 1]$ is the gain or learning rate.
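A minimal sketch of this procedure in NumPy, assuming a threshold unit and (as an arbitrary choice) the AND function as training data:

import numpy as np

rng = np.random.default_rng(0)

# AND training set; x0 = 1 is prepended so w[0] plays the role of w0.
X = np.array([[1, 0, 0], [1, 0, 1], [1, 1, 0], [1, 1, 1]], dtype=float)
d = np.array([0, 0, 0, 1], dtype=float)

w = rng.uniform(-0.5, 0.5, size=3)  # step 1: weights in [-0.5, +0.5]
eta = 0.1                           # learning rate (gain), in [0, 1]

for epoch in range(100):
    errors = 0
    for x, target in zip(X, d):            # step 2: pick (X, d)
        y = 1.0 if w @ x > 0 else 0.0      # step 3: actual output
        if y != target:                    # step 4: learn only on a mistake
            w = w + eta * (target - y) * x
            errors += 1
    if errors == 0:   # converged: every pattern classified correctly
        break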

• “The proof of the perceptron learning theorem (Rosenblatt 1962) demonstrated that a perceptron could learn anything it could represent.” [3]

• So what can it represent?
– Any problem that is linearly separable.
– Are all problems linearly separable?

• Consider a binary function of n binary inputs.

– “A neuron with n binary inputs can have $2^n$ different input patterns, consisting of ones and zeros. Because each input pattern can produce two different binary outputs, 1 and 0, there are $2^{2^n}$ different functions of n variables.” [3]

– How many of these are separable? Not many! For n = 6, $2^{2^6} \approx 1.8 \times 10^{19}$, but only 5,028,134 are linearly separable.

– AND and OR are linearly separable but XOR is not!
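A short argument (not on the slides) shows why XOR fails: a single threshold unit with inputs $x_1, x_2$ that reports class 1 when $g(X) > 0$ would need

$$
\begin{aligned}
w_0 &\le 0 && (0 \oplus 0 = 0)\\
w_1 + w_0 &> 0 && (1 \oplus 0 = 1)\\
w_2 + w_0 &> 0 && (0 \oplus 1 = 1)\\
w_1 + w_2 + w_0 &\le 0 && (1 \oplus 1 = 0)
\end{aligned}
$$

Adding the two strict inequalities gives $w_1 + w_2 + 2w_0 > 0$, i.e. $w_1 + w_2 + w_0 > -w_0 \ge 0$, which contradicts the last line, so no such weights exist.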

How about adding additional layers?

• “Multilayer networks provide no increase in computational power over a single-layer network unless there is a nonlinear function between layers.” [3]

– That’s because matrix multiplication is associative:
• $(X W_1) W_2 = X (W_1 W_2)$
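A quick numerical check of this collapse (shapes chosen arbitrarily):

import numpy as np

rng = np.random.default_rng(0)
X  = rng.normal(size=(4, 5))   # 4 input vectors of dimension 5
W1 = rng.normal(size=(5, 3))   # first linear layer
W2 = rng.normal(size=(3, 2))   # second linear layer

# Two linear layers equal one linear layer with weights W1 @ W2.
assert np.allclose((X @ W1) @ W2, X @ (W1 @ W2))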

• “It is natural to ask if every decision can be implemented by such a three-layer network … The answer, due ultimately to Kolmogorov …, is “yes” – any continuous function from input to output can be implemented in a three-layer net, given sufficient number of hidden units, proper nonlinearities, and weights.” [1]

MULTILAYER NETWORKS

[Figure: a multilayer network with threshold nodes, from “Usefulness of artificial neural networks to predict follow-up dietary protein intake in hemodialysis patients”, http://www.nature.com/ki/journal/v66/n1/fig_tab/4494599f1.html]

Backpropagation (learning in a multilayer network)

Sigmoid/logistic function (and its derivative)

$s(x) = \frac{1}{1 + e^{-x}}$

$\frac{ds}{dx} = \frac{d}{dx}\,(1 + e^{-x})^{-1} = \frac{e^{-x}}{(1 + e^{-x})^2}$  (reciprocal rule)

$= \frac{(1 + e^{-x}) - 1}{(1 + e^{-x})^2}$  (add & subtract 1)

$= \frac{1}{1 + e^{-x}} - \frac{1}{(1 + e^{-x})^2} = s(x) - s(x)^2 = s(x)\,(1 - s(x))$  (trivial to compute)
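A numerical spot-check of the identity $s'(x) = s(x)(1 - s(x))$ against a central difference:

import numpy as np

def s(x):
    return 1.0 / (1.0 + np.exp(-x))

x = np.linspace(-5, 5, 101)
h = 1e-6
numeric  = (s(x + h) - s(x - h)) / (2 * h)   # central difference
analytic = s(x) * (1 - s(x))                 # the identity derived above
assert np.allclose(numeric, analytic, atol=1e-8)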

References

1. R.O. Duda, P.E. Hart, D.G. Stork, “Pattern Classification,” John Wiley and Sons, 2001.

2. S.K. Rogers, M. Kabrisky, “An introduction to biological and artificial neural networks for pattern recognition,” SPIE Optical Engineering Press, 1991.

3. P.D. Wasserman, “Neural computing,” Van Nostrand Reinhold, 1989.
