Comp 3503 / 5013
Dynamic Neural Networks

Daniel L. Silver
March, 2014

Outline

• Hopfield Networks
• Boltzmann Machines
• Mean Field Theory
• Restricted Boltzmann Machines (RBM)

Dynamic Neural Networks

• See handout for image of spider, beer and dog
• The search for a model or hypothesis can be considered the relaxation of a dynamic system into a state of equilibrium
• This is the nature of most physical systems
– Pool of water
– Air in a room
• The mathematics is that of thermodynamics
– Quote from John von Neumann

Hopfield Networks

• See handout
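As a rough illustration of relaxation into equilibrium, the sketch below stores a pattern in a small Hopfield network and lets a corrupted copy settle back to it. The slides do not give these details; the Hebbian storage rule, the ±1 unit coding, the fixed update order, and the function names `train_hebbian`, `energy`, and `relax` are assumptions chosen for the example.

```python
import numpy as np

def train_hebbian(patterns):
    """Build a symmetric weight matrix from +/-1 patterns (assumed Hebbian rule)."""
    patterns = np.asarray(patterns, dtype=float)
    n_units = patterns.shape[1]
    w = patterns.T @ patterns / n_units
    np.fill_diagonal(w, 0.0)  # no self-connections
    return w

def energy(w, state):
    """Energy of a state; asynchronous updates never increase it."""
    return -0.5 * state @ w @ state

def relax(w, state, max_sweeps=20):
    """Update units one at a time until no unit changes (an equilibrium)."""
    s = np.array(state, dtype=float)
    for _ in range(max_sweeps):
        changed = False
        for i in range(len(s)):
            new_value = 1.0 if w[i] @ s >= 0 else -1.0
            if new_value != s[i]:
                s[i] = new_value
                changed = True
        if not changed:
            break
    return s

stored = np.array([1, -1, 1, -1, 1, -1, 1, -1], dtype=float)
w = train_hebbian([stored])
noisy = stored.copy()
noisy[0] = -noisy[0]          # corrupt one unit
recalled = relax(w, noisy)    # settles back to the stored pattern
```

The corrupted state has higher energy than the stored one, so settling here amounts to moving downhill on the energy surface.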

Hopfield Networks

• Hopfield Network video intro
– http://www.youtube.com/watch?v=gfPUWwBkXZY
– http://faculty.etsu.edu/knisleyj/neural/
• Try these applets:
– http://lcn.epfl.ch/tutorial/english/hopfield/html/index.html
– http://www.cbu.edu/~pong/ai/hopfield/hopfieldapplet.html

Hopfield Networks

Basics with Geoff Hinton:
• Introduction to Hopfield Nets
– http://www.youtube.com/watch?v=YB3-Hn-inHI
• Storage capacity of Hopfield Nets
– http://www.youtube.com/watch?v=O1rPQlKQBLQ

Hopfield Networks

Advanced concepts with Geoff Hinton:
• Hopfield nets with hidden units
– http://www.youtube.com/watch?v=bOpddsa4BPI
• Necker Cube
– http://www.cs.cf.ac.uk/Dave/JAVA/boltzman/Necker.html
• Adding noise to improve search
– http://www.youtube.com/watch?v=kVgT2Eaa6KA

Boltzmann Machine

• See handout
• http://www.scholarpedia.org/article/Boltzmann_machine

Basics with Geoff Hinton:
• Modeling binary data
– http://www.youtube.com/watch?v=MKdvJst8a6k
• BM Learning Algorithm
– http://www.youtube.com/watch?v=QgrFsnHFeig

Limitations of BMs

• BM learning does not scale well
• This is due to several factors, the most important being:
– The time the machine must be run in order to collect equilibrium statistics grows exponentially with the machine's size (number of nodes)
  • For each example: sample nodes, sample states
– Connection strengths are more plastic when the units have activation probabilities intermediate between zero and one. Noise causes the weights to follow a random walk until the activities saturate (variance trap).

Potential Solutions

• Use a momentum term as in BP:
  $w_{ij}(t+1) = w_{ij}(t) + \eta\,\Delta w_{ij} + \alpha\,\Delta w_{ij}(t-1)$
• Add a penalty term to create sparse coding (encourage shorter encodings for different inputs)
• Use implementation tricks to do more in memory: batches of examples
• Restrict number of iterations in + and – phases
• Restrict connectivity of network
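The momentum update can be sketched in a few lines. This is a minimal illustration, assuming that Δw_ij(t-1) denotes the total weight change applied at the previous step (the usual convention in backpropagation); the function name `momentum_step` and the default values of `eta` and `alpha` are hypothetical.

```python
import numpy as np

def momentum_step(w, delta_w, prev_change, eta=0.1, alpha=0.9):
    """One update w(t+1) = w(t) + eta * delta_w + alpha * previous change.

    delta_w is the current learning signal (already pointing in the
    direction the weights should move); prev_change is the change applied
    at the previous step (assumption: total change, not raw signal).
    """
    change = eta * delta_w + alpha * prev_change
    return w + change, change

w = np.zeros(2)
signal = np.ones(2)
w, change = momentum_step(w, signal, np.zeros(2))   # first step: no history
w, change = momentum_step(w, signal, change)        # second step reuses it
```

With a constant learning signal the applied change grows from step to step, which is the smoothing and acceleration effect the momentum term is meant to provide.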

Restricted Boltzmann Machine

Source: http://blog.echen.me/2011/07/18/introduction-to-restricted-boltzmann-machines/

Recall = Relaxation

[Figure, repeated over several animation frames: an RBM with visible units v_i and hidden units h_j, the hidden units labeled "SF/Fantasy" and "Oscar Winner", plus a label "vo or ho". Activation passes back and forth between the two layers.]

• Hidden unit h_j: $\Sigma_j = \sum_i w_{ij} v_i$, $p_j = 1/(1 + e^{-\Sigma_j})$
• Visible unit v_i: $\Sigma_i = \sum_j w_{ij} h_j$, $p_i = 1/(1 + e^{-\Sigma_i})$
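The recall pass between the two layers can be sketched as follows. This is a minimal illustration rather than the course's implementation: bias terms are left out, the units are assumed binary and sampled from their probabilities, and the names `hidden_probs`, `visible_probs`, and `recall` are hypothetical.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def hidden_probs(v, w):
    """p_j = sigmoid(sum_i w_ij v_i); w has shape (n_visible, n_hidden)."""
    return sigmoid(v @ w)

def visible_probs(h, w):
    """p_i = sigmoid(sum_j w_ij h_j)."""
    return sigmoid(h @ w.T)

def recall(v, w, rng, n_steps=1):
    """Alternate hidden and visible updates, sampling binary states."""
    for _ in range(n_steps):
        p_h = hidden_probs(v, w)
        h = (rng.random(p_h.shape) < p_h).astype(float)
        p_v = visible_probs(h, w)
        v = (rng.random(p_v.shape) < p_v).astype(float)
    return v, h

rng = np.random.default_rng(0)
w = rng.normal(scale=0.1, size=(6, 2))   # 6 visible units, 2 hidden units
v0 = np.array([1, 1, 1, 0, 0, 0], dtype=float)
v_recalled, h_state = recall(v0, w, rng, n_steps=5)
```

Running more steps lets the network relax further from the starting pattern toward states the weights make probable.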

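Learning = ~ Gradient Descent = Contrastive Divergence

[Figure, repeated over several animation frames: the same RBM diagram, with the "vo or ho" label, annotated with the steps below.]

• Hidden unit h_j: $\Sigma_j = \sum_i w_{ij} v_i$, $p_j = 1/(1 + e^{-\Sigma_j})$
• Visible unit v_i: $\Sigma_i = \sum_j w_{ij} h_j$, $p_i = 1/(1 + e^{-\Sigma_i})$

Steps:
• Update hidden units: $P = P + v_i h_j$
• Reconstruct visible units
• Re-update hidden units: $N = N + v_i h_j$
• Compute $\Delta w_{ij} = \langle P \rangle - \langle N \rangle$
• Update weights: $w_{ij} = w_{ij} + \eta\,\Delta w_{ij}$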
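The learning steps can be sketched as a single contrastive divergence update over a batch of examples. This is a minimal sketch under stated assumptions: one reconstruction step (CD-1), no bias units, sampled binary states used for both the positive statistics P and the negative statistics N, and averages taken over the batch. The function name `cd1_update` and the default `eta` are hypothetical.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def sample(p, rng):
    """Draw binary states from unit probabilities."""
    return (rng.random(p.shape) < p).astype(float)

def cd1_update(w, batch, rng, eta=0.1):
    """One CD-1 weight update; w is (n_visible, n_hidden), batch is (n, n_visible)."""
    # Update hidden units from the data and accumulate P = sum of v_i h_j.
    h = sample(sigmoid(batch @ w), rng)
    positive = batch.T @ h
    # Reconstruct visible units from the hidden states.
    v_recon = sample(sigmoid(h @ w.T), rng)
    # Re-update hidden units and accumulate N = sum of v_i h_j.
    h_recon = sample(sigmoid(v_recon @ w), rng)
    negative = v_recon.T @ h_recon
    # delta_w = <P> - <N>, averaged over the batch, then w = w + eta * delta_w.
    delta_w = (positive - negative) / len(batch)
    return w + eta * delta_w

rng = np.random.default_rng(0)
data = np.array([[1, 1, 1, 0, 0, 0],
                 [0, 0, 1, 1, 1, 0]], dtype=float)
w = rng.normal(scale=0.1, size=(6, 2))
w_new = cd1_update(w, data, rng)
```

Many implementations use the hidden probabilities rather than sampled states when accumulating the statistics, which reduces sampling noise; the sampled version is used here because it follows the steps as listed.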

• RBM Overview:
– http://blog.echen.me/2011/07/18/introduction-to-restricted-boltzmann-machines/
• Wikipedia on DLA and RBM:
– http://en.wikipedia.org/wiki/Deep_learning
• RBM Details and Code:
– http://www.deeplearning.net/tutorial/rbm.html

Geoff Hinton on RBMs:
• RBMs and Contrastive Divergence Algorithm
– http://www.youtube.com/watch?v=fJjkHAuW0Yk
• An example of RBM Learning
– http://www.youtube.com/watch?v=Ivj7jymShN0
• RBMs applied to Collaborative Filtering
– http://www.youtube.com/watch?v=laVC6WFIXjg

Additional References

• Coursera course, Neural Networks for Machine Learning:
– https://class.coursera.org/neuralnets-2012-001/lecture
• ML: Hottest Tech Trend in next 3-5 Years
– http://www.youtube.com/watch?v=b4zr9Zx5WiE
