artificial neural networkย ยท 1 2 input layer output layer 11 12 1 1 ๐‘1 single layer perceptron...

Post on 28-Jun-2020

16 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

2016.02.04

Hyun Ho Jeon

2019-04-10 1

Artificial Neural Network

Hyun Ho Jeon

ISL Lab Seminar

2

Contents

Introduction Neural Network Back Propagation Implementation

3

Introduction

History

1943

M-P neuron

1957

Perceptron

1974

Back

Propagation

2006

Deep Neural

Network

4

Introduction

Neuron

20

16

6

8

92

60

7

2

1943MP

1957P

1974BP

2006DNN

5

Neural Network

Single Layer Perceptron

๐‘ฅ1

๐‘ฅ๐‘–

๐‘ฅ๐‘›

Input layer Output layer

๐‘ค1

๐‘ค๐‘–

๐‘ค๐‘›

๐‘ฆ1

๐‘  =

๐‘–=1

๐‘›

๐‘ฅ๐‘–๐‘ค๐‘– + ๐‘๐‘—

๐‘ฆ = ๐‘“(๐‘ )

๐‘  ๐‘“

1

๐‘1

๐‘“ ๐‘ก โˆถ

1943MP

1957P

1974BP

2006DNN

6

Neural Network

๐‘ฅ1

๐‘ฅ2

Input layer Output layer

๐‘ค11

๐‘ค12

๐‘ฆ1๐‘  ๐‘“

1

๐‘1

๐‘ฆ1 = ๐‘“ ๐‘ค11๐‘ฅ1 + ๐‘ค12๐‘ฅ2 + ๐‘1 = ๐‘ข1

๐‘ฅ2= โˆ’๐‘ค11

๐‘ค12๐‘ฅ1 โˆ’

๐‘1๐‘ค12

โ‡’ โˆ’๐‘ฅ1 + 1.5

Single Layer Perceptron

โ€ข Example - AND

1943MP

1957P

1974BP

2006DNN

7

Neural Network

๐‘ฅ1

๐‘ฅ2

Input layer Output layer

๐‘ค11

๐‘ค12

๐‘ฆ1๐‘  ๐‘“

1

๐‘1

Single Layer Perceptron

โ€ข Example โ€“ XOR Problem (Minsky, M.; S. Papert (1969). ใ€ŠAn Introduction to Computational Geometryใ€‹. MIT Press.)

1943MP

1957P

1974BP

2006DNN

8

Neural Network

Multi Layer Perceptron (MLP)

๐‘ฅ1

๐‘ฅ๐‘–

๐‘ฅ๐‘๐ผ

Input layer - ๐‘–Hidden layer - ๐‘—

๐‘ค111

1

๐‘๐‘—

Output layer - ๐‘˜

๐‘ค๐‘–๐‘—

1

๐‘ฆ12

๐‘ฆ๐‘˜

๐‘ฆ๐‘๐‘‚2

๐‘ฆ11

๐‘ค112

๐‘ฆ๐‘— ๐‘ค๐‘—๐‘˜

๐‘๐‘˜

1943MP

1957P

1974BP

2006DNN

9

Neural Network

Multi Layer Perceptron (MLP)

Hidden layer - ๐‘—

๐‘ฅ1

๐‘ฅ๐‘–

๐‘ฅ๐‘๐ผ

Input layer - ๐‘–๐‘ค111

1

๐‘๐‘—

Output layer - ๐‘˜

๐‘ค๐‘–๐‘—

1

๐‘ฆ12

๐‘ฆ๐‘˜

๐‘ฆ๐‘๐‘‚2

๐‘ฆ11

๐‘ค112

๐‘ฆ๐‘— ๐‘ค๐‘—๐‘˜

๐‘๐‘˜

1943MP

1957P

1974BP

2006DNN

10

Back Propagation

Back Propagation (Werbos : 1974, Parker : 1982)

๐‘ฅ1

๐‘ฅ๐‘–

๐‘ฅ๐‘๐ผ

Input layer - ๐‘–Hidden layer - ๐‘—

๐‘ค111

1

๐‘๐‘—

Output layer - ๐‘˜

๐‘ค๐‘–๐‘—

1

๐‘ฆ12

๐‘ฆ๐‘˜

๐‘ฆ๐‘๐‘‚2

๐‘ฆ11

๐‘ค112

๐‘ฆ๐‘— ๐‘ค๐‘—๐‘˜

๐‘๐‘˜

๐‘ฆ๐‘‘๐‘˜

๐‘’๐‘˜

1943MP

1957P

1974BP

2006DNN

11

Back Propagation

Update Process

๐‘ค ๐‘ก + 1 = ๐‘ค ๐‘ก + โˆ†๐‘ค ๐‘กMeasurement Process

๐‘’ ๐‘ก = ๐‘ฆ๐‘‘ ๐‘ก โˆ’ ๐‘ฆ(๐‘ก)

Back Propagation (Werbos : 1974, Parker : 1982)

โ€ข Concept of gradient descent algorithm

1943MP

1957P

1974BP

2006DNN

12

Back Propagation

Weight update

๐‘ค(๐‘ก + 1) = ๐‘ค(๐‘ก) โˆ’ โˆ†๐‘ค(๐‘ก)

Back Propagation (Werbos : 1974, Parker : 1982)

โ€ข Gradient descent algorithm

๐ธ๐‘Ÿ๐‘Ÿ๐‘œ๐‘Ÿ ๐ธ

๐ธ๐‘š๐‘–๐‘›

๐‘ค๐‘ค0๐‘ค1๐‘ค2๐‘คโˆ—

Objective function

E =1

2

๐‘˜=1

๐‘๐‘‚

๐‘’๐‘—2

The gradient

โˆ†๐‘ค(๐‘ก) = โˆ’๐œ‚๐œ•๐ธ

๐œ•๐‘ค

1943MP

1957P

1974BP

2006DNN

13

Back Propagation

Back Propagation (Werbos : 1974, Parker : 1982)

โ€ข Example

1943MP

1957P

1974BP

2006DNN

14

Back Propagation

Back Propagation (Werbos : 1974, Parker : 1982)

Update Process

๐‘ค ๐‘ก + 1 = ๐‘ค ๐‘ก + โˆ†๐‘ค ๐‘ก + ๐›ผโˆ†๐‘ค(๐‘ก โˆ’ 1)

Measurement Process

โˆ†๐‘ค๐‘—๐‘˜ ๐‘ก = ๐œ‚๐‘’๐‘˜๐‘“โ€ฒ ๐‘ ๐‘˜ ๐‘ฆ๐‘—

ฮ”๐‘๐‘˜ ๐‘ก = ๐œ‚๐‘’๐‘˜๐‘“โ€ฒ ๐‘ ๐‘˜

โˆ†๐‘ค๐‘–๐‘— ๐‘ก = ๐œ‚๐‘“โ€ฒ ๐‘ ๐‘— ๐‘ฅ๐‘–

๐‘˜=1

๐‘๐‘‚

๐‘’๐‘˜ ๐‘“โ€ฒ ๐‘ ๐‘˜ ๐‘ค๐‘—๐‘˜

โˆ†๐‘๐‘— ๐‘ก = ๐œ‚๐‘“โ€ฒ ๐‘ ๐‘—

๐‘˜=1

๐‘๐‘‚

๐‘’๐‘˜ ๐‘“โ€ฒ(๐‘ ๐‘˜)๐‘ค๐‘—๐‘˜

1943MP

1957P

1974BP

2006DNN

15

Implementation

XOR Classification

๐œ‚ = 0.1 ๐œ‚ = 0.5 ๐œ‚ = 0.9

1943MP

1957P

1974BP

2006DNN

16

Implementation

XOR Classification ๐œ‚ = 0.9

- Parameter - Output - Classification

1943MP

1957P

1974BP

2006DNN

17

Implementation

Deafsign Classification

- Model - ๐‘ฅ๐‘– & ๐‘ฆ๐‘‘

1943MP

1957P

1974BP

2006DNN

18

Implementation

Deafsign Classification

- Learning - result

1943MP

1957P

1974BP

2006DNN

19

Implementation

Deafsign Classification

- Learning - result

1943MP

1957P

1974BP

2006DNN

20

Deep Learning

Local minima problem

- Local minima - Unsupervised Learning => Pre-training

1943MP

1957P

1974BP

2006DNN

21

Depp Learning

Deep Neural Network

Leon A. Gatys, Alexander S. Ecker, Matthias Bethge. โ€ A Neural Algorithm of Artistic Styleโ€.

1943MP

1957P

1974BP

2006DNN

22

Q&A

top related