deep learning intro. - kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘–...

31
Deep Learning intro. 2016.01.02.

Upload: others

Post on 17-Jun-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ

Deep Learning intro.

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ

2016.01.02.

Page 2: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 2

Outline

Natural Language Processing (NLP)

Representation and Processing

Deep Learning Models

Page 3: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ

Natural Language Processing

Page 4: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 4

Natural Language Processing (NLP)

โ€ข ๋‹ต๋ณ€

โ€ข ๊ฒ€์ƒ‰

โ€ข ์ถ”๋ก 

โ€ข ๋Œ€ํ™”

์–ธ์–ด์ดํ•ด ์–ธ์–ด์ƒ์„ฑ์‘์šฉโ€ข ์ง€๋Šฅํ˜•๋กœ๋ด‡

โ€ข ์ •๋ณด๊ฒ€์ƒ‰

โ€ข ๊ธฐ๊ณ„๋ฒˆ์—ญ

โ€ข ๋ฌธ์„œ์š”์•ฝ

โ€ข ์งˆ๋ฌธ

โ€ข ๋‹จ์–ด์ดํ•ด

โ€ข ์˜๋ฏธ์ดํ•ด

โ€ข ์˜๋„ํŒŒ์•…

Page 5: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ

Representation and Processing

Page 6: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 6

Representation in mathematics

<0.156, 0.421, 0.954, โ€ฆ>

<0.096, 0.510, 0.991, โ€ฆ>

<0.496, 0.951, 0.321, โ€ฆ>

<0.196, 0.851, 0.119, โ€ฆ>

<โ€ฆ, 0.486, 0.854, โ€ฆ>

<โ€ฆ, 0.751, 0.912, โ€ฆ>

<โ€ฆ, 0.123, 2.554, 5.124, โ€ฆ>

<โ€ฆ, 7.451, 21.45, 8.999>

<โ€ฆ, 1.109, 11.854, 0.456>

Real World Vector Space

https://www.google.com/imghp?hl=ko

Page 7: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 7

์˜ค๋ฆฌ vs. ํ† ๋ผ

Page 8: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 8

์œ„์žฅ

Page 9: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 9

Neural Network for Human

https://uncyclopedia.kr/wiki/%EB%87%8C

Neural Network

Pattern recognition

Multi layer

Human: 10 layers

I see lion

Page 10: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 10

Neural Network

Vector representation

Pattern of layers

+ Learning

Page 11: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 11

Pattern of layers

Deep learning automatic pattern combination

Why we say deep ?

โ€ฆ โ€ฆ โ€ฆ โ€ฆ โ€ฆ โ€ฆ

โ€ฆ

Unit

layer

n

m

Connection link: (n x n) x (m-1)

Automatic combination

Page 12: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 12

How to use layers?

Input vector

Output real number or class (vector)

Vector representation โ€œOne-hotโ€

Page 13: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 13

Vector representation

[Symbol]

Lion[Text representation] [One-hot representation]

<0, 0, 0, 0, 0, 1, 0, 0, 0, 0, โ€ฆ>

[Symbol representation]

<1.45, 75.12, 0.425, 0.953, โ€ฆ>

Page 14: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 14

Jung, DEEP LEARNING FOR KOREAN NLP

Page 15: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 15

How to define symbol to one-hot

Lion

Big cat

[Symbolic words]

<0, 0, 1, 0, 0>

<0, 1, 0, 0, 1>

[One-hot]

If it uses AND op., two words is non-match

โˆด we need symbolic vector representation

Page 16: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 16

How to define symbol to one-hot

Lion

Big cat

TigerDog

Wolf

Mouse

โˆด [Symbolic representation]

<0, 0, 1, 0, 0>

<0, 1, 0, 0, 1>

<1.45, 75.12, 0.425, 0.953, โ€ฆ>

<1.78, 61.11, 0.611, 2.011, โ€ฆ>

Use cosine similarity

[Symbolic vectors] (from NNLM)

Page 17: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 17

Neural Network Language Model

Feed-forward NN

parametric Estimator

overall parameter set ๐œƒ = (๐ถ,๐‘ค)

one-hot representationโ€ข [0 1 0 0 0 0 0 0 0 0]

Lookup Tableโ€ข word embedding

Non-linear projectionโ€ข activation function

Normalize weightโ€ข softmax (length: ๐‘›)

Page 18: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 18

Neural Network Language Model

max๐œƒ โ†’ ๐‘™๐‘œ๐‘‘ ๐‘™๐‘–๐‘˜๐‘’๐‘™๐‘–โ„Ž๐‘œ๐‘œ๐‘‘

๐ฟ = max๐œƒ

1

๐‘‡ ๐‘ก ๐‘™๐‘œ๐‘”๐‘“(๐‘ค๐‘ก, ๐‘ค๐‘กโˆ’1, โ€ฆ , ๐‘ค๐‘กโˆ’๐‘›+1)

parametersโ€ข โ„Ž: ๐‘กโ„Ž๐‘’ ๐‘›๐‘ข๐‘š๐‘๐‘’๐‘Ÿ ๐‘œ๐‘“ โ„Ž๐‘–๐‘‘๐‘‘๐‘’๐‘› ๐‘ข๐‘›๐‘–๐‘ก๐‘ 

โ€ข ๐‘š: ๐‘กโ„Ž๐‘’ ๐‘›๐‘ข๐‘š๐‘๐‘’๐‘Ÿ ๐‘œ๐‘“ ๐‘“๐‘’๐‘Ž๐‘ก๐‘ข๐‘Ÿ๐‘’๐‘  ๐‘ค๐‘–๐‘กโ„Ž ๐‘’๐‘Ž๐‘โ„Ž ๐‘ค๐‘œ๐‘Ÿ๐‘‘

โ€ข ๐‘: ๐‘กโ„Ž๐‘’ ๐‘œ๐‘ข๐‘ก๐‘๐‘ข๐‘ก ๐‘๐‘–๐‘Ž๐‘ ๐‘’๐‘ 

โ€ข ๐‘‘: ๐‘กโ„Ž๐‘’ โ„Ž๐‘–๐‘‘๐‘‘๐‘’๐‘› ๐‘™๐‘Ž๐‘ฆ๐‘’๐‘Ÿ ๐‘๐‘–๐‘Ž๐‘ ๐‘’๐‘ 

โ€ข ๐‘ˆ: โ„Ž โˆ’ ๐‘ก๐‘œ โˆ’ ๐‘œ ๐‘ค๐‘’๐‘–๐‘”โ„Ž๐‘ก๐‘ 

โ€ข ๐‘Š: ๐ผ โˆ’ ๐‘ก๐‘œ โˆ’ ๐‘œ ๐‘ค๐‘’๐‘–๐‘”โ„Ž๐‘ก๐‘ 

โ€ข ๐ป: ๐ผ โˆ’ ๐‘ก๐‘œ โˆ’ ๐ป ๐‘ค๐‘’๐‘–๐‘”โ„Ž๐‘ก๐‘ 

โ€ข ๐ถ:๐‘ค๐‘œ๐‘Ÿ๐‘‘ ๐‘“๐‘’๐‘Ž๐‘ก๐‘ข๐‘Ÿ๐‘’๐‘  (๐‘™๐‘œ๐‘œ๐‘˜๐‘ข๐‘ ๐‘ก๐‘Ž๐‘๐‘™๐‘’)

โ€ข ๐œƒ = (๐‘, ๐‘‘,๐‘Š, ๐‘ˆ,๐ป, ๐ถ)

Page 19: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 19

NNLM for Korean

Leeck, ๋”ฅ๋Ÿฌ๋‹์„์ด์šฉํ•œํ•œ๊ตญ์–ด์˜์กด๊ตฌ๋ฌธ๋ถ„์„

Page 20: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ

Deep Learning Models

Page 21: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 21

Deep learning Models

โ€œ๊ฐ•๋Œ€์ฃผ๋ณ€์—์Šคํƒ€๋ฒ…์Šค์œ„์น˜๊ฐ€์–ด๋””์•ผ?โ€โ€ข ๊ฐ•๋Œ€/NNG ์ฃผ๋ณ€/NNG ์—/JX ์Šคํƒ€๋ฒ…์Šค/NNG โ€ฆ

Feed-forward Neural Network (FFNN)

๐‘Š๐‘ก

Y

๊ฐ•๋Œ€

NNG

์ฃผ๋ณ€

NNG

์—

JX

FFNN:

1-FFNN 2-FFNN 3-FFNN

Page 22: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 22

Deep learning Models

โ€œ๊ฐ•๋Œ€์ฃผ๋ณ€์—์Šคํƒ€๋ฒ…์Šค์œ„์น˜๊ฐ€์–ด๋””์•ผ?โ€โ€ข ๐‘Œ๐‘ก๐‘’๐‘ฅ๐‘ก [๊ฐ•๋Œ€์ฃผ๋ณ€์—์Šคํƒ€๋ฒ…์Šค์œ„์น˜], [์–ด๋””]

โ€ข ๐‘Œ๐‘ก๐‘Ž๐‘”๐‘  [ B I I I I ], [ B ]

Recurrent Neural Network (RNN)

๐‘Š๐‘ก

Y

unfold ๊ฐ•๋Œ€

B

์ฃผ๋ณ€

I

์—

I

์Šคํƒ€๋ฒ…์Šค

I

์œ„์น˜

I

RNN

Page 23: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 23

Deep learning Models

โ€œ๊ฐ•๋Œ€์ฃผ๋ณ€์—์Šคํƒ€๋ฒ…์Šค์œ„์น˜๊ฐ€์–ด๋””์•ผ?โ€โ€ข ๐‘Œ๐‘ก๐‘’๐‘ฅ๐‘ก [๊ฐ•๋Œ€์ฃผ๋ณ€์—์Šคํƒ€๋ฒ…์Šค์œ„์น˜], [์–ด๋””]

โ€ข ๐‘Œ๐‘ก๐‘Ž๐‘”๐‘  [ B I I I I ], [ B ]

Long Short-Term Memory RNN (LSTM-RNN)โ€ข Using gate matrix (LSTM or GRU)

๐‘Š๐‘ก

Y

unfold ๊ฐ•๋Œ€

B

์ฃผ๋ณ€

I

์—

I

์Šคํƒ€๋ฒ…์Šค

I

์œ„์น˜

I

LSTM-RNN

Page 24: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 24

Deep learning Models

โ€œ๊ฐ•๋Œ€์ฃผ๋ณ€์—์Šคํƒ€๋ฒ…์Šค์œ„์น˜๊ฐ€์–ด๋””์•ผ?โ€โ€ข ๐‘Œ๐‘ก๐‘’๐‘ฅ๐‘ก [๊ฐ•๋Œ€์ฃผ๋ณ€์—์Šคํƒ€๋ฒ…์Šค์œ„์น˜], [์–ด๋””]

โ€ข ๐‘Œ๐‘ก๐‘Ž๐‘”๐‘  [ B I I I I ], [ B ]

LSTM-RNN CRF โ€ข Using gate matrix (LSTM or GRU)

๐‘Š๐‘ก

Y

unfold ๊ฐ•๋Œ€

B

์ฃผ๋ณ€

I

์—

I

์Šคํƒ€๋ฒ…์Šค

I

์œ„์น˜

I

LSTM-RNN

Viterbi or Beam search

Page 25: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 25

Deep learning Models

โ€œ๊ฐ•๋Œ€์ฃผ๋ณ€์—์Šคํƒ€๋ฒ…์Šค์œ„์น˜๊ฐ€์–ด๋””์•ผ?โ€โ€ข ๐‘Œ๐‘ก๐‘’๐‘ฅ๐‘ก [๊ฐ•๋Œ€์ฃผ๋ณ€์—์Šคํƒ€๋ฒ…์Šค์œ„์น˜], [์–ด๋””]

โ€ข ๐‘Œ๐‘ก๐‘Ž๐‘”๐‘  [ B I I I I ], [ B ]

Bidirectional LSTM-RNN CRF (Bi-LSTM-RNN CRF)โ€ข Using gate matrix (LSTM or GRU)

Viterbi or Beam search

๊ฐ•๋Œ€

B

์ฃผ๋ณ€

I

์—

I

์Šคํƒ€๋ฒ…์Šค

I

์œ„์น˜

I

forward

backward

Page 26: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 26

Deep learning Models

Sequence-to-sequence model

Two different LSTM: Input/output sentence LSTM

Using the Shallow LSTM

Reverse input sentence

Training: Decoding & Rescoring

Page 27: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 27

Deep learning Models

Encoder-Decoder Architecture

Page 28: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 28

Pointer Networks

โ€ข Seq2seq์™€ attention mechanism ์„๊ธฐ๋ฐ˜์œผ๋กœํ•œ๋”ฅ๋Ÿฌ๋‹๋ชจ๋ธ

โ€ข ์ž…๋ ฅ์—ด์˜์œ„์น˜(์ธ๋ฑ์Šค)๋ฅผ์ถœ๋ ฅ์—ด๋กœํ•˜๋Š”๋ชจ๋ธ

โ€ข X = {A:0, B:1, C:2, D:3, <EOS>:4}

โ€ข Y = {3, 2, 0, 4}

A B C D <EOS> D C A <EOS>

Encoding Decoding

Deep learning Models

Page 29: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 29

Deep learning Models

Siamese Neural Network

Page 30: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 30

References

Jung, DEEP LEARNING FOR KOREAN NLP

Lee, ๋”ฅ๋Ÿฌ๋‹์„์ด์šฉํ•œํ•œ๊ตญ์–ด์˜์กด๊ตฌ๋ฌธ๋ถ„์„

Park, Point networks for Coreference Resolution

Park, Bi-LSTM-RNN CRF for Mention Detection

Page 31: Deep Learning intro. - Kangwoncs.kangwon.ac.kr/.../12_deeplearning_intro.pdfย ยท 2016-06-17ย ยท ๐‘– ๐œถ4 Natural Language Processing (NLP) โ€ข๋‹ต๋ณ€ โ€ข๊ฒ€์ƒ‰ โ€ข์ถ”๋ก  โ€ข๋Œ€ํ™”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ 31

QA

๊ฐ์‚ฌํ•ฉ๋‹ˆ๋‹ค.

๋ฐ•์ฒœ์Œ, ์ตœ์ˆ˜๊ธธ, ๋ฐ•์ฐฌ๋ฏผ, ์ตœ์žฌํ˜, ํ™๋‹ค์†”

๐‘ ๐‘–๐‘”๐‘š๐‘Ž ๐œถ , ๊ฐ•์›๋Œ€ํ•™๊ต

Email: [email protected]