extracting opinion targets in a single- and cross-domain ... · 1 extracting opinion targets in a...

37
1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and Iryna Gurevych, 2010 Matteo Bonaker, 2017 Selected Topics in Sentiment Analysis Dr. Michael Wiegand Folien / Präsentation Veranstaltung Dozent

Upload: nguyennhi

Post on 10-Aug-2019

222 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and

1

Extracting Opinion Targets in aSingle- and Cross-Domain SettingWith Conditional Random Fields

by

Niklas Jakob and Iryna Gurevych, 2010

Matteo Bonaker, 2017Selected Topics in Sentiment AnalysisDr. Michael Wiegand

Folien / PräsentationVeranstaltung

Dozent

Page 2: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and
Page 3: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and

3

Kontext

‣ 2010 veröffentlicht

‣ Damals existierende Forschung:

‒ Sentiment-Analysis auf Cross-Domain

‒ CRF für Sentiment-Analysis

‒ Opinion-Target-Extraction auf Single-Domain

‣ Neue Kombination:

‒ Opinion-Target-Extraction auf Cross-Domain (mit CRF)

Page 4: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and
Page 5: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and
Page 6: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and

6

CRF

CR

‣ CRF ist ein Machine-Learning-Modell

‣ Ähnlich wie HMMs

‣ Eingabe-Format unterschiedlich:

Bernie

would

have

won

HMM

NN

MD

VB

VBN

Theorie Anwendung Cross-Dom.

Page 7: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and

7

CRF

CR

‣ CRF ist ein Machine-Learning-Modell

‣ Ähnlich wie HMMs

‣ Eingabe-Format unterschiedlich:

Bernie

would

have

won

CRF

B

O

O

O

NN

MD

VB

VBN

Theorie Anwendung Cross-Dom.

Page 8: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and

8

Ausgabe

CR

‣ Ausgabe des CRF: Block-Markierung

‣ „Auflösung“ des Opinion-Target wird nicht gemacht

Bernie

would

have

won

CRF

B

O

O

O

...

...

...

...

B

O

O

O

Theorie Anwendung Cross-Dom.

Page 9: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and

9

Ausgabe (IOB)

‣ Beginning …

‣ Inside … … of Opinion Target

‣Outside …

‣Markierung angrenzender Blöcke möglich

Einfaches Beispiel: I like trains. O O B O

Beispiel:

Even though I like cottage cheese veganism is more important to me. O O O O B I B O O O O O O

Page 10: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and

10

Eingabe

‣ Token

‣ POS

‣ Short Dependency Path

‣ Word Distance

‣ Opinion Sentence

CRF

IOB

IOB

IOB

IOB

IOB

Token

Token

Token

Token

Token

POS

POS

POS

POS

POS

SDP

SDP

SDP

SDP

SDP

WD

WD

WD

WD

WD

OS

OS

OS

OS

OS

Theorie Anwendung Cross-Dom.

Page 11: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and

11

Eingabe

‣ Token

‒ Wort / Satzzeichen

‒ Lernbar: „crew“ häufiger Target, als „admire“admire

the

camera

crew

of

Gladiator

I

admire

the

camera

crew

of

Gladiator

I

..

Theorie Anwendung Cross-Dom.

Page 12: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and

12

Eingabe

‣ Token

‣ POS

‒ Lernbar: NN häufiger Target, als VBRVBR

DT

NN

NN

IN

NNP

PRP

admire

the

camera

crew

of

Gladiator

I

..

Theorie Anwendung Cross-Dom.

Page 13: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and
Page 14: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and

14

Eingabe

‣ Token

‣ POS

‣ Short Dependency Path

‣ Word Distance

‒ Zu jeder Opinion-Expression:NP mit kürzester Distanz

‒ Lernbar: Target häufig in nahegelegener NP

N

Y

Y

Y

N

N

Y

admire

the

camera

crew

of

Gladiator

I

N.

Theorie Anwendung Cross-Dom.

Page 15: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and

15

Eingabe

‣ Token

‣ POS

‣ Short Dependency Path

‣ Word Distance

‣ Opinion Sentence

‒ Wenn eine Opinion-Expression da ist,alle Token des Satzes markieren!

‒ Lernbar: Sätze ohne Opinion ignorieren

Y

Y

Y

Y

Y

Y

Y

admire

the

camera

crew

of

Gladiator

I

Y.

Theorie Anwendung Cross-Dom.

Page 16: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and

16

Eingabe

‣ Token

‣ POS

‣ Short Dependency Path

‣Word Distance

‣Opinion Sentence

CRF

IOB

IOB

IOB

IOB

IOB

Token

Token

Token

Token

Token

POS

POS

POS

POS

POS

SDP

SDP

SDP

SDP

SDP

WD

WD

WD

WD

WD

OS

OS

OS

OS

OS

Theorie Anwendung Cross-Dom.

Page 17: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and
Page 18: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and
Page 19: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and
Page 20: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and
Page 21: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and
Page 22: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and
Page 23: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and
Page 24: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and
Page 25: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and
Page 26: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and
Page 27: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and

27

Cross-Domain

‣ Warum sollte Cross-Domain schwieriger sein?

‒ Domain-Spezifisches „Wissen“ des CRF!

‣ Hier nochmal die Features:

‒ Token

‒ POS

‒ Short Dependency Path

‒ Word Distance

‒ Opinion Sentence

Theorie Anwendung Cross-Dom.

Page 28: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and
Page 29: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and
Page 30: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and

30

Vorgehen

‣ Token-Feature für Cross-Domain entfernt

‣ Training auf 3 oder weniger Domains

‣ Test auf einer der übrigen Domains

Theorie Anwendung Cross-Dom.

Page 31: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and
Page 32: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and
Page 33: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and
Page 34: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and

34

Schlussfolgerung

‣ Single-Domain‒ Höchster F-Score

Meiste Trainings-Daten

Alle Features

‣ Cross-Domain‒ Höchster F-Score

Training auf Kamera-Domain

Test auf Movie-Domain

Ohne Token-Feature

Theorie Anwendung Cross-Dom.

Page 35: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and

35

Danke für eure Aufmerksamkeit!

Page 36: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and
Page 37: Extracting Opinion Targets in a Single- and Cross-Domain ... · 1 Extracting Opinion Targets in a Single- and Cross-Domain Setting With Conditional Random Fields by Niklas Jakob and