assessment of historical figures using babelnet - babelnet workshop... · assessment of historical...

14
Assessment of historical figures using BabelNet Szymon Roziewski, Maciej Kowalski National Information Processing Institute, Natural Language Processing Laboratory, Warsaw

Upload: others

Post on 12-Jun-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Assessment of historical figures using BabelNet - BabelNet Workshop... · Assessment of historical figures using BabelNet Szymon Roziewski, Maciej Kowalski National Information Processing

Assessment of historical figures using BabelNet

Szymon Roziewski, Maciej Kowalski

National Information Processing Institute, Natural Language Processing Laboratory, Warsaw

Page 2: Assessment of historical figures using BabelNet - BabelNet Workshop... · Assessment of historical figures using BabelNet Szymon Roziewski, Maciej Kowalski National Information Processing

Motivation

l Building a simple Sentiment Analysis System.

l Providing sentiment analysis of

famous people based on

BabelNet synsets. Actually, they could

be fictional.

l To have some fun :)

Page 3: Assessment of historical figures using BabelNet - BabelNet Workshop... · Assessment of historical figures using BabelNet Szymon Roziewski, Maciej Kowalski National Information Processing

Motivation

Why assessment of historical figures ?l To get a fast sentiment feedback

about people playing a role in History.

l Resulting sentiment might be useful

for another purposes: search

engines, marketing companies etc.

Page 4: Assessment of historical figures using BabelNet - BabelNet Workshop... · Assessment of historical figures using BabelNet Szymon Roziewski, Maciej Kowalski National Information Processing

Method

l We used BabelNet for the core fo the system

l BabelNet provides synset tree for a given entity

l Each tree leaf carries short gloss of its own definition

l We collect closest synsets to build a small corpus of text

surrounding given term

l We perform sentiment analysis based on sentiment dictionaries:

– lexicon dictionary

– SentiWordNet

Page 5: Assessment of historical figures using BabelNet - BabelNet Workshop... · Assessment of historical figures using BabelNet Szymon Roziewski, Maciej Kowalski National Information Processing

BabelNet

l Huge multilingual encyclopedic dictionary and

ontology

l Offers a large network of concepts and named

entities with multilingual lexical coverage

Page 6: Assessment of historical figures using BabelNet - BabelNet Workshop... · Assessment of historical figures using BabelNet Szymon Roziewski, Maciej Kowalski National Information Processing

Sentiment Dictionaries

l Lexicon Dictionary – lexicon built at University of Illinois Chicago

It is divided into positive and negative words

l SentiWordNet (Princeton) – a list of

words described by two real numbers:

the positive and negative impact

Basic forms of words obtained by using

morfologik library.

Page 7: Assessment of historical figures using BabelNet - BabelNet Workshop... · Assessment of historical figures using BabelNet Szymon Roziewski, Maciej Kowalski National Information Processing

BabelNet query for Julius Caesar

Page 8: Assessment of historical figures using BabelNet - BabelNet Workshop... · Assessment of historical figures using BabelNet Szymon Roziewski, Maciej Kowalski National Information Processing

BabelNet query for Julius Caesar

Page 9: Assessment of historical figures using BabelNet - BabelNet Workshop... · Assessment of historical figures using BabelNet Szymon Roziewski, Maciej Kowalski National Information Processing

BabelNet query for Julius Caesar

Page 10: Assessment of historical figures using BabelNet - BabelNet Workshop... · Assessment of historical figures using BabelNet Szymon Roziewski, Maciej Kowalski National Information Processing

Sentiment Analysis of Julius Caesar

Page 11: Assessment of historical figures using BabelNet - BabelNet Workshop... · Assessment of historical figures using BabelNet Szymon Roziewski, Maciej Kowalski National Information Processing

Implementation

l The process has been automated by using Java along with

BabelNet API, for quering the semantic network.

l Parameters used for the synset selection:

– language = EN

– BabelPOS = NOUN

– BabelSenseSource = WIKIDATA

Page 12: Assessment of historical figures using BabelNet - BabelNet Workshop... · Assessment of historical figures using BabelNet Szymon Roziewski, Maciej Kowalski National Information Processing

Sentiment Analysis Results

Historical Figure Positive impact Negative impact

Albert Einstein 0.523 0.477

Isaac Newton 0.7 0.3

Al Capone 0.304 0.696

Marylin Monroe 0.602 0.398

Billy the Kid 0.601 0.399

Julius Caesar 0.6 0.4

Cinderella 0.4776 0.5224

Page 13: Assessment of historical figures using BabelNet - BabelNet Workshop... · Assessment of historical figures using BabelNet Szymon Roziewski, Maciej Kowalski National Information Processing

Future Work

l There were no gold-standard sets to compare with, we assume

such method can be developed further.

l Implementing disambiguating mechanism in order to select the

exact synset carrying glosses related to historical figure in

question.

Page 14: Assessment of historical figures using BabelNet - BabelNet Workshop... · Assessment of historical figures using BabelNet Szymon Roziewski, Maciej Kowalski National Information Processing

Ośrodek Przetwarzania InformacjiPaństwowy Instytut Badawczy

al. Niepodległości 188 b00-608 Warszawa

tel.: +48 22 570 14 00e-mail: [email protected]

Thank you!