creating a commonsense knowledge base about...

50
Creating a Commonsense Knowledge Base about Objects Valerio Basile (University of Turin) SAILab, Siena 12/2/2020

Upload: others

Post on 04-Jul-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Creating a Commonsense Knowledge Base about Objects

Valerio Basile (University of Turin)

SAILab, Siena12/2/2020

Page 2: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

whoamiValerio Basile

Assistant professor @ UnitoPreviously:● PhD @ RUG Groningen● Postdoc @ Inria

Computational Semantics, Semantic Web, Natural Language Generation, Information Extraction, Linguistic Annotation, Distributional Semantics, General Knowledge Bases, Gamification, Social Media, Sentiment Analysis, Legal Informatics, Argument Mining, Social Media, Hate Speech, ...

Page 3: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Robotics and Artificial Intelligence

Objects

Linguistics and Semantics

Machine Learning and Clustering

Today

Page 4: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

● I Motivation: The Semantics of Objects● II Objects, Knowledge and The Web● III Objects, Words and Vectors● IV Frames and Prototypical Knowledge● V Default Knowledge about Objects

Today

Page 5: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Part IMotivation:

The Semantics of Objects

Page 6: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

5-year CHIST-ERA funded project (2014-2018)

4 EU partners

Page 7: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Deploy robots in human-inhabited environments.

The robots autonomously collect real-world data.

We use information available on the Semantic Web to identify the semantics of objects.

Page 8: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge
Page 9: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge
Page 10: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge
Page 11: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Perception and Identification

Robot deployments in office environments

The robot visits fixed waypoints on the map, taking full 360° RGB-D scans

Page 12: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

● Object classification

● Room detection

● Frame detection

● Inference

● ...

Page 13: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Part IIObjects, Knowledge and The

Web

Page 14: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

ClassificationWhat is (not) an object?What type is an object?

What is a room?...

RelationsHow are objects related?

Where is an object?What can I do with an object?

...

Object Knowledge

Page 15: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

http://lod-cloud.net/

Linked Open Data

Page 16: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Web KBs

● DBpediahub for LOD

● KnowRobsmaller, manually crafted, robotic-oriented

● ConceptNetlarge, automatically built, not-LOD

Page 17: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

The SUN database

https://groups.csail.mit.edu/vision/SUN/

Page 18: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Web KBs

Page 19: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Keyword Linking MethodsVector-based Contextual disambiguation

● Run String Match on the keywords● Split the missed keywords into tokens● Run String Match on the tokens● Compute the semantic similarity of each

token-entity with all the previously recognized entities

● Select the highest scoring token-entity

e.g., basket_of_banana dbr:→ Basket

Page 20: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Results2,493 objects in DBpedia679 locations in DBpedia2,935 object-location relations

The SUN database

Classification

Relations

Page 21: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge
Page 22: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Part IIIObjects, Words and Vectors

Page 23: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Problem

Classification is good, but relations are sparse

Object Knowledge

Page 24: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Distributional Relational Hypothesis

isa(E1, A) isa(E2, B) S(E1, E1) R(A, B) ?∧ ∧ →

Page 25: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Semantic Relatedness

Washing_machine Ashtray

Bathroom 5 2

Bedroom 0 1

Living_room 1 6

Co-occurrence matrix Singular value decomposition

M=U ΣV *

U k ΣkV k*=M k

Low-rank approximation

NASARI: A Novel Approach to a Semantically-Aware Representation of Items(Camacho-Collados, Pilehvar and Navigli, 2015)

Page 26: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Semantic SimilarityBathroom

Ashtray

Washing_machine

α

β

sim(Bathroom, Washing_machine) = cos( ) 0.71α ≈

sim(Bathroom, Ashtray) = cos( ) 0.37β ≈

Page 27: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Place Classification = Cosine similarity on NASARI

+ aggregation, weighting by distance, ...

Page 28: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Evluation: locatedAt

Gold standard: SUN database linked to DBpedia

Page 29: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Evluation: usedFor

Gold standard: ConceptNet linked to DBpedia

Page 30: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

931 high confidence location relationsOnly 52 were in the gold standard setE.g.:Trivet Kitchen→Flight_bag Airport_lounge→Soap_dispenser Unisex_public_toilet→

+ many related datasets:https://project.inria.fr/aloof/data/

Results

Page 31: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Object-action relation (usedFor)Extracting common sense knowledge via triple ranking using supervised and unsupervised distributional modelsS Jebbara, V Basile, E Cabrio, P Cimiano, Semantic Web 2018

Improving object detectionSemantic web-mining and deep vision for lifelong object discoveryJ Young, L Kunze, V Basile, E Cabrio, N Hawes, B CaputoRobotics and Automation, ICRA 2017

Object-location relation (locatedAt)Populating a knowledge base with object-location relations using distributional semanticsV Basile, S Jebbara, E Cabrio, P Cimiano, EKAW 2016

Distributional Relational Hypothesis

Page 32: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Part IVFrames and Prototypical

Knowledge

Page 33: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Problem

The distributional relational hypothesis is limited to specific relations

Page 34: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Frame Semantics

FrameNet (1997), Framester (2016), Framebase (2015)

Frame name

Frame type

Frame element

role

Page 35: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Frame InstanceInstance id: <fi12345>Frame type: fbframe:CookingFrame elements:● fbfe:Instrument, dbr:Knife● fbfe:Agent, dbr:Person● …

Default Knowledge Prototypical Frame Instances→=

F.I. extraction + F.I. clustering

Page 36: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Knowledge Extraction

https://github.com/valeriobasile/learningbyreading

Text

(Natural Language)

Semantic

Parsing

Word Sense

Disambiguation

Entity

Linking

Discourse

Representation

Structure

DBPedia

Entities

WordNet

Synsets

Semantic

Roles

FrameNet

FramesAlignment

RDF

Triples

Page 37: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Frame Instance Extraction

Page 38: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Frame Similarity

Instance id: <fi12345>Frame type: fbframe:CookingFrame elements:● fbfe:Instrument,

dbr:Knife● fbfe:Agent, dbr:Person● …

Instance id: <fi67890>Frame type: fbframe:EatingFrame elements:● fbfe:Instrument,

dbr:Fork● fbfe:Agent, dbr:Person● …

frame types

Page 39: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Frame Similarity

Instance id: <fi12345>Frame type: fbframe:CookingFrame elements:● fbfe:Instrument,

dbr:Knife● fbfe:Agent, dbr:Person● …

Instance id: <fi67890>Frame type: fbframe:EatingFrame elements:● fbfe:Instrument,

dbr:Fork● fbfe:Agent, dbr:Person● …

frame elements

Page 40: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Frame Similarity

Measuring Frame Instance RelatednessV. Basile, R. Lopez Condori, E. Cabrio

*SEM 2018 (accepted)

Page 41: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Pilot StudyText for language learners (1,653 short stories) 114,536 frame instances, 154,422 frame elements, 686 frame types, 222 roles filled by 3,398 types of concepts.Hierarchical clustering with our distance metric: complete-linkage agglomerative (SciPy)

Frame Instance Extraction and Clustering for Default Knowledge BuildingA. Shah, V. Basile, E. Cabrio, S. Kamath S.

Applications of Semantic Web technologies in Robotics - ANSWER 17

Page 42: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Pilot Study

Page 43: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Pilot StudyMost frequent frame type, role and element from each cluster

<http://framebase.org/fbframe/Ride_vehicle><http://framebase.org/fbfe/Vehicle><http://wordnet−rdf.princeton.edu/wn31/02837983−n>

300 triples, available at ~http://project.inria.fr/aloof/data/

Bicycle

Page 44: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Part VDefault Knowledge about

Objects

Page 45: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Default Knowledge about Objects

RDF dataset of common sense knowledge about objects.

Object classification, prototypical location, actions, frames...

Knowledge extracted from parsing, crowdsourcing, distributional semantics, keyword linking

http://deko.inria.fr/

Page 46: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Default Knowledge about Objects

10,990 nquads (named graphs)

603 from crowdsourcing

1,221 from distributional relational hypothesis

8,046 from keyword kinking

1,120 from KNEWS/frame instance clustering

+ DeKO ontology

http://deko.inria.fr/

Page 47: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge
Page 48: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

Left out… but open for discussion

● Fundational distinctionsClass vs. Instance in DBpediaAsprino L., Basile V., Ciancarini P., Presutti Empirical Analysis of Foundational Distinctions in Linked Open Data (IJCAI 2018)

● Nature of prototypical relationsDefault Logic and RDF

Page 49: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

The End(Q/A)

Page 50: Creating a Commonsense Knowledge Base about Objectsvaleriobasile.github.io/presentations/SAILab2020.pdf · 2020-05-21 · Frame Instance Extraction and Clustering for Default Knowledge

● Valerio Basile, Elena Cabrio, Fabien Gandon, Debora Nozza: Mapping Natural Language Labels to Structured Web ResourcesNL4AI 2018

● Valerio Basile, Roque Lopez Condori, Elena Cabrio:Measuring Frame Instance Relatedness.*SEM 2018

● Soufian Jebbara, Valerio Basile, Elena Cabrio, Philip Cimiano (2018): Extracting common sense knowledge via triple ranking using supervised and unsupervised distributional modelsSemantic Web

● Avijit Shah, Valerio Basile, Elena Cabrio, Sowmya Kamath S.: Frame Instance Extraction and Clustering for Default Knowledge Building. AnSWeR 2017.

● Jay Young, Valerio Basile, Markus Suchi, Lars Kunze, Nick Hawes, Markus Vincze, Barbara Caputo: Making sense of indoor spaces using semantic web mining and situated robot perceptionESWC 2017

● Jay Young, Lars Kunze, Valerio Basile, Elena Cabrio, Nick Hawes, Barbara Caputo: Semantic Web-Mining and Deep Vision for Lifelong Object Discovery. ICRA 2017

● Valerio Basile, Soufian Jebbara, Elena Cabrio, Philipp Cimiano: Populating a Knowledge Base with Object-Location Relations Using Distributional SemanticsEKAW 2016

● Jay Young, Valerio Basile, Lars Kunze, Elena Cabrio, Nick Hawes: Towards Lifelong Object Learning by Integrating Situated Robot Perception and Semantic Web MiningECAI 2016

● Valerio Basile, Elena Cabrio, Fabien Gandon: Building a General Knowledge Base of Physical Objects for Robots. ESWC 2016