sheldon challenge

14
STLab ISTC - CNR Semantic Holistic framEwork for LinkeD ONtology data Diego Reforgiato Recupero 1 , Andrea Giovanni Nuzzolese 1 , Sergio Consoli 1 , Aldo Gangemi 1,2 and Valentina Presutti 1 1 STLab, Institute of Cognitive Science and Technology, National Research Council, Italy 2 LIPN, Université Paris 13, Sorbone Cité, UMR CNRS, France 22 October 2014, Riva del Garda, Italy http://wit.istc.cnr.it/stlab-tools/sheldon

Upload: andrea-nuzzolese

Post on 27-Nov-2014

296 views

Category:

Technology


3 download

DESCRIPTION

SHELDON is the first true hybridization of NLP machine reading and Semantic Web. It is a framework that builds upon a ma- chine reader for extracting RDF graphs from text so that the output is compliant to Semantic Web and Linked Data patterns. It extends the current human-readable web by using Semantic Web practices and technologies in a machine-processable form. Given a sentence in any language, it provides different semantic functionalities (frame detection, topic extraction, named entity recognition, resolution and coreference, terminology extraction, sense tagging and disambiguation, taxonomy induction, semantic role labeling, type induction, sentiment analysis, citation inference, relation and event extraction) as well as nice visualization tools which make use of the JavaScript infoVis Toolkit and RelFinder, as well as a knowledge enrichment component that extends machine reading to Semantic Web data. The system can be freely used at http://wit.istc.cnr.it/stlab-tools/sheldon.

TRANSCRIPT

Page 1: Sheldon challenge

STLab ISTC - CNR

Semantic Holistic framEwork for LinkeD ONtology data

Diego Reforgiato Recupero1, Andrea Giovanni Nuzzolese1, Sergio Consoli1, Aldo Gangemi1,2 and Valentina Presutti1

1STLab, Institute of Cognitive Science and Technology, National Research Council, Italy2LIPN, Université Paris 13, Sorbone Cité, UMR CNRS, France

22 October 2014, Riva del Garda, Italy

http://wit.istc.cnr.it/stlab-tools/sheldon

Page 2: Sheldon challenge

STLab ISTC - CNR

Title

2

• Text

• Other text

Multilinguality: SHELDON

supports 47 languages

Page 3: Sheldon challenge

STLab ISTC - CNR

• Point

• subpoint

Title

3

• Built on top of FRED, Boxer, C&C• Use event detection, semantic role labeling, FrameNet roles, first-order

logic representation of predicate-argument structures (DRT), logical operator scoping, modality detection, tense representation, NER with TAGME, WSD, DBPedia.

1/4 Interactivity:

SHELDON graphs are interactive

Page 4: Sheldon challenge

STLab ISTC - CNR4

• Use FRED, SentiWordNet, SenticNet, SentiloNet (new lexical resource we developed), OntoSentilo (new ontology for opinion sentences), WordNet, VerbNet, DBPedia.

• Holder detection, topic and subtopic detection, propagation towards entities and related nodes of the graph

2/4

Page 5: Sheldon challenge

STLab ISTC - CNR5

• Binary relation discovery on FRED’s graph among identified entities.• Formalization of discovered properties in terms of domain and ranges

(WIBI) and OWL2 property chains• Built on top of FRED, DBPedia, WATSON API, WIBI

3/4

Page 6: Sheldon challenge

STLab ISTC - CNR6

4/4

Page 7: Sheldon challenge

STLab ISTC - CNR7

DEMO

Page 8: Sheldon challenge

STLab ISTC - CNR8

KNOWLEDGE EXPLORATION

Page 9: Sheldon challenge

STLab ISTC - CNR

KNOWLEDGE VISUALIZATION

Page 10: Sheldon challenge

STLab ISTC - CNR13

MARKET UPTAKE

Page 11: Sheldon challenge

STLab ISTC - CNR

Organizations using SHELDON3CUBE S.R.L., SME, ITAlmaviva, SME, ITARTEMATICA IMPRESA DI COMUNICAZIONE MULTIMEDIALE S.R.L. , SME, ITCaalma S.r.l. , SME, ITCDB Agency s.r.l. , SME, ITCNR sistemi informativi, research institution, ITCUBECURVE S.R.L. , SME, ITDR SOFT S.A.S. DI GRESTA MARIA CRISTINA & C. S.A.S. , SME, ITETNA HiTech, SME, ITGEST S.R.L. , SME, ITGRUPPO FOCUS S.R.L. , SME, ITHamatec S.r.l. , SME, ITHergaweb S.R.L. , SME, ITHGO S.r.l. , SME, ITHT S.r.l. , SME, ITImpera Software S.R.L. , SME, ITITALDATA SOC. COOP a.r.l. , SME, ITJOB CREATION S.R.L. , SME, ITLIPN, Université Paris 13, research institution, FRM.R.S. S.R.L. , SME, ITP.M.F. S.r.l. , SME, ITRed Link, SME, AUTR2M Solution, SME, IT and UKSemantic Sicily, consortium of Sicilian SMEsSentimetrix inc, SME, USSicula Ciclat Coop. , SME, ITSignorelli & Partners S.a.s. , SME, ITTECHMA - TECNOLOGIE MULTIMEDIALI AVANZATE S.R.L. , SME, ITTecnosys S.r.l. , SME, ITTesi Automazione S.R.L. , SME, ITUniroma 2, research institution, ITUniversity of Aberdeen, research institution, UKUniversity of Colorado, research institution, USUniversity of Bologna, research institution, ITUniversity of Catania, research institution, ITUniversity of Leeds, research institution, UKVisual Software S.r.l. , SME, IT

New H2020 project about robot for assisted living (to be started in January 2015) is going to use SHELDON for the core engine of the robot automatic behaviour.

EU tender about fish ontologies and fishery label representation (to be started in November 2014) is going to use SHELDON to provide some of the proposed services.

HERMES, an Italian industrial project is going to use SHELDON (several companies from LAZIO region) about travel planning (user profilation)PRISMA, an Italian project is using SHELDON for sentiment analysis over data related to the city of CataniaSHELDON is going to be used in the creationof Apache Stanbol plugins, which is a product of a finished FP7 project called IKS

20k accsss per

month

Page 12: Sheldon challenge

STLab ISTC - CNR

Examples of real use cases• SHELDON’s team leverages expertise and skills of its

members in start-up creation, management and sustainability.

• SHELDON represents the core service that a new UK start-up will provide to its customers.

PEOPLE INTERESTED IN SHELDON ASKED:• Sentiment Analysis using semantics (http

://alt.qcri.org/semeval2015/task11/)• Access to the machine reading capabilities to exploit

semantics for userclassification according to their text posted in social networks

• Access to relation discovery component and sentiment analysis to develop new components for CMS

• Access to the citation typing component for the digital library case study of CMS (potential collaboration with ELSEVIER)

• Use of SHELDON for the robot main component related to text understaning and interaction with humans

Page 13: Sheldon challenge

STLab ISTC - CNR

STRENGTHS

WEAKNESSES

OPPORTUNITIES T

HREATS

• Expertise in start-up creation• Each component succesfully

proved and published• Wide ICT network in Europe• Participation in H2020 project• Competitors do different things• Multi-language support for 47

languages• Automatic language

identification• REST API Access

• Initial funding• Dedicated programmers• Team not in the same

physical place• Computational time

(hardware upgrade needed)

• SHELDON is the first in its kind

• Provided semantics can be used in social networks, CMS, data analysis, system recommendation, etc.

• Semantics not really known yet

• 20k current access per month

• License issues• Security problems (REST

API)

Negative

Internal factors

Externalfactors

Positive

Page 14: Sheldon challenge

STLab ISTC - CNR17

THANK YOU