kyoto (ict - 211423) overview

27
KYOTO (ICT-211423) Yielding Ontologies for Transition-Based Organization FP7: Intelligent Content and Semantics http://www.kyoto-project.eu/ Piek Vossen 1 st Kyoto workshop: Environmental Knowledge Transition and Exchange February, 2-3, 2009, Artis Amsterdam

Upload: brian-hammond

Post on 01-Jan-2016

31 views

Category:

Documents


0 download

DESCRIPTION

KYOTO ( ICT - 211423) Y ielding O ntologies for T ransition-Based O rganization FP7: Intelligent Content and Semantics http://www.kyoto-project.eu/ Piek Vossen 1 st Kyoto workshop: Environmental Knowledge Transition and Exchange February, 2-3, 2009, Artis Amsterdam. - PowerPoint PPT Presentation

TRANSCRIPT

KYOTO (ICT-211423)Yielding Ontologies for Transition-Based OrganizationFP7: Intelligent Content and Semantics

http://www.kyoto-project.eu/

Piek Vossen1st Kyoto workshop: Environmental Knowledge Transition and ExchangeFebruary, 2-3, 2009, Artis Amsterdam

1st Kyoto workshop: Environmental Knowledge Transition and Exchange, February, 2-3, 2009, Artis AmsterdamICT-211423

KYOTO (ICT-211423) Overview • Title: Yielding Ontologies for Transition-Based Organization

• Funded: – 7th Framework Program-ICT of the European Union: Intelligent Content and

Semantics

– Taiwan and Japan funded by national grants • Goal:

– Platform for knowledge sharing across languages and cultures– Enables knowledge transition and information search across different target

groups, transgressing linguistic, cultural and geographic boundaries.– Open text mining and deep semantic search– Wiki environment that allows people in the field to maintain their knowledge

and agree on meaning without knowledge engineering skills• URL: http://www.kyoto-project.eu/• Duration:

– March 2008 – March 2011• Effort:

– 364 person months of work.

1st Kyoto workshop: Environmental Knowledge Transition and Exchange, February, 2-3, 2009, Artis AmsterdamICT-211423

Consortium

1. Vrije Universiteit Amsterdam (Amsterdam, The Netherlands), 2. Consiglio Nazionale delle Ricerche (Pisa, Italy), 3. Berlin-Brandenburg Academy of Sciences and Humantities (Berlin,

Germany), 4. Euskal Herriko Unibertsitatea (San Sebastian, Spain), 5. Academia Sinica (Tapei, Taiwan), 6. National Institute of Information and Communications Technology

(Kyoto, Japan), 7. Irion Technologies (Delft, The Netherlands), 8. Synthema (Rome, Italy), 9. European Centre for Nature Conservation (Tilburg, The

Netherlands), • Subcontractors:

– World Wide Fund for Nature (Zeist, The Netherlands), – Masaryk University (Brno, Czech)

1st Kyoto workshop: Environmental Knowledge Transition and Exchange, February, 2-3, 2009, Artis AmsterdamICT-211423

KYOTO (ICT-211423) Overview • Languages:

– English, Dutch, Italian, Spanish, Basque, Chinese, Japanese • Domain:

– Environmental domain, BUT usable in any domain • Global:

– Both European and non-European languages• Available:

– Free: as open source system and data (GPL)• Future perspective:

– Content standardization that supports world wide communication

1st Kyoto workshop: Environmental Knowledge Transition and Exchange, February, 2-3, 2009, Artis AmsterdamICT-211423

Current situation

• Vast amount of information in all kinds of formats and structures: websites, documents, databases, experts, community networks

• Scattered over the world: different regions, languages and cultures

• Highly dynamic and developing• Involving an extreme range of interests and

expertise: government, general public, education, policies, regulations and rules, biology, health, chemistry, agriculture, economy, poverty, social impacts, transport, tourism, food industry, natural disasters, etc.

1st Kyoto workshop: Environmental Knowledge Transition and Exchange, February, 2-3, 2009, Artis AmsterdamICT-211423

KYOTO cycle

frog endemic frogs common frog poison frog

Golden poison froggopher frog

Dusky gopher frogforest frog

Garden ponds are havens for wildlife. They provide food and shelter for frogs, newts and aquatic insects, including damselflies and dragonflies,

(garden pont, haven, wild life)(garden pont, has_food, frog)(garden pont, has_food, newt)(garden pont, has_food, aquatic insect)(garden pont, is_shelter, frog)(garden pont, is_shelter, newt)(garden pont, is_shelter, aquatic insect)

Top

Middle

H20 CO2

Substance

Abstract

Process

Physical

Ontology

Environmental organizations

Tybot: term yielding robot

Kybot: knowledge yielding robot

Wordnets

Distributed, diverse & dynamic data

1

Capture text:"Sudden increase of CO2 emissions in 2008 in Europe"

2

CO2 emission3

Wikyoto

maintainterms & concepts

4

Index facts:Process: Increase Involves: CO2 emission When: 2008 Where: Europe

5Text & Fact Index

SemanticSearch

6

Citizens

Governments

Companies

DomainCO2

EmissionH20

PollutionGreenhouse

Gas

1st Kyoto workshop: Environmental Knowledge Transition and Exchange, February, 2-3, 2009, Artis AmsterdamICT-211423

Kyoto main assets

• Wiki platform (WIKYOTO) for connecting, transferring and controlling knowledge and information across people and computers

• Term yielding robots (TYBOT): software that extracts terms and concepts from documents

• Knowledge yielding robots (KYBOT): fact extraction software that generates a comprehensive list of facts from collection of sources

• Fact alert: reports changes in facts on a collection of sources

1st Kyoto workshop: Environmental Knowledge Transition and Exchange, February, 2-3, 2009, Artis AmsterdamICT-211423

What makes KYOTO unique?

• Wikyoto community tool that hides technology and complex knowledge and language representation

• Operated by community people and not by knowledge engineers and language technology people

• Exploits massive labor force of comunities all over the world

• Connects domain knowledge to generic knowledge and domain language to general language

1st Kyoto workshop: Environmental Knowledge Transition and Exchange, February, 2-3, 2009, Artis AmsterdamICT-211423

Some basics

• What is a wordnet?

• What is an ontology

• What are facts?

1st Kyoto workshop: Environmental Knowledge Transition and Exchange, February, 2-3, 2009, Artis AmsterdamICT-211423

WordNetWordNet• Semantic network in which concepts are defined in

terms of relations to other concepts, e.g. "car is a type of vehicle and so is a train".

• Structure: organized around the notion of synsets (sets

of synonymous words, e.g. "car, automobile") basic semantic relations between these

synsets http://www.cogsci.princeton.edu/~wn/w3wn.htmlhttp://www.cogsci.princeton.edu/~wn/w3wn.html Developed at Princeton by George Miller and his

team as a model of the mental lexicon.

1st Kyoto workshop: Environmental Knowledge Transition and Exchange, February, 2-3, 2009, Artis AmsterdamICT-211423

Relational model of meaning

man woman

boy girl

cat

kitten

dog

puppy

animal

man

woman

boy

cat

kitten

dogpuppy

animal

1st Kyoto workshop: Environmental Knowledge Transition and Exchange, February, 2-3, 2009, Artis AmsterdamICT-211423

chronical patient ; mental patient

patient

ISA

ρ-PROCEDURE ρ-LOCATION

STATE

ρ-CAUSE

cureρ-PATIENT

treat

arts

disease; disorder

physiotherapymedicinesetc.

hospital, etc.

stomach disease, kidney disorder,

ρ-PATIENT

ISA

ρ-AGENT

child docter

child

co-ρ-AGENT-PATIENT

ISA

Wordnet = network of semantically related words

1st Kyoto workshop: Environmental Knowledge Transition and Exchange, February, 2-3, 2009, Artis AmsterdamICT-211423

1st Kyoto workshop: Environmental Knowledge Transition and Exchange, February, 2-3, 2009, Artis AmsterdamICT-211423

Ontologies

• Formal definition of concepts that a computer can use to make inferences

• Uses logical representations of relations and properties of things

• Can be used to connect the meanings of words across languages and different domains

1st Kyoto workshop: Environmental Knowledge Transition and Exchange, February, 2-3, 2009, Artis AmsterdamICT-211423

Ontology Example• <EcoSystem rdf:ID="Living_Plant_Instance_12">• <SUO-name rdf:datatype="http://www.w3.org/2001/XMLSchema#string"• >freshwater species</SUO-name>• </EcoSystem>• <Event rdf:ID="Living_Plant_Instance_30009">• <rdf:type rdf:resource="#ChangeOfState"/>• <SUO-name rdf:datatype="http://www.w3.org/2001/XMLSchema#string"• >increase</SUO-name>• </Event>• <ChangeOfState rdf:ID="Living_Plant_Instance_20015">• <SUO-name rdf:datatype="http://www.w3.org/2001/XMLSchema#string"• >double</SUO-name>• </ChangeOfState>• <Event rdf:ID="Living_Plant_Instance_30017">• <rdf:type rdf:resource="#ChangeOfState"/>• <SUO-name rdf:datatype="http://www.w3.org/2001/XMLSchema#string"• >conversion of natural habitat to farmland</SUO-name>• </Event>

1st Kyoto workshop: Environmental Knowledge Transition and Exchange, February, 2-3, 2009, Artis AmsterdamICT-211423

Ontology example<rdfs:Class rdf:ID="WINE"> <rdfs:subClassOf rdf:resource="#POTABLE-LIQUID"/> <rdfs:subClassOf> <daml:Restriction>

<daml:onProperty rdf:resource="#MAKER"/> <daml:minCardinality> 1 </daml:minCardinality> </daml:Restriction>

</rdfs:subClassOf> <rdfs:subClassOf> <daml:Restriction>

<daml:onProperty rdf:resource="#MAKER"/> <daml:toClass rdf:resource="#WINERY"/> </daml:Restriction>

</rdfs:subClassOf> <rdfs:subClassOf> <daml:Restriction>

<daml:onProperty rdf:resource="#GRAPE-SLOT"/> <daml:minCardinality> 1

......... </rdfs:Class>

1st Kyoto workshop: Environmental Knowledge Transition and Exchange, February, 2-3, 2009, Artis AmsterdamICT-211423

Wordnet to Ontology

• Wordnets (semantic networks of words) in many languages can be linked to a single shared ontology

• Universal definition of meaning for different languages

• Relate words in free text to their meaning

• Allow computers to see the meaning of words in text in any language

Inter-LingualOntology

Device

Object

TransportDeviceEnglish Words

vehicle

car train

1

2

3 3

Czech Words

dopravní prostředník

auto vlak

2

1French Words

véhicule

voiture train

2

1

Estonian Words

liiklusvahend

auto killavoor

2

1

German Words

Fahrzeug

Auto Zug

2

1

Spanish Words

vehículo

auto tren

2

1

Italian Words

veicolo

auto treno

2

1

Dutch Words

voertuig

auto trein

2

1

Global WordNet Grid

1st Kyoto workshop: Environmental Knowledge Transition and Exchange, February, 2-3, 2009, Artis AmsterdamICT-211423

Concepts & Facts

• Conceptual knowledge: general & generic knowledge about – ClimateChange

• physical change • affecting the climate => definition of climate• in a region• during a period of time• caused by another change• causing yet other changes

1st Kyoto workshop: Environmental Knowledge Transition and Exchange, February, 2-3, 2009, Artis AmsterdamICT-211423

Abstract concept schema

• (subclass, ClimateChange, Change)

• (has_cause, ClimateChange, event)

• (has_cause, event, ClimatChange)

1st Kyoto workshop: Environmental Knowledge Transition and Exchange, February, 2-3, 2009, Artis AmsterdamICT-211423

Concepts & Facts

• Fact:– A case of ClimateChange has been observed:

• factual and significant change in the climate (temperature, humidity, wind direction, rain fall, etc.)

• in a particular region, e.g. the Alps.• Time period, e.g. summer 2005• Caused by CO2 emissions, North Atlantic gulf

stream• Causes decrease of biodiversity measured in

specific populations: fish, birds, insects => counts of populations

1st Kyoto workshop: Environmental Knowledge Transition and Exchange, February, 2-3, 2009, Artis AmsterdamICT-211423

Fact tableEvent Agent Patient When Where

Climate

change

Temperature rise

Climate of Spain

1990-2000

Spain

Climate

change

Humidity

rise

Climate of Lowlands

1995-2005

Netherlands

1st Kyoto workshop: Environmental Knowledge Transition and Exchange, February, 2-3, 2009, Artis AmsterdamICT-211423

Concrete results of Kyoto

• Domain wordnet for the environment domain• Domain ontology for the environment domain• Wikyoto to maintain the wordnet and ontology in

the future in a community• Tybots to extract new terms for the domain from

documents• Kybots to extract facts for the domain from

documents• A database of facts that is updated on a regular

basis

1st Kyoto workshop: Environmental Knowledge Transition and Exchange, February, 2-3, 2009, Artis AmsterdamICT-211423

Global time schedule

mar sep dec mar sep dec mar sep dec febPM

WP1 User requirements 5WP2 System design 12WP3 Capture 10WP4 Indexing 11WP5 Knowledge mining 120WP6 Knowledge integration 106WP7 Database systems and wiki 25WP8 Domain extension 12WP9 Evaluation 20¨ Milestones M1 M2 M3 M4

2008 2009 2010 2011

30 36

System-2

KE-Evaluation-1 KE-Evaluation-2

System-2

IR Evaluation-1 IR Eval-2

6 12 18 24

Design

Design

System-1S1

System-1S1

Design System-1

1st Kyoto workshop: Environmental Knowledge Transition and Exchange, February, 2-3, 2009, Artis AmsterdamICT-211423

The role of the users

• Provide sources and documents• Define the questions and topics of interest• Form a community that uses the Wikyoto

system to define terms and ontology• Give feedback on the Wikyoto system and

the extracted terms• Extract facts and evaluate the system• Join the forum discussions on the website

of Kyoto

1st Kyoto workshop: Environmental Knowledge Transition and Exchange, February, 2-3, 2009, Artis AmsterdamICT-211423

Goal of this Workshop

• Bridge the gap between the enviromentalist and knowledge engineer.

• Discuss the requirements of the system of the future and near-future

• Discuss the design and first versions of the KYOTO system and compare it with the state of the art