semantic similarity measurement and geographic applications introduction

30
3 Apr 2008 geog 288MR, spring 08 1 Semantic Similarity Measurement and Geographic Applications Introduction Dr. Martin Raubal Department of Geography, UCSB [email protected]

Upload: aurora-whitney

Post on 03-Jan-2016

44 views

Category:

Documents


0 download

DESCRIPTION

Semantic Similarity Measurement and Geographic Applications Introduction. Dr. Martin Raubal Department of Geography, UCSB [email protected]. Overview. Instructor Organization Goals Schedule Introduction to the topic. Instructor. - PowerPoint PPT Presentation

TRANSCRIPT

3 Apr 2008 geog 288MR, spring 08 1

Semantic Similarity Measurement and Geographic Applications

Introduction

Dr. Martin Raubal

Department of Geography, UCSB

[email protected]

3 Apr 2008 geog 288MR, spring 08 2

Overview

• Instructor

• Organization

• Goals

• Schedule

• Introduction to the topic

3 Apr 2008 geog 288MR, spring 08 3

Instructor

• Instructor: Dr. Martin Raubal, [email protected]://www.geog.ucsb.edu/~raubal/

• Office (hours): Wednesday 2-4pm, EH 5713

• Phone: 893-4839

3 Apr 2008 geog 288MR, spring 08 4

Class meets

• Thursday 1-3pm, EH 5824

3 Apr 2008 geog 288MR, spring 08 5

Textbook & literature

• No textbook

• Literature online: http://www.geog.ucsb.edu/~raubal/Courses/288MR_Spring08_Papers/

3 Apr 2008 geog 288MR, spring 08 6

Organization

• Course Website: http://www.geog.ucsb.edu/~raubal/Courses/288MR_SemSim_Spring08_schedule.pdf

• Weekly readings

• Class participation is mandatory!

3 Apr 2008 geog 288MR, spring 08 7

Assignments

• Weekly brief critical commentaries (not longer than 1 page each) plus 3 questions connected to the readings (possibly related to your own field of study) to be discussed in class.

• Please hand in a printed version at the beginning of each session. Late assignments will not be accepted and count as failed.

• Lead one session including presentation of the weekly material (and possibly additional material) and discussion.

3 Apr 2008 geog 288MR, spring 08 8

Grading

• Your presentation & discussion lead (35%)

• Written assignments (35%)

• General class participation (30%)

Registration 2-4 hours!?

3 Apr 2008 geog 288MR, spring 08 9

Goals

• Identify and recognize the importance of semantic similarity measurement for GI Science & Systems (data retrieval, integration, etc.).

• Explore theoretical models and practical applications.

• Get to know relevant literature.

• Discuss, write, and present.

3 Apr 2008 geog 288MR, spring 08 10

Schedule

http://www.geog.ucsb.edu/~raubal/Courses/288MR_SemSim_Spring08_schedule.pdf

Take the lead for one session.

3 Apr 2008 geog 288MR, spring 08 11

You

• Who are you and what is your background?

• What is your motivation to participate in this class?

• Semantic similarity measurement? Any experience?

• How does it relate to your own research?

3 Apr 2008 geog 288MR, spring 08 12

Applications!?

• Think of applications where semantic similarity is of relevance for GI Sciences.

3 Apr 2008 geog 288MR, spring 08 13

Motivating example (1)

• Customer of OS wants to set up flood warning system.

• Need for existing flooding areas to analyze current flood defense situation in U.K.

Motivating example (1)

• OS Master Map:– geographic & topographic– information on areas used for flooding but not

designated as such

• ‘Watermeadow', 'carse‘, 'haugh' identified as flooding areas by their semantic description only (properties in ontology).

3 Apr 2008 geog 288MR, spring 08 14

3 Apr 2008 geog 288MR, spring 08 15

3 Apr 2008 geog 288MR, spring 08 16

User conceptualization of roads & residential areas

System model of roads & residential areas

Roads overlap residential areas?

Intersect to find roads going through residential areas

Motivating example (2)

3 Apr 2008 geog 288MR, spring 08 17

Semantic interoperability

• “Capacity of (geographic) information systems and services to work together without the need for human intervention” (Harvey, Kuhn et al. 1999)

• Achieving sufficient degree of semantic interoperability => necessary to determine semantic similarity between concepts.

3 Apr 2008 geog 288MR, spring 08 18

Similarity (psychology)“Similarity is fundamental for

learning, knowledge and thought, for only our sense of similarity allows us to order things into kinds so that these can function as stimulus meanings. Reasonable expectation depends on the similarity of circumstances and on our tendency to expect that similar causes will have similar effects" [Quine 1969, p. 114].

3 Apr 2008 geog 288MR, spring 08 19

Computer science

• Similarity plays major role to enable machine-based solutions: decision support systems, data mining, pattern recognition.

• Semantic information retrieval: similarity indicates relevance of results with regard to being similar to the query.

3 Apr 2008 geog 288MR, spring 08 20

Concept

A concept is "a mental representation of a class or individual and deals with what is being represented and how that information is typically used during the categorization" [Smith 1989, p. 502].

Concept vs. Category?

3 Apr 2008 geog 288MR, spring 08 21

Concepts in knowledge representation

• Conceptual knowledge can be represented in ontologies that consist of specifications of concepts, relations and axioms.

• Relations link concepts together and enable reasoning and measurement within an ontology.

• Taxonomical (hierarchical) relations are the most important for reasoning and structuring knowledge.

3 Apr 2008 geog 288MR, spring 08 22

dist (Bus, Ferry) < dist (Bus, Bike)

3 Apr 2008 geog 288MR, spring 08 23

Similarity measurements

• Approaches from different research areas (psychology, computer science, artificial intelligence) => apply to ontology-based semantic similarity measurement.

• Application areas:– Information retrieval & integration– Data mining & maintenance– Categorization– Natural-language processing– Pattern recognition

3 Apr 2008 geog 288MR, spring 08 24

Measure and representation

Representational model used to describe concepts determines semantic similarity measure (based on one notion of similarity).

Representation => similarity measure

3 Apr 2008 geog 288MR, spring 08 25

Semantic similarity measurement

• How close are two entities to each other conceptually?

• Value between 0 and 1:– ‘0’ => no similarity– ‘1’ => both entities are equal

• Different measurement theories.

3 Apr 2008 geog 288MR, spring 08 26

Schwering (2008)

relations not explicitly represented

3 Apr 2008 geog 288MR, spring 08 27

Topics tackled in this seminar

• Psychological models to explain human similarity judgment

• Approaches to similarity in different areas / disciplines

• What is representation?• Representation of geographic semantics• Semantic interoperability

3 Apr 2008 geog 288MR, spring 08 28

Topics tackled in this seminar

• Theories for measuring semantic similarity– Feature-based models– Geometric models

• Similarity measurement in context• Semantic similarity vs. semantic

relatedness• Measurements using a semantic similarity

serverhttp://128.176.133.142:8080/SimInterface/

Topics tackled in this seminar

• Use of similarity measurement in geographic applications:– Geographic information retrieval– Hydrography– Land use / land cover classification– Wayfinding and landmarks– World knowledge in Wikipedia– …

3 Apr 2008 geog 288MR, spring 08 29

3 Apr 2008 geog 288MR, spring 08 30

Reading for next session

• R. Goldstone and J. Son (2005) Similarity. in: K. Holyoak and R. Morrison (Eds.), Cambridge Handbook of Thinking and Reasoning. Cambridge University Press, Cambridge.

• Schwering, A., Approaches to Semantic Similarity Measurement for Geo-Spatial Data: A Survey. Transactions in GIS, 2008. 12(1): p. 5-29.

• A. Rissland (2006) AI and Similarity. IEEE Intelligent Systems (May-June 06): 39-49.