georeferencing of search results based on annoted data and geographic information systems

17
Georeferencing of Search Results Georeferencing of Search Results based on Annoted Data and based on Annoted Data and Geographic Information Systems Geographic Information Systems by Giw Aalam MPI, Department 5, Databases and Information Systems MPI, Department 5, Databases and Information Systems

Upload: aileen

Post on 05-Feb-2016

29 views

Category:

Documents


0 download

DESCRIPTION

Georeferencing of Search Results based on Annoted Data and Geographic Information Systems. by Giw Aalam MPI, Department 5, Databases and Information Systems. Motivation(1). „Whatever occurs, occurs in space and time.“ (Wegener 2000) - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Georeferencing of Search Results based on Annoted Data and Geographic Information Systems

Georeferencing of Search Results based on Georeferencing of Search Results based on Annoted Data and Geographic Information Annoted Data and Geographic Information

SystemsSystems

byGiw Aalam

MPI, Department 5, Databases and Information Systems MPI, Department 5, Databases and Information Systems

Page 2: Georeferencing of Search Results based on Annoted Data and Geographic Information Systems

Motivation(1)Motivation(1)

• „Whatever occurs, occurs in space and time.“ (Wegener 2000)

– Navigation– Path of a hurricane– Market surveys– Environmental dynamics

• Increasing market for geospatial information

Page 3: Georeferencing of Search Results based on Annoted Data and Geographic Information Systems

Motivation(2)Motivation(2)

• Most georeferencing we encounter daily is in form of placenames:

– ~70% of text documents contain placename references (MetaCarta Inc. 2005)

– 49,69% out of 5Mio. libraray catalog records of the University of California contain >1 place related subject headings

(Petras 2004)

Time and space reference potentially important to access documents and knowledge

Page 4: Georeferencing of Search Results based on Annoted Data and Geographic Information Systems

Motivation(3)Motivation(3)

• Most text oriented search engines heavily depend on recognizing weighted keywords

– Cannot find relevant results for queries like „tropical fruit“ and „Bodensee“; e.g. an article about apricots from Mainau isle

– Requires appropiate knowledge base (Thesauri/Ontology, Gazzetters/geospatial information)

Page 5: Georeferencing of Search Results based on Annoted Data and Geographic Information Systems

ObjectiveObjective

Show potentials & problems of the usage of geodata / georeferenced data in the framework of information retrieval

Develop a prototype for a search engine, based on

widespread available data and tools (Wikipedia, GoogleEarth)

Page 6: Georeferencing of Search Results based on Annoted Data and Geographic Information Systems

Georeferencing(1)Georeferencing(1)

• Translation between informal and formal representations of geographic locations

– Informal reference e.g. used in discourse („Saarbrücken“, „the musicstore on Marktstraße“, …)

– Formal representations are basis for mathematical calculations like distance, direction and spatial relationships in general (52° 31' N, 13° 25' O )

Page 7: Georeferencing of Search Results based on Annoted Data and Geographic Information Systems

Georeferencing(2)Georeferencing(2)

Page 8: Georeferencing of Search Results based on Annoted Data and Geographic Information Systems

Annotation(1)Annotation(1)

• Congruent, common understanding of geographic references important for consistent annotation

– What should be annotated? – What is relevant?– What aspects of geodata should be described?– How?

• Context specific versus cross-context

Page 9: Georeferencing of Search Results based on Annoted Data and Geographic Information Systems

Annotation(2)Annotation(2)

• XML-based formats play an increasing role in the framework of GIS (Geographic Information Systems) for annotation/description and data exchange

– Structured– Extensible; XML Schema comprises a binding determination of

an information model expressed in a document-instance– Possible to work with heterogeneous data

Page 10: Georeferencing of Search Results based on Annoted Data and Geographic Information Systems

Annotation(3)Annotation(3)

• GML (Geography Markup Language)

– Defined by the OGC (Open Geospatial Consortium) as a „standard“ format for modeling and exchange of spatial information

– expected to be released as an international standard in 2007

Page 11: Georeferencing of Search Results based on Annoted Data and Geographic Information Systems

Annotation(5)Annotation(5)

• KML (Keyhole Markup Language)

– Delevoped by Keyhole Corp. for the „EarthViewer“-Tool– Keyhole has been taken over by Google Inc. in 2004– KML now used in connection with GoogleEarth

– Visualisation of georeferenced data – Description of geometric figures, pictures and locations– Define view/perspective

Examples later…

Page 12: Georeferencing of Search Results based on Annoted Data and Geographic Information Systems

Infrastructure(1)Infrastructure(1)

3-level architecture as considered by EU-initiative INSPIRE (Infrastructure for Spatial Information in Europe)

Page 13: Georeferencing of Search Results based on Annoted Data and Geographic Information Systems

Infrastructure(2)Infrastructure(2)

• Bottom level: (meta-)data sources– „Machinable“; read & interpret

• Medium level: (value-added) services– Independent from specific databases

• Top-Level: user-applications (GIS, browser, specialized services,…)

Requirement for common interfaces!

Page 14: Georeferencing of Search Results based on Annoted Data and Geographic Information Systems

Visualisation(1)Visualisation(1)

• Context-specific representation / visualisation of results

could effectively support the process of Data-Mining

• Geospatial coherence often deducible by use of adequate visualisation

– Conformance between mental model and cognitive style often better than in a simple table-view.

Page 15: Georeferencing of Search Results based on Annoted Data and Geographic Information Systems

Visualisation(2)Visualisation(2)

Identification of a pump on „Broad Street“ as source of cholera epidemic; Dr. John Snow, 1854, London

Example:

Page 16: Georeferencing of Search Results based on Annoted Data and Geographic Information Systems

References(1)References(1)

• Wegener M, Fotheringham A., 2000, Spatial models and GIS: New Potential and New Models, London, Taylor & Francis

• Petras V., 2004, Statistical Analysis of Geographic and Language Clues in the MARC Record. Technical report for the „Going Places in the Catalog: Improved Geographical Access“ project, University of California, http://metadata.sims.berkeley.edu/papers/Marcplaces.pdf

• MetaCarta Inc. 2005, MetaCarta corporate brochure, http://metacarta.com/docs/Corporate_Brochure_06_05.pdf

Page 17: Georeferencing of Search Results based on Annoted Data and Geographic Information Systems

References(2)References(2)

• „INSPIRE Architecture and Standards Position Paper“, INSPIRE (Infrastructure for Spatial Information in Europe), European Commission, Joint Research Centre, 2002 http://inspire.jrc.it/reports/position_papers/inspire_ast_pp_v4_3_en.pdf

• Düren U., „XML, GML, NAS“, Landesvermessungsanstalt NRW http://www.landesvermessungsamt.nrw.de/neues/veranstaltungen/seminare/images/Vortraege_LDS_Kurs_40004_06/Dueren_LVermA_NRW/LDS_40004_XML_GML_NAS.pdf

• W. Riekert, P. Treffler, 2000, „Georeferenzierung als Mittel zur Erschließung von Fachinformationen in Internet und Intranet“, 14. Int. Symposium Informatik im Umweltschutz

http://v.hdm-Stuttgart.de/~riekert/vortraege/00ui.pdf