0 daml tools for intelligent information annotation, sharing and retrieval umbc johns hopkins...

17
1 DAML Tools for Intelligent Information Annotation, Sharing and Retrieval UMBC Johns Hopkins University Applied Physics Lab MIT Sloan School Tim Finin James Mayfield Benjamin Grosof DAML PI meeting, 30 November 2004

Upload: elinor-holmes

Post on 30-Dec-2015

215 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 0 DAML Tools for Intelligent Information Annotation, Sharing and Retrieval UMBC Johns Hopkins University Applied Physics Lab MIT Sloan School Tim Finin

1

DAML Tools for Intelligent Information Annotation, Sharing and Retrieval

UMBC

Johns Hopkins University Applied Physics LabMIT Sloan School

Tim FininJames MayfieldBenjamin Grosof

DAML PI meeting, 30 November 2004

Page 2: 0 DAML Tools for Intelligent Information Annotation, Sharing and Retrieval UMBC Johns Hopkins University Applied Physics Lab MIT Sloan School Tim Finin

2/16 UMBC/JHU/MIT 11/19/2004

Intelligent Information Annotation, Sharing and Retrieval Overall Program Summary

OWL enables agents in open, dynamic environments to share knowledge and cooperate on tasks while managing privacy, commitments, trust, and security. To do this, we must ensure that:

– OWL integrates with common agent frameworks and standards– OWL integrates with other KR paradigms for real world reasoning -- rule based

systems, Bayesian reasoning, etc.– OWL permits IR systems to index and retrieve documents with text, multimedia

and knowledge

Travel Agents

Auction Service Agent

Customer Agent

Bulletin BoardAgent

Market Oversight Agent

Request

Direct Buy

Report Direct Buy Transactions

BidBid

CFP

Report Auction Transactions

Report Travel Package

Report Contract

Proposal

Web Service Agents

Page 3: 0 DAML Tools for Intelligent Information Annotation, Sharing and Retrieval UMBC Johns Hopkins University Applied Physics Lab MIT Sloan School Tim Finin

3/16 UMBC/JHU/MIT 11/19/2004

Intelligent Information Annotation, Sharing and Retrieval Technical Problem and Approach

• Current agent systems are not set up to use OWL– Develop ontologies, conventions, software and tools to support using OWL in

FIPA and similar systems

– Demonstrate and investigate the use of OWL & Agents in realistic applications • Policies need to be grounded in OWL content

– Develop Rei as a declarative, RDF based policy language– Demonstrate via several testbeds

• OWL needs to be extended/integrated with rule-based reasoning– Work with groups to develop specifications for RuleML, SWRL, etc– Build prototype translation systems and demonstrations

• OWL needs to be extended/integrated with uncertainty reasoning– Annotate OWL content with Bayesian info and generate BBN– Demonstrate via support for ontology mapping

• We need information retrieval systems that work with RDF content– Augment existing IR systems like Google with swangle terms– Develop Swoogle as a native RDF search engine

Page 4: 0 DAML Tools for Intelligent Information Annotation, Sharing and Retrieval UMBC Johns Hopkins University Applied Physics Lab MIT Sloan School Tim Finin

4/16 UMBC/JHU/MIT 11/19/2004

Intelligent Information Annotation, Sharing and Retrieval Technical Progress

Here are the comments on the template slide• What technical problems were there and when/how did you overcome them?• Are there any metrics that are relevant to your program? How did you measure technical

progress and success? What were your intermediate goals?• Did you meet your original or revised programmatic goals?

• We need to come with a slide for this

Page 5: 0 DAML Tools for Intelligent Information Annotation, Sharing and Retrieval UMBC Johns Hopkins University Applied Physics Lab MIT Sloan School Tim Finin

5/16 UMBC/JHU/MIT 11/19/2004

Intelligent Information Annotation, Sharing and Retrieval Milestones and Accomplishments

Agent &Systems

KR

IR

specs

2001 2002 2003 2004 2005

ITTALKS TAGA

Vigil Soupa Cobra

SweetJess SweetRules

Rei

F-OWL reasoner

OWLIR Swoogle

Swangler

Bayes OWL

DAML+OIL Joint Committee

W3C WebOnt Working group SWSI & SWSA

Papers/MS/PhD

5+/7?/2?24/4/525/4/120/1/04/0/0

RGB

s

s

s

ss

s

s

s

s

s

Legend

Softwareor service

Ontology

Ph.D. Dissertation

Spec impact

Distributed IDS

Mogatu

O

OO O

O

O

O

RuleML

O

O

O

Page 6: 0 DAML Tools for Intelligent Information Annotation, Sharing and Retrieval UMBC Johns Hopkins University Applied Physics Lab MIT Sloan School Tim Finin

6

DAML Tools for Intelligent Information DAML Tools for Intelligent Information Annotation, Sharing and RetrievalAnnotation, Sharing and Retrieval

http://daml.org/ UMBC, JHU/APL, and MIT/Sloan are working together on a set of issues to be integrated into agent-based applications involving search and using rule-based reasoning: UMBC: Integrating communicating agents, DAML and the Web; JHU APL: DAML and information retrieval; and MIT Sloan School: DAML, rules based technology and distributed belief

Features•Generation from DB to DAML and HTML mediated by MySQL, Java servlets, and JSP.

•Generation of DAML descriptions and user profiles from HTML forms.

•Creation & use of DAML-encoded user modelsdescribing interests and ontology extensions.

•Ontologies for events, people, places, schedules,topics, etc.

•Automatic HTML form (pre) filling from DAML.

•Syncing of talks with Palm calendars via Coola.

•Automatic classification of talks into topicontology

•A XSB-based DAML/RDF reasoning engine.

•Agent-based services:• ITTALKS agent with KQML API using DAMLas content language

• Intelligent matching of people and talks based on interests, locations and schedules.

• Agents using both Jackal and FIPA’s Java Message Service

• Notification via email and mobile devices via SMS and WML.

• Discovery of relevant background papers from NEC CiteSeer

• Automatic generation and maintenance of user models

• Talk recommendations via collaborative filtering• Integration with STP (Smart Things and Places) ubiquitous computing project

UMBCAN HONORS UNIVERSITY IN MARYLAND

Webserver + Javaservlets

DAMLreasoning

engine

DAMLreasoning

engine

<daml></daml>

<daml></daml>

<daml></daml>

<daml></daml>

DAML files

Agents

Databases

People

RDBMSRDBMSDB

Email, HTML, SMS, WAP

FIPA ACL, KQML, DAML

SQLHTTP, KQML, DAML, Prolog

MapBlast, CiteSeer, Google, …HTTP

HTTP, WebScraping

WebServices

Ittalks.org a database driven web site of IT related talks at UMBC and other institutions. The database contains information on

– Seminar events– People (speakers, hosts, users, …)– Places (rooms, institutions, …)

This database is used to dynamically generate web pages and DAML descriptions for the talks and related information and serves as a focal point for agent-based services relating to these talks. To add your organization to ittalks.org and receive a domain (e.g., mit.ittalks.org) contact [email protected]. See http://daml.org/ for more information on DAML and the semantic web.

Acknowledgements: Funded by DARPA, contract F30602-00-2-0591. Students: H. Chen, F. Perich, L. Kagal, S. Tolia, Y. Zou, J. Sachs, S. Kumar and M. Gandhe. Faculty: T. Finin, A. Joshi, Y. Peng, R. Cost, C. Nicholas, R. Masuoka

http://ittalks.org/

Page 7: 0 DAML Tools for Intelligent Information Annotation, Sharing and Retrieval UMBC Johns Hopkins University Applied Physics Lab MIT Sloan School Tim Finin

7

Travel Agent Game in Agentcities

http://taga.umbc.edu/

TechnologiesTechnologiesFIPA (JADE, April Agent Platform)

Semantic Web (RDF, OWL)

Web (SOAP,WSDL,DAML-S)

Internet (Java Web Start )

FeaturesFeaturesOpen Market FrameworkAuction ServicesOWL message contentOWL OntologiesGlobal Agent Community

Acknowledgements: DARPA contract F30602-00-2-0591 and Fujitsu Laboratories of America.Students: Y. Zou, L. Ding, H. Chen, R. Pan. Faculty: T. Finin, Y. Peng, A. Joshi, R. Cost. 8/03

MotivationMotivationMarket dynamicsAuction theory (TAC)Semantic webAgent collaboration (FIPA & Agentcities)

Travel Agents

Auction Service Agent

Customer Agent

Bulletin BoardAgent

Market Oversight Agent

Request

Direct Buy

Report Direct Buy Transactions

BidBid

CFP

Report Auction Transactions

Report Travel Package

Report Contract

Proposal

Web Service Agents

OntologiesOntologieshttp://taga.umbc.edu/ontologies/

travel.owl – travel concepts

fipaowl.owl – FIPA content lang.

auction.owl – auction services

tagaql.owl – query language

FIPA platform infrastructure services, including directory facilitators enhanced to use DAML-S for service discovery

Page 8: 0 DAML Tools for Intelligent Information Annotation, Sharing and Retrieval UMBC Johns Hopkins University Applied Physics Lab MIT Sloan School Tim Finin

8Contributors include Tim Finin, Anupam Joshi, Yun Peng, R. Scott Cost, Jim Mayfield, Joel Sachs, Pavan Reddivari, Vishal Doshi, Rong Pan, Li Ding, and Drew Ogle. Partial research support was provided by DARPA contract F30602-00-0591 and by NSF by awards NSF-ITR-IIS-0326460 and NSF-ITR-IDM-0219649. November 2004.

http://swoogle.umbc.edu/

Swoogle uses four kinds of crawlers to discover semantic web documents and several analysis agents to compute metadata and relations among documents and ontologies. Metadata is stored in a relational DBMS. Services are provided to people and agents.

SWD RankA SWD’s rank is a function of its type (SWO/SWI) and the rank and types of the documents to which it’s related.

SWOs

SWIs

HTMLdocuments

Images

CGI scripts

Audiofiles

Videofiles

The web, like Gaul, is divided into three parts: the regular web (e.g. HTML), Seman- tic Web Ontologies (SWOs), and Semantic Web Instance files (SWIs)

SWD = SWO + SWI

Swoogle Statistics

Swoogle puts documents into a character n-gram based IR engine to compute document similarity and do retrieval from queries

SWD IR Engine

SWOOGLE 2

SWD Metadata

Web Service

Web Server

SWD Cache

The Web

The WebCandidate

URLs Web Crawler

SWD Reader

IR analyzer SWD analyzer

Human users

Intelligent Agents

discovery

digest

analysis

service

OntologyDictionary

SwoogleSearch

SwoogleStatisticsOntology Dictionary

Swoogle Search

Swoogle provides services to people via a web interface and to agents as web services.

SWDs 310,000 Classes 95,000

Triples 45,000,000 Properties 53,000

Ontologies 4,200 Individuals 6,800,000

Statistics as of November 2004

Page 9: 0 DAML Tools for Intelligent Information Annotation, Sharing and Retrieval UMBC Johns Hopkins University Applied Physics Lab MIT Sloan School Tim Finin

9

OWL Search usingOWL

document SwangleTool

JENA

OWLdocument

+Swangled

triples

QuickTime™ and aTIFF (Uncompressed) decompressor

are needed to see this picture.

SwangleSearchQuery Triple

OWLdocument

+Swangled

triples

OWL Triples

<j.0:SwangledTriple> <rdfs:comment>Swangled text for [<http://www.xfront.com/owl/ontologies/camera/#Digital>, <http://www.w3.org/2000/01/rdf-schema#subClassOf>, <http://www.xfront.com/owl/ontologies/camera/#PurchaseableItem> ] </rdfs:comment> <j.0:swangledText>BE52HVKU5GD5DHRA7JYEKRBFVQ</j.0:swangledText> <j.0:swangledText>WS4KYRWMO3OR3A6TUAR7IIIDWA</j.0:swangledText> <j.0:swangledText>2THFC7GHXLRMISEOZV4VEM7XEQ</j.0:swangledText> <j.0:swangledText>HO2H3FOPAEM53AQIZ6YVPFQ2XI</j.0:swangledText> <j.0:swangledText>6P3WFGOWYL2DJZFTSY4NYUTI7I</j.0:swangledText> <j.0:swangledText>N656WNTZ36KQ5PX6RFUGVKQ63A</j.0:swangledText> <j.0:swangledText>IIVQRXOAYRH6GGRZDFXKEEB4PY</j.0:swangledText></j.0:SwangledTriple>

WebSearchEngine

FiltersSemanticMarkup

InferenceEngine

LocalKB

SemanticMarkup

SemanticMarkup

Extractor

Encoder

RankedPages

SemanticWeb Query

EncodedMarkup

TextQuery

TextFiltersText

[<http://www.xfront.com/owl/ontologies/camera/#Digital>, <http://www.w3.org/2000/01/rdf-schema#subClassOf>, <http://www.xfront.com/owl/ontologies/camera/#PurchaseableItem>]

Inference

[<http://www.xfront.com/owl/ontologies/camera/#Digital>, <http://www.w3.org/2000/01/rdf-schema#subClassOf>, <http://www.xfront.com/owl/ontologies/camera/#PurchaseableItem>] [<http://www.xfront.com/owl/ontologies/camera/#Digital>, <http://www.w3.org/2002/07/owl#Thing>, <http://www.xfront.com/owl/ontologies/camera/#PurchaseableItem>] [<http://www.xfront.com/owl/ontologies/camera/#Digital> ,

< http://www.w3.org/2002/07/owl#Thing >, < http://www.w3.org/2002/07/owl#Thing >]

Swangler

Vision

Indexing and Search

Swangler

Page 10: 0 DAML Tools for Intelligent Information Annotation, Sharing and Retrieval UMBC Johns Hopkins University Applied Physics Lab MIT Sloan School Tim Finin

10Student: Z. Ding. Faculty: Dr. Y. Peng, Dr. T. Finin, Dr. A. Joshi. Ack: DARPA Contract F30602-00-2-0591. 09/03.

An example of translated Bayesian network ontologyAn example of translated Bayesian network ontology

Unified Ontology Support using Bayesian NetworksUnified Ontology Support using Bayesian Networks

MotivationMotivationReasoning within single ontologies is uncertain due to noisy or incomplete dataDL based reasoning is overgeneralized and inadequateRelations between concepts in different ontologies are inherently uncertainOWL is the new standard proposed by W3C for ontology representation

Translation ProcessTranslation ProcessExtending OWL for probability annotationDefining a set of structural translation rules to translate an OWL RDF graph to the directed acyclic graph (DAG) of Bayesian network (BN)Constructing conditional probability tables (CPTs) for individual variables in the DAG of BN

ApproachApproachTranslating OWL ontologies to Bayesian networks (BNs)Concept mapping as conditional probabilitiesJoining translated BNs (by probabilistic mappings) dynamicallySupport ontology reasoning (within and across ontologies) as Bayesian inference

Page 11: 0 DAML Tools for Intelligent Information Annotation, Sharing and Retrieval UMBC Johns Hopkins University Applied Physics Lab MIT Sloan School Tim Finin

Context Broker Architecture

CoBrA FeaturesCoBrA Features[1] Use Semantic Web languages for context modeling and reasoning

[2] Use logic inference to detect and resolve inconsistent context knowledge

[3] Extend the REI policy language for privacy protection

[4] Adopt the FIPA standards for communication & knowledge sharing

EasyMeeting an intelligent meeting room prototype that provides services for speakers, audience & organizers based on their situational needs.

http://cobra.umbc.edu

Page 12: 0 DAML Tools for Intelligent Information Annotation, Sharing and Retrieval UMBC Johns Hopkins University Applied Physics Lab MIT Sloan School Tim Finin

12/16 UMBC/JHU/MIT 11/19/2004

• A slide on SweetRules, SWRL and RuleML

Intelligent Information Annotation, Sharing and Retrieval Adding Rules to OWL

Page 13: 0 DAML Tools for Intelligent Information Annotation, Sharing and Retrieval UMBC Johns Hopkins University Applied Physics Lab MIT Sloan School Tim Finin

13/16 UMBC/JHU/MIT 11/19/2004

RGB: Research Group in a Boxeating our own dog food

• The UMBC ebiquity lab’s portal exposesits content in RDF using a set of OWL ontologies for papers, people, projects, photos, etc.

• Lab members can add arbitrary RDF assertions about any of the portal’s objects.

• It’s designed as a modular, configurable package using O/S software that others can use to create their own portals.

Page 14: 0 DAML Tools for Intelligent Information Annotation, Sharing and Retrieval UMBC Johns Hopkins University Applied Physics Lab MIT Sloan School Tim Finin

14/16 UMBC/JHU/MIT 11/19/2004

Intelligent Information Annotation, Sharing and Retrieval Transition/Handoff

• Specifications– Participated in WEBONT, SWSA, SWSI, RuleML

• Software products– Damlator, Swangler, F-OWL, SweetRules, Rei

• Major demonstration systems– ITTALKS, TAGA, ebiquity site

• Services– Swoogle

• Papers– ~100 papers in journals, books, conferences and workshops, ~10 MS

theses, 5 PhD dissertations

• Users– Swoogle has many users, Rei used by ~4 other groups, … more

needed here

Page 15: 0 DAML Tools for Intelligent Information Annotation, Sharing and Retrieval UMBC Johns Hopkins University Applied Physics Lab MIT Sloan School Tim Finin

15/16 UMBC/JHU/MIT 11/19/2004

Intelligent Information Annotation, Sharing and Retrieval Remaining Issues

Issue Remediation

SweetRules plan not completed

Complete design and implementation…

Rei policy langu-age needs rules

Incorporate SWRL/RuleML languages. Can we compile Rei policies to OWL+SWRL?

Swoogle needs instance data

Extend Swoogle’s database to include all instance data. But, how well will it scale?

Swangler lacks query I/F

Build a query interface for swangled Semantic Web documents.

IR over mixed RDF and text

Develop tools to enable Swangler and Swoogle to work with documents that embed RDF in XHTML.

Bayes OWL demonstration

Application to ontology mapping will require additional basic research. Can probabilities be estimated from Swoogle data?

SweetRules toolkit

Populate the SweetRules toolkit with open sourced implementations

Page 16: 0 DAML Tools for Intelligent Information Annotation, Sharing and Retrieval UMBC Johns Hopkins University Applied Physics Lab MIT Sloan School Tim Finin

16/16 UMBC/JHU/MIT 11/19/2004

Intelligent Information Annotation, Sharing and Retrieval Summary

We’ve demonstrated that• OWL enables agents in open, dynamic environments to

share knowledge and manage privacy, commitments, trust, and security.– Demonstration: ITTALKS system, TAGA multiagent system, SOUPA

ontology for pervasive computing

• OWL can be integrated with other KR paradigms for real world reasoning– Demonstration: SweetJess, SweetRules, Bayes OWL, Rei

• OWL is compatible with the information retrieval paradigm– Demonstration: Swoogle as an RDF search engine, Swangling

permits IR systems to index and retrieve documents with text, multimedia, and knowledge.

• OWL facilitates the sharing of knowledge thru web portals– Demonstration: ebiquity web pages

Page 17: 0 DAML Tools for Intelligent Information Annotation, Sharing and Retrieval UMBC Johns Hopkins University Applied Physics Lab MIT Sloan School Tim Finin

17/16 UMBC/JHU/MIT 11/19/2004

Intelligent Information Annotation, Sharing and Retrieval Backup