Semantic Annotation and Search for Resources in the Next Generation Web



DESCRIPTION

Semantic Annotation and Search for Resources in the Next Generation Web. Ajith H. Ranabahu, Amit Sheth, Maryam Panahiazar, Sanjaya Wijeratne. Kno.e.sis Center, Wright State University, Dayton, OH. W3C Workshop on Data and Services Integration, October 20-21, 2011, Bedford, MA, USA.

TRANSCRIPT

Semantic Annotation and Search for Resources in the Next Generation Web

Ajith H. Ranabahu, Amit Sheth, Maryam Panahiazar, Sanjaya Wijeratne
Kno.e.sis Center, Wright State University, Dayton, OH
W3C Workshop on Data and Services Integration
October 20-21, 2011, Bedford, MA, USA
(Slides dated Oct 21, 2011)

Agenda
- The service integration problem
  - What are the patterns we see?
  - What is the best course of action?
- Making the best use of HTML 5 and search engines (Google / Bing)
- Experience from Kino
  - Annotate-index-enhance-search lifecycle for biology-oriented documents
- Kino Web
  - Annotations with schema.org and SA-REST service model
  - SA-REST, Microdata or any other mechanism

Before we start - Our Assertions
- There is no global model or representation
  - Accept it!
  - Instead we can represent one in a universally acceptable way
- Human in the loop is important!
  - Don't forget the guy in the trench
- Grass roots / bottom up
  - Top-down approaches are expensive to adopt

What is the Problem?
- Services are (still) described in multiple ways
- SOAP vs REST debate is not as bad but still exists
- SOAP services have found their home in the enterprise

What is the Problem? (Cont.)
- REST has become the (de facto) standard in the consumer space
- No agreed-upon formal description (WSDL 2.0 / WADL?)
- No specific registry mechanism: developers just Google to find the services
- Several high-profile composition tools failed (Google Mashup Editor, Microsoft Popfly)

What have we learnt?
- Services (and Web APIs / services wrapped by programming language) are primarily composed by humans
  - Read the documentation, copy sample code and use Google generously
- Special purpose indexes and registries do not work
- General purpose search engines (Google / Bing / Yahoo) have become really good

Anticipated Future Trends
- Service consumption and composition is going to remain a semi-automated process
  - Humans will always be part of the process
- General purpose search engines are going to be the key source of data for service composers

Our Primary Premise
- Modification of service descriptions via annotations is the best way to supplement the upcoming service consumption patterns
- We are not alone in this thinking!
  - The trend towards microdata and 'rich snippets'
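As a rough illustration of the premise above, the following Python sketch reads Microdata annotations out of a human-readable service page. It is not taken from the slides: the page fragment, the choice of schema.org type and properties, and the names `MicrodataParser` and `extract_items` are all assumptions made for the example, and the parser only handles flat (non-nested) items.

```python
from html.parser import HTMLParser

# Hypothetical page fragment: a human-readable service description
# annotated with HTML5 Microdata. The schema.org type and property
# names are chosen for illustration only.
PAGE = """
<div itemscope itemtype="http://schema.org/WebPage">
  <h1 itemprop="name">Gene Lookup Service</h1>
  <p itemprop="description">Returns gene records by identifier.</p>
  <a itemprop="url" href="http://example.org/genes">Endpoint</a>
</div>
"""


class MicrodataParser(HTMLParser):
    """Collects itemprop values from flat (non-nested) Microdata items."""

    def __init__(self):
        super().__init__()
        self.items = []     # one dict per itemscope element
        self._prop = None   # itemprop whose text content is being read

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if "itemscope" in attrs:
            self.items.append({"type": attrs.get("itemtype"), "props": {}})
        prop = attrs.get("itemprop")
        if prop and self.items:
            if "href" in attrs:
                # Link elements carry their value in the href attribute.
                self.items[-1]["props"][prop] = attrs["href"]
            else:
                self._prop = prop

    def handle_data(self, data):
        if self._prop and data.strip():
            self.items[-1]["props"][self._prop] = data.strip()
            self._prop = None

    def handle_endtag(self, tag):
        self._prop = None


def extract_items(html):
    """Return the list of annotated items found in an HTML string."""
    parser = MicrodataParser()
    parser.feed(html)
    return parser.items
```

Calling `extract_items(PAGE)` yields one item whose properties could then be handed to an indexer; the same page stays readable for the human who found it through a general search engine.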

A Generic Architecture for the Annotation / Index / Search Cycle
[Diagram; labels: Annotation, Third party Data Sources, Index, Search, Annotation Submission or Acquisition Process, Annotation Enhancement, Document Extraction, Web Documents]

Experience from Kino (KinoE)
- A tool for biologists
- Modify Web pages using SA-REST annotations
- Concepts come from National Center for Biomedical Ontologies (NCBO)
- Use a specialized indexing engine that can parse the annotations and provide faceted searching

KinoE Architecture

[Diagram; labels: NCBO Ontology Access API, NCBO Ontology Repository, Kino Search API, SOLRJ, Kino Index API, SOLR Web Interface, Lucene Index, Kino Browser Plugin, Web Pages, Kino Web Front-end, Other Front-ends, NCBO REST Service, Kino Back-end, Kino browser based annotation, Kino Search Interfaces]

KinoW (Web Edition)
- A more general annotator
- SA-REST Service and Schema.org concepts
- Mechanism can be Microdata or SA-REST
  - Only Microdata at the moment
- Publishing targeted towards the original content providers
  - Use WebDAV / Drupal plugin / Wiki plugin etc.

KinoW Architecture

[Diagram; labels: Schema.org / LOD / other third party concept providers, Kino Browser Plugin, Web Pages, Custom Front-ends, Browser based annotation, Hosted Site, WebDAV / CMS plugins, crawling, Search, Enhancement]

What is possible with this approach?
- General search engine based service discovery
- Annotation driven service discoveries
  - Issue queries in Google to find the services you are interested in (provided Google supports filtering by annotations)
- Formal structures (WSDL / WADL) can be gleaned from the human readable pages
  - Both humans and machines can make use of them
- More opportunities for composition tools

Demonstration

Questions
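The idea of filtering services by their annotations can be sketched with a small in-memory index. This is only a stand-in for a real search index (it does not reproduce the SOLR / Lucene setup named in the KinoE diagram), and the document URLs, facet names and the functions `build_index` and `search` are hypothetical.

```python
from collections import Counter, defaultdict

# Hypothetical annotated documents: URL -> annotations (facet -> concept).
DOCS = {
    "http://example.org/genes": {"domain": "genomics", "format": "json"},
    "http://example.org/proteins": {"domain": "proteomics", "format": "xml"},
    "http://example.org/variants": {"domain": "genomics", "format": "xml"},
}


def build_index(docs):
    """Map each (facet, concept) pair to the set of URLs carrying it."""
    index = defaultdict(set)
    for url, annotations in docs.items():
        for facet, concept in annotations.items():
            index[(facet, concept)].add(url)
    return index


def search(index, docs, **filters):
    """Return URLs matching every facet filter, plus facet counts over the hits."""
    hits = set(docs)
    for facet, concept in filters.items():
        hits &= index.get((facet, concept), set())
    counts = defaultdict(Counter)
    for url in hits:
        for facet, concept in docs[url].items():
            counts[facet][concept] += 1
    return sorted(hits), counts
```

For example, `search(build_index(DOCS), DOCS, domain="genomics")` returns the two genomics pages together with counts per remaining facet value, which is the kind of narrowing a faceted search interface offers.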

Extra: Role of LOD?
- Act as a huge third party data repository?