PoolParty Search Server
DESCRIPTION
A technical overview of the PoolParty Semantic Search Server: how to use SKOS thesauri to map data from different sources, how to generate a semantic index, and how to build precise faceted search.
TRANSCRIPT
PoolParty
PoolParty Search Server 2.0
Alexander Kreiser
Agenda
• System Architecture
• Indexing and Mapping
• Search API
System Architecture
poolparty product family
Document Index
Search Services
Search Application
metadata
mapping
System Architecture
components
Search Services
Search Application
Collector
<xml>
Semantic Indexer
Document Index
Cartridge
System Architecture
• Built upon robust, flexible and well-documented frameworks and technologies:
– Apache Tomcat 6.0
– Spring Framework 3.0.5
– Spring Security 3.0.3
– ExtJS 3.3.1
• Using SKOS (RDF) as metadata format – standardized by the W3C
technologies
Indexing and Mapping
• Rich metadata in a standardized, extensible format (SKOS / RDF)
• Central metadata repository
• Thesaurus is managed with PoolParty
• Document metadata is mapped to concepts in the thesaurus
• Semantic information improves search experience
concepts
Indexing and Mapping
mapping
Title: Document Title
Created: 2010-03-15
M_Countries: Austria
M_Vendors: Atlassian, Apache
Abstract: …
Document
PoolParty Project
Concept Scheme
Concept Scheme
Concepts
Concepts
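The mapping shown on this slide can be sketched in Python. This is a minimal illustration, not PoolParty's implementation: the FIELD_TO_SCHEME table, the scheme names "Countries" and "Vendors", and the helper functions are assumptions made for the example. Only the field names and values come from the slide.

```python
# Hypothetical mapping from document metadata fields to concept schemes.
# The scheme names are assumptions; the slide only shows the field names.
FIELD_TO_SCHEME = {
    "M_Countries": "Countries",
    "M_Vendors": "Vendors",
}

# Document metadata as shown on the slide (abstract omitted).
document = {
    "Title": "Document Title",
    "Created": "2010-03-15",
    "M_Countries": "Austria",
    "M_Vendors": "Atlassian, Apache",
}


def split_values(raw):
    """Split a comma-separated metadata field into trimmed values."""
    return [value.strip() for value in raw.split(",") if value.strip()]


def mapped_values(doc):
    """Return {concept scheme: [values]} for every mapped field in doc."""
    result = {}
    for field, scheme in FIELD_TO_SCHEME.items():
        if field in doc:
            result[scheme] = split_values(doc[field])
    return result


print(mapped_values(document))
```

Fields without an entry in the table (Title, Created) are left out of the mapping, so only the metadata fields prefixed with M_ are routed to a concept scheme in this sketch.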
Indexing and Mapping
Collector
Semantic Indexer
Document Index
<xml>
M_Countries: Austria
M_Vendors: Atlassian, Apache
process
Indexing and Mapping
Collector
Semantic Indexer
Document Index
M_Countries: Austria
M_Vendors: Atlassian, Apache
<rdf>
process
Indexing and Mapping
Collector
Semantic Indexer
Document Index
M_Countries: http://company.com/thesaurus/AT
M_Vendors: http://company.com/thesaurus/Atlassian, http://company.com/thesaurus/Apache
enriched
<rdf>
process
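The enrichment step, in which plain metadata values are replaced by concept URIs, might look roughly like the Python sketch below. The label-to-URI table reuses the example URIs from the slide; the dictionary lookup and the rule of keeping unmatched values unchanged are assumptions for illustration, not a description of how the Semantic Indexer works internally.

```python
# Label-to-URI table built from the example values on the slide.
THESAURUS = {
    "Austria": "http://company.com/thesaurus/AT",
    "Atlassian": "http://company.com/thesaurus/Atlassian",
    "Apache": "http://company.com/thesaurus/Apache",
}


def enrich(values):
    """Replace each known label with its concept URI.

    Unknown labels are returned unchanged (an assumption for this sketch).
    """
    return [THESAURUS.get(value, value) for value in values]


# Metadata as delivered by the collector, before enrichment.
metadata = {
    "M_Countries": ["Austria"],
    "M_Vendors": ["Atlassian", "Apache"],
}

enriched = {field: enrich(values) for field, values in metadata.items()}

for field, uris in enriched.items():
    print(field + ": " + ", ".join(uris))
```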
Indexing and Mapping
Document Index
Search Services
Search Application
metadata
Language: English
Language: German
"Austria"
"Österreich"
localized metadata
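Because the index stores concept URIs rather than plain strings, a display label can be chosen per language at query time. The Python sketch below illustrates the idea under stated assumptions: the language codes "en" and "de", the LABELS table, and the fallback rule are hypothetical; only the two labels and the example URI come from the slides.

```python
AT = "http://company.com/thesaurus/AT"

# Labels per concept URI and language (language codes are an assumption).
LABELS = {
    AT: {"en": "Austria", "de": "Österreich"},
}


def label_for(uri, lang, fallback="en"):
    """Return the label in the requested language.

    Falls back to the fallback language, then to the URI itself.
    """
    labels = LABELS.get(uri, {})
    return labels.get(lang) or labels.get(fallback) or uri


print(label_for(AT, "en"))
print(label_for(AT, "de"))
```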
Indexing and Mapping
• New metadata values are
– automatically created in the thesaurus in a special container (Free Concepts from PPX)
– put into the mapped concept scheme, e.g. "Regions"
• Thesaurus manager can edit and/or merge the new concepts
new metadata
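The handling of unknown metadata values can be sketched as follows. This Python example treats the two sub-bullets above as two possible destinations for a new concept, which is one reading of the slide. The container name "Free Concepts", the scheme "Regions" and the base URI come from the slides; the value "Tyrol", the URI minting rule and the function signature are hypothetical.

```python
from urllib.parse import quote

BASE = "http://company.com/thesaurus/"  # base URI from the slides' example
FREE_CONCEPTS = "Free Concepts"         # container name from the slide


def resolve(value, scheme, thesaurus, use_mapped_scheme=False):
    """Return the concept URI for value, creating a new concept if unknown.

    Existing concepts are looked up in the mapped scheme and in the
    free-concepts container. New concepts go to the free-concepts
    container by default, or into the mapped scheme if requested.
    """
    for container in (scheme, FREE_CONCEPTS):
        concepts = thesaurus.get(container, {})
        if value in concepts:
            return concepts[value]
    target = scheme if use_mapped_scheme else FREE_CONCEPTS
    uri = BASE + quote(value)  # hypothetical URI minting rule
    thesaurus.setdefault(target, {})[value] = uri
    return uri


thesaurus = {"Regions": {"Austria": "http://company.com/thesaurus/AT"}}

known = resolve("Austria", "Regions", thesaurus)  # existing concept
new = resolve("Tyrol", "Regions", thesaurus)      # hypothetical new value

print(known)
print(new)
print(sorted(thesaurus))
```

A thesaurus manager would then review the entries collected under "Free Concepts" and edit or merge them, as the last bullet describes.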
Search API
• Available web services:
– Search Service
– Suggest Service
– Similarity Service
• Supported formats:
– JSON
– XML
– RSS
services and formats
Search API
• Search Service
– 10 parallel clients (5000 total requests)
– Time: 31 ms – 2000 ms (average: 303 ms) (network delay: min. 25 ms)
performance tests
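A test of this shape (10 parallel clients, 5000 requests in total, reporting minimum, maximum and average response time) could be reproduced with a small harness like the Python sketch below. The slides do not say which tool produced the figures, so this is only an assumed setup; stub_search is a placeholder that does local work and must be replaced with a real call to the service under test.

```python
import time
from concurrent.futures import ThreadPoolExecutor
from statistics import mean


def timed(request_fn):
    """Run one request and return its duration in milliseconds."""
    start = time.perf_counter()
    request_fn()
    return (time.perf_counter() - start) * 1000.0


def run_load_test(request_fn, clients=10, total_requests=5000):
    """Issue total_requests calls using `clients` parallel workers."""
    with ThreadPoolExecutor(max_workers=clients) as pool:
        timings = list(pool.map(timed, [request_fn] * total_requests))
    return {
        "count": len(timings),
        "min_ms": min(timings),
        "max_ms": max(timings),
        "avg_ms": mean(timings),
    }


def stub_search():
    """Placeholder for a real search request (does trivial local work)."""
    sum(range(1000))


stats = run_load_test(stub_search)
print(stats)
```

With a real HTTP call in place of the stub, the measured times include network delay, which is why the slide reports the minimum delay separately.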
Search API
• Suggest Service
– 10 parallel clients (5000 total requests)
– Time: 31 ms – 1094 ms (average: 118 ms) (network delay: min. 25 ms)
performance tests
Contact
Andreas Blumauer
Managing [email protected]
+43-1-8974122-27
Alexander Kreiser
System [email protected]
+43-1-8974122-38
punkt. netServices GmbH
Lerchenfelder Guertel 43
A-1160 Wien / Austria
http://www.punkt.at/ http://poolparty.punkt.at/
http://twitter.com/PoolParty_Team
http://www.facebook.com/pages/PoolParty/134189476633941