woogle4mediawiki - social search and knowledge sharing for mediawiki

31
Woogle4MediaWiki Social search and knowledge sharing for MediaWiki Hans-Jörg Happel, FZI 2010-05-03 @Ontoprise Halo Team

Upload: hans-joerg-happel

Post on 12-May-2015

819 views

Category:

Technology


0 download

DESCRIPTION

Woogle4MediaWiki - a search extension for MediaWiki

TRANSCRIPT

Page 1: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

Woogle4MediaWiki Social search and knowledge sharing for MediaWiki

Hans-Jörg Happel, FZI2010-05-03 @Ontoprise Halo Team

Page 2: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

Agenda• Motivation• Features / Demo• Technical Details• Outlook & Discussion

Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009 2

Page 3: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

About me• Working as a researcher at FZI/WIM

since 2005• Core Topics:

– Knowledge Management, Software Engineering, Collaboration, Semantic Wikis

• Currently wrapping up PhD Thesis– Woogle4MediaWiki is one of the results– This Wednesday@AIFB

• Graduiertenkolloquium Angewandte Informatik „Bedarfsgetriebener Wissensaustausch in verteilten Teams“; 05. Mai 2010, 15:45 Uhr

Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009 3

Page 4: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

MOTIVATION

Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009 4

Page 5: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

Wikis• Web-based repositories for sharing

consensual knowledge in a collaborative fashion

• In enterprises, Wikis often act as a layer of "glue" code relating content from other information systems and capturing all the rest– Wikis are a typical entry point for information

seeking– People are often confused why and what to

put into the Wiki

5Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009

Page 6: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

There are two kinds of people in a Wiki…

Readers• Have information needs

– But only few means for articula-ting such needs (e.g. search)

• Have collaboration needs– Search is an inherently social

activity (40,4% of users interact with others before and during search; 58,7% share information after search [EC08])

Contributors• Have limited ressources

(time/effort) and face futher barriers (motivation, privacy)

• Choices on contributions are seldom based on demand of readers (lack of guidance)

• Wiki communities stress the relevance of content („what is worth to be documented“)

6

• Knowledge sharing is a communication process• Artificial decoupling of readers and contributors in Wikis• Only few readers become contributorsSocial Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009

Page 7: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

• Search in MediaWiki– No query logs– „Red links“ (links to non-existing pages) are

the basic means to „demand“ content– No collaboration

7Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009

Page 8: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

Contributing in MediaWiki

• Are Wikis really contribution-friendly?– „Red links“ do not tell how much this

information is required– The „empty-page“ phenomenon (How to

start an article)

8Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009

Page 9: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

(Media)Wikis can do better• Improve search with collaborative

features• Allow people to express information

needs• Provide a first-order representation for

information needs• Guide contributions by aggregating

information needs („need-driven knowledge sharing“)

9Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009

Page 10: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

Need-driven Knowledge Sharing (NKS)

10

Collect information needs (e.g. from query logs)

Collect information needs (e.g. from query logs)

Calculate aggegate unsatisfiedinformation needs

Calculate aggegate unsatisfiedinformation needs

Identify private files worth sharing („Inverse Search“

[Hap08b])

Combine Wikis and Search („Woogle“)

Combine Wikis and Search („Woogle“)

Page 11: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

FEATURES / DEMO

Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009 11

Page 12: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

Searching MediaWiki and beyond…• Can replace MediaWiki built-in search or

act as an additional search engine (user configurable!)

• Two flavours– WoogleNative: provides Lucene search

features in plain PHP– WoogleRemote: uses backend which

supports

Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009 12

Page 13: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

Every query can be a Wiki page…• Allows for

– Wiki-Text („Social Did-You-Mean“)

– Discussion page– Watches/Notifications

(WoogleAlerts)

• Shows basic statistics about query– „How many people

searched for that?“

Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009 13

Page 14: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

Social Search• Add, remove and re-rank query results• Add comments

• Beyond that: – How to support truly collaborative information seeking (for

researchers, lawyers etc!)?

Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009 14

Page 15: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

Demand guidance• Woogle uses queries and further metadata to qualify desired

content• Users are informed how strong the information need is, and in

which context it arises

15Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009

Page 16: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

Statistics

Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009 16

Page 17: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

Woogle Configuration

Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009 17

Page 18: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

TECHNICAL DETAILS

Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009 18

Page 19: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

WoogleNative• First MediaWiki extension for running on

Zend Lucene Search (= Java Lucene Port for PHP)

• Performance experiences:– Can handle Wikis up to 50.000 pages

(Wikipedia size) and serves normal queries in >1 sec

– Worse for Wildcard queries etc.

Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009 19

Page 20: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

WoogleRemote• Connects to TeamWeaverIS via webservices

– Connecting different search engines generally possible

• TeamWeaverIS– Yet another Lucene-based search framework– Supports Query-Logs-As-A-Service

(Anonymized, aggregate query & click logging)– Comes with crawlers for File shares, Web, SVN,

CVS, Bugzilla, JIRA, JDBC, Serveral Wikis– Comes with search UIs & configuration files

Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009 20

Page 21: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

Aggregate query & click logging

Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009 21

Page 22: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

Implementation Trivia• Search UI

– Completely intercepts MW search process – indenpendent of MW search interfaces

• Red Links– Realized via JavaScript/jQuery (parses red links, retrieves needs from

backend batch-wise) to avoid patching MW

• Very basic implementation of access rights (via MediaWiki groups)

• Instrumentation addon– Allows to re-configure most system features based on user groups– Provides evaluation participation dialog– Logs various user behaviour data

Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009 22

Page 23: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

docdoc

twIS Backend(Public Index)twIS Backend(Public Index)

docdoc

docdoc

docdoc

User 2

docdoc

docdoc

docdoc

User 1

twIS EclipsetwIS Eclipse twIS Eclipse RCPtwIS Eclipse RCP

JSF FrontendJSF Frontend

Web BrowserWeb Browser

WoogleWoogle

twIS Backend 2twIS Backend 2

twIS Backend 3twIS Backend 3

DirectAuthDirectAuthAuth via

WikiAuth via

Wiki

Push IndexerPush Indexer

DirectAuthDirectAuth

Search, Crawl

Search, Crawl

Push IndexPush Index

CrawlCrawl

SearchSearch

Big Picture

TeamWeaver Integrated SearchSimple Lucene-based toolkit for searching (crawling + Web

and Eclipse frontends) different data sources (e.g. files, SVN, CVS, Ticket Systems, Wikis); plus provides DB-based query

logging and partially semantic features (metadata extraction, ontology-based working context model in Eclipse)

TeamWeaver Integrated SearchSimple Lucene-based toolkit for searching (crawling + Web

and Eclipse frontends) different data sources (e.g. files, SVN, CVS, Ticket Systems, Wikis); plus provides DB-based query

logging and partially semantic features (metadata extraction, ontology-based working context model in Eclipse)

Inverse Search is an Eclipse-based Desktop search tool for selecting local/private documents for sharing based on a

TeamWeaver Integrated Search backend

Inverse Search is an Eclipse-based Desktop search tool for selecting local/private documents for sharing based on a

TeamWeaver Integrated Search backend

Woogle is an extension for MediaWiki, which can access a a TeamWeaver Integrated Search backend. It leverages query statistics to guide knowledge sharing in the Wiki.

Woogle has further collaboration features upcoming (e.g. Google SearchWiki-style social ranking).

Woogle is an extension for MediaWiki, which can access a a TeamWeaver Integrated Search backend. It leverages query statistics to guide knowledge sharing in the Wiki.

Woogle has further collaboration features upcoming (e.g. Google SearchWiki-style social ranking).

Page 24: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

OUTLOOK

Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009 24

Page 25: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

Outlook• Roadmap for Woogle4MediaWiki

– Lightweight integration with SMW (i.e. using semantic in result snippets)

– Social search– Request feature– Support users creating redirects

• Further (S)MW-related stuff– Semantic Need (= Woogle for SMW)– SMW4SE, SRF_UML

Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009 25

Page 26: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

Summary• Woogle addresses the atificial separation of information seeking

and information provision in Wikis– Offers information seekers means to express needs– Guides contributors by aggregating demands

• Find downloads, demos, slides etc. at– http://www.mediawiki.org/wiki/Extension:Woogle4MediaWiki– http://www.teamweaver.org– http://amazonas.fzi.de/wooglenative/– Feedback & additional evaluation partners sought!

• Questions?

• Discussion– Any recommendations concerning templating/MVC?– Your experience with (UI)-Testing and Continuous Integration?– Your experience with UI surveys?– What would „Social Semantic Search“ mean to you?

26Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009

Page 27: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

BACKUP SLIDES

Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009 27

Page 28: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

28

Collaborative description of the information need

Notifications concerningsearch activities

Discussion page

Directly create new knowledge from the search dialog

Search has a URI and can be linked from within the Wiki(one page per search term)

Search across different data sources/types (WoogleRemote)

Additional clues with meta-information about a need

http://localhost/wiki/Woogle:Woogle4MediaWiki

Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009

Page 29: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

Demand guidance

Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009 29

Page 30: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009 30

Page 31: Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki

Literature• [EC08] Evans, B. M. and Chi, E. H. 2008. Towards a model of understanding social search. In

Proceedings of the ACM 2008 Conference on Computer Supported Cooperative Work (San Diego, CA, USA, November 08 - 12, 2008). CSCW '08. ACM, New York, NY, 485-494.

• [Hap08a] Hans-Jörg Happel: Growing the Semantic Web with Inverse Semantic Search. In Proceedings of the 1st Workshop on Incentives for the Semantic Web (INSEMTIVE '08)

• [Hap08b] Hans-Jörg Happel: Closing Information Gaps with Inverse Search. In Proceeedings of the 7th International Conference on Practical Aspects of Knowledge Management (PAKM2008)

• [Hap09a] Hans-Jörg Happel: Woogle – On Why and How to Marry Wikis with Enterprise Search. In Proceedings of the 2nd Workshop on Integrated Knowledge Management Systems (IKMS2009)

• [Hap09b] Hans-Jörg Happel: Towards Need-driven Knowledge Sharing in Distributed Teams. In Proceedings of the 9th International Conference on Knowledge Management (I-KNOW 2009)

• [Hap09c] Hans-Jörg Happel: Social search and need-driven knowledge sharing in Wikis with Woogle. In Proceedings of the 5th international Symposium on Wikis and Open Collaboration (Orlando, Florida, October 25 - 27, 2009). WikiSym '09. ACM, New York, NY, 1-10.

31Social Search and Need-driven Knowledge Sharing with Woogle / WikiSym 2009