using the structure of dbpedia for exploratory search
DESCRIPTION
Presentation at Mining Data Semantics in Heterogeneous Networks Workshop at KDD 2013TRANSCRIPT
![Page 1: Using the Structure of DBpedia for Exploratory Search](https://reader033.vdocuments.site/reader033/viewer/2022042813/546f47d5b4af9f040c8b4671/html5/thumbnails/1.jpg)
Using the Structure ofDBpedia for ExploratorySearch
Speaker: Samantha LamSupervisor: Conor Hayes
![Page 2: Using the Structure of DBpedia for Exploratory Search](https://reader033.vdocuments.site/reader033/viewer/2022042813/546f47d5b4af9f040c8b4671/html5/thumbnails/2.jpg)
Motivating Work
DBpedia - heterogeneous graph
2
![Page 3: Using the Structure of DBpedia for Exploratory Search](https://reader033.vdocuments.site/reader033/viewer/2022042813/546f47d5b4af9f040c8b4671/html5/thumbnails/3.jpg)
Motivating Work
Background
Network Similarity: PathSim, NetClus, RankClus
Faceted Search: Facets for refining search
specific schema, (semi) supervised
→ good for search when user is familiar with query
→ ...but what about complete beginners?
→ Requires Exploratory Search – Unsupervised
3
![Page 4: Using the Structure of DBpedia for Exploratory Search](https://reader033.vdocuments.site/reader033/viewer/2022042813/546f47d5b4af9f040c8b4671/html5/thumbnails/4.jpg)
Motivating Work
Background
Network Similarity: PathSim, NetClus, RankClus
Faceted Search: Facets for refining search
specific schema, (semi) supervised
→ good for search when user is familiar with query
→ ...but what about complete beginners?
→ Requires Exploratory Search – Unsupervised
3
![Page 5: Using the Structure of DBpedia for Exploratory Search](https://reader033.vdocuments.site/reader033/viewer/2022042813/546f47d5b4af9f040c8b4671/html5/thumbnails/5.jpg)
Motivating Work
Background
Network Similarity: PathSim, NetClus, RankClus
Faceted Search: Facets for refining search
specific schema, (semi) supervised
→ good for search when user is familiar with query
→ ...but what about complete beginners?
→ Requires Exploratory Search – Unsupervised
3
![Page 6: Using the Structure of DBpedia for Exploratory Search](https://reader033.vdocuments.site/reader033/viewer/2022042813/546f47d5b4af9f040c8b4671/html5/thumbnails/6.jpg)
Exploratory Search?
Given query, how to organise results in a manner that is ‘useful’,i.e. aids exploratory search
E.g. suppose you hear a song on the radio...
Solution:
Classify results according to its contexts
Why? Alleviates in-depth reading and guides user
4
![Page 7: Using the Structure of DBpedia for Exploratory Search](https://reader033.vdocuments.site/reader033/viewer/2022042813/546f47d5b4af9f040c8b4671/html5/thumbnails/7.jpg)
Exploratory Search?
Given query, how to organise results in a manner that is ‘useful’,i.e. aids exploratory search
E.g. suppose you hear a song on the radio...
Solution:
Classify results according to its contexts
Why? Alleviates in-depth reading and guides user
4
![Page 8: Using the Structure of DBpedia for Exploratory Search](https://reader033.vdocuments.site/reader033/viewer/2022042813/546f47d5b4af9f040c8b4671/html5/thumbnails/8.jpg)
Assumption
similarity ⊂ relatedness
5
![Page 9: Using the Structure of DBpedia for Exploratory Search](https://reader033.vdocuments.site/reader033/viewer/2022042813/546f47d5b4af9f040c8b4671/html5/thumbnails/9.jpg)
Research Questions
1 Can we provide an effective graph-based framework that canaid exploratory search?
2 To do this, what is DBpedia’s graph structures wrt itsdifferent datasets?
6
![Page 10: Using the Structure of DBpedia for Exploratory Search](https://reader033.vdocuments.site/reader033/viewer/2022042813/546f47d5b4af9f040c8b4671/html5/thumbnails/10.jpg)
DBpedia graphs summary
Infobox properties
emergent, crowd-sourcedheterogeneous ‘types’dense
Infobox ontology, SKOS/Wiki Category, YAGO
agreed rulesis-A structuresparse, tree-like
Infobox good forGGGGGGGGGGA Relatedness
Ontology good forGGGGGGGGGGA Labelling similar items
7
![Page 11: Using the Structure of DBpedia for Exploratory Search](https://reader033.vdocuments.site/reader033/viewer/2022042813/546f47d5b4af9f040c8b4671/html5/thumbnails/11.jpg)
DBpedia graphs summary
Infobox properties
emergent, crowd-sourcedheterogeneous ‘types’dense
Infobox ontology, SKOS/Wiki Category, YAGO
agreed rulesis-A structuresparse, tree-like
Infobox good forGGGGGGGGGGA Relatedness
Ontology good forGGGGGGGGGGA Labelling similar items
7
![Page 12: Using the Structure of DBpedia for Exploratory Search](https://reader033.vdocuments.site/reader033/viewer/2022042813/546f47d5b4af9f040c8b4671/html5/thumbnails/12.jpg)
Research Q1 Proposition
General Framework:
8
![Page 13: Using the Structure of DBpedia for Exploratory Search](https://reader033.vdocuments.site/reader033/viewer/2022042813/546f47d5b4af9f040c8b4671/html5/thumbnails/13.jpg)
Sample Query & Results
Query: Lisa Hannigan
Two methods Weighted (W) and Uniform (U), 6 clusters
Cluster 1 (W, U) instruments
Top label: (W, U) Musical instruments
Cluster 2 (W) songs (U) album and songs
Top label: (W) Songs by artist (U) Albums by artist
Cluster 3 (W) albums (U) album, music genres and songs
Top label: (W) Albums by artist (U) Music subgenres by genre
9
![Page 14: Using the Structure of DBpedia for Exploratory Search](https://reader033.vdocuments.site/reader033/viewer/2022042813/546f47d5b4af9f040c8b4671/html5/thumbnails/14.jpg)
Sample Query & Results
Query: Lisa Hannigan
Two methods Weighted (W) and Uniform (U), 6 clusters
Cluster 1 (W, U) instruments
Top label: (W, U) Musical instruments
Cluster 2 (W) songs (U) album and songs
Top label: (W) Songs by artist (U) Albums by artist
Cluster 3 (W) albums (U) album, music genres and songs
Top label: (W) Albums by artist (U) Music subgenres by genre
9
![Page 15: Using the Structure of DBpedia for Exploratory Search](https://reader033.vdocuments.site/reader033/viewer/2022042813/546f47d5b4af9f040c8b4671/html5/thumbnails/15.jpg)
Sample Query & Results
Query: Lisa Hannigan
Cluster 4 (W) mixed, (U) mixed
Top label: (W) Songs by artist (U) Missing people
Cluster 5 (W) mixed, (U) mixed
Top label: (W) Albums by artist (U)Towns and villages in the Republic of Ireland by county
Cluster 6 (W) musicians and bands, (U) musicians and bands
Top label: (W) Place of birth missing (living people) (U)Place of birth missing (living people)
10
![Page 16: Using the Structure of DBpedia for Exploratory Search](https://reader033.vdocuments.site/reader033/viewer/2022042813/546f47d5b4af9f040c8b4671/html5/thumbnails/16.jpg)
Sample Query & Results
Summary:
Weighted produced 4 out of 6 coherent clusters whereasUnweighted only produced 2.
DBpedia Ontology labelling (see paper) provided broaderlabelling for messier clusters, e.g. top label was MusicalWorkfor mixed clusters
→ Categories better for more specific clusters.
11
![Page 17: Using the Structure of DBpedia for Exploratory Search](https://reader033.vdocuments.site/reader033/viewer/2022042813/546f47d5b4af9f040c8b4671/html5/thumbnails/17.jpg)
Ongoing Challenges
Evaluation
User Study:
- compare only Weighted versus Unweighted results,different labelling methods?
Comparison:
- possible to compare against other faceted methods?
- compare with plain list for recall?
12
![Page 18: Using the Structure of DBpedia for Exploratory Search](https://reader033.vdocuments.site/reader033/viewer/2022042813/546f47d5b4af9f040c8b4671/html5/thumbnails/18.jpg)
Summary
Investigated graph structure of DBpedia datasets
Framework to utilise this finding in exploratory search, gaveexample results
Ongoing challenge, evaluation
Thanks for listening! Questions welcome!
13
![Page 19: Using the Structure of DBpedia for Exploratory Search](https://reader033.vdocuments.site/reader033/viewer/2022042813/546f47d5b4af9f040c8b4671/html5/thumbnails/19.jpg)
Summary
Investigated graph structure of DBpedia datasets
Framework to utilise this finding in exploratory search, gaveexample results
Ongoing challenge, evaluation
Thanks for listening! Questions welcome!
13