towards a unified pagerank for dbpedia and wikidata

8
KIT – The Research University in the Helmholtz Association INSTITUTE OF APPLIED INFORMATICS AND FORMAL DESCRIPTION METHODS (AIFB) www.kit.edu An art draw drawn by Felipe Mic Lalli ([email protected] ). Towards a Unified PageRank for DBpedia and Wikidata Andreas Thalhammer 7 th DBpedia Community Meeting 15.09.2016 Leipzig

Upload: andreas-thalhammer

Post on 15-Apr-2017

786 views

Category:

Internet


1 download

TRANSCRIPT

Page 1: Towards a Unified PageRank for DBpedia and Wikidata

KIT – The Research University in the Helmholtz Association

INSTITUTE OF APPLIED INFORMATICS AND FORMAL DESCRIPTION METHODS (AIFB)

www.kit.edu

An art draw drawn by Felipe MicaroniLalli ([email protected]).

Towards a Unified PageRank for DBpedia and WikidataAndreas Thalhammer7th DBpedia Community Meeting 15.09.2016Leipzig

Page 2: Towards a Unified PageRank for DBpedia and Wikidata

AIFB2 03.05.2023

DBpedia (Wikipedia) PageRank

Available at http://people.aifb.kit.edu/ath/#DBpedia_PageRank. Computed since DBpedia 3.8.Since DBpedia 2015-04 included in http://dbpedia.org/sparql. Computed:

Wikipedia link structureConfiguration: 40 iterations, damping factor 0.85, start value 0.1Languages: en, es, de, fr, it, ru, zh

Why “DBpedia”? Link structure is extracted by the DBpedia Extraction Framework

(page links dataset). The dataset is published with DBpedia IRIs.

Andreas Thalhammer - Towards a Unified PageRank for DBpedia and Wikidata

Page 3: Towards a Unified PageRank for DBpedia and Wikidata

AIFB3 03.05.2023

Example: SPARQL result

Andreas Thalhammer - Towards a Unified PageRank for DBpedia and Wikidata

Page 4: Towards a Unified PageRank for DBpedia and Wikidata

AIFB4 03.05.2023

Towards Wikidata PageRank

The DBpedia PageRank dataset has been published with Wikidata URIs since 2015-04.

Approach: Use English DBpedia PageRank and transform URIs.

Problems: Only addresses 5,789,754 entities (Wikidata 15,862,673).Language-specific bias.

Andreas Thalhammer - Towards a Unified PageRank for DBpedia and Wikidata

Page 5: Towards a Unified PageRank for DBpedia and Wikidata

AIFB5 03.05.2023

Towards Wikidata PageRank

DBpedia 2016-04 provides the page links dataset with Wikidata URIs for each language edition.We merged the Wikidata URIs page links data of the ten biggest* language editions:

en , es , fr , de , zh , ru , pt , it , ar , ja

Increased coverage: 10,364,840 (Wikidata 15,862,673).

Particularity: Now we have duplicate links.

Can be leveraged – Hypothesis: reduce language-specific bias.

* Wikipedias with most users and at the same time have > 1M users

Andreas Thalhammer - Towards a Unified PageRank for DBpedia and Wikidata

TimBL WWWde

en

Page 6: Towards a Unified PageRank for DBpedia and Wikidata

AIFB6 03.05.2023

Where would Wikidata PageRank be useful? (1)

(Cross-lingual) entity summarization:

Andreas Thalhammer - Towards a Unified PageRank for DBpedia and Wikidata

Further information: http://km.aifb.kit.edu/services/link/

Page 7: Towards a Unified PageRank for DBpedia and Wikidata

AIFB7 03.05.2023

Where would Wikidata PageRank be useful? (2)

Andreas Thalhammer - Towards a Unified PageRank for DBpedia and Wikidata

Page 8: Towards a Unified PageRank for DBpedia and Wikidata

AIFB8 03.05.2023 Andreas Thalhammer - Towards a Unified PageRank for DBpedia and Wikidata

Questions?

[email protected] @thalhamm