Towards a Unified PageRank for DBpedia and Wikidata

Post on 15-Apr-2017


KIT – The Research University in the Helmholtz Association

INSTITUTE OF APPLIED INFORMATICS AND FORMAL DESCRIPTION METHODS (AIFB)

www.kit.edu

Artwork drawn by Felipe Micaroni Lalli (micaroni@gmail.com).

Towards a Unified PageRank for DBpedia and Wikidata
Andreas Thalhammer
7th DBpedia Community Meeting, 15.09.2016, Leipzig

AIFB2 03.05.2023

DBpedia (Wikipedia) PageRank

Available at http://people.aifb.kit.edu/ath/#DBpedia_PageRank
Computed since DBpedia 3.8.
Included in http://dbpedia.org/sparql since DBpedia 2015-04.

Computed:
Wikipedia link structure
Configuration: 40 iterations, damping factor 0.85, start value 0.1
Languages: en, es, de, fr, it, ru, zh

Why “DBpedia”? The link structure is extracted by the DBpedia Extraction Framework (page links dataset). The dataset is published with DBpedia IRIs.
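The configuration above (40 iterations, damping factor 0.85, start value 0.1) can be illustrated with a minimal sketch. The slides do not state which PageRank formula variant was used or how pages without outgoing links were handled, so the non-normalized update rule and the function name `pagerank` below are assumptions made for illustration only.

```python
from collections import defaultdict


def pagerank(links, iterations=40, damping=0.85, start_value=0.1):
    """Iterative PageRank over (source, target) page links.

    Assumed variant: PR(p) = (1 - d) + d * sum(PR(q) / outdegree(q)).
    Pages without outgoing links simply pass on no rank (an assumption).
    """
    nodes = set()
    out_degree = defaultdict(int)
    incoming = defaultdict(list)
    for source, target in links:
        nodes.update((source, target))
        out_degree[source] += 1
        incoming[target].append(source)

    # Every page starts with the same value, as in the slide's configuration.
    rank = {node: start_value for node in nodes}
    for _ in range(iterations):
        rank = {
            node: (1.0 - damping)
            + damping * sum(rank[src] / out_degree[src] for src in incoming[node])
            for node in nodes
        }
    return rank


# Toy link structure (hypothetical pages A, B, C).
toy_ranks = pagerank([("A", "B"), ("C", "B"), ("B", "A")])
```

In the toy graph, page C has no incoming links and therefore ends at the minimum score of 1 - 0.85, while B, which is linked from two pages, ranks highest.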

Andreas Thalhammer - Towards a Unified PageRank for DBpedia and Wikidata


Example: SPARQL result
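The query behind such a result is not preserved in this transcript. As a rough sketch, a ranked listing could be requested as shown here. The vRank namespace `http://purl.org/voc/vrank#`, the property path `vrank:hasRank/vrank:rankValue`, the choice of `dbo:University`, and the helper names are all assumptions for illustration and should be checked against the dataset documentation; graph selection is left out.

```python
from urllib.parse import urlencode

# Assumed vocabulary namespace for the rank values (not stated in the slides).
VRANK_NS = "http://purl.org/voc/vrank#"


def build_rank_query(dbo_class="University", limit=10):
    """Build a SPARQL query listing entities of a class by descending rank."""
    return (
        "PREFIX dbo: <http://dbpedia.org/ontology/>\n"
        f"PREFIX vrank: <{VRANK_NS}>\n"
        "SELECT ?entity ?rank WHERE {\n"
        f"  ?entity a dbo:{dbo_class} .\n"
        "  ?entity vrank:hasRank/vrank:rankValue ?rank .\n"
        "}\n"
        f"ORDER BY DESC(?rank) LIMIT {int(limit)}"
    )


def build_request_url(query, endpoint="http://dbpedia.org/sparql"):
    """Encode the query as an HTTP GET request against the endpoint."""
    params = {"query": query, "format": "application/sparql-results+json"}
    return endpoint + "?" + urlencode(params)


query = build_rank_query("University", limit=5)
url = build_request_url(query)
```

The sketch only constructs the request; sending it (for example with `urllib.request.urlopen(url)`) requires network access to the endpoint.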


Towards Wikidata PageRank

The DBpedia PageRank dataset has been published with Wikidata URIs since 2015-04.

Approach: Use English DBpedia PageRank and transform URIs.

Problems:
Only addresses 5,789,754 entities (Wikidata: 15,862,673).
Language-specific bias.
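The URI transformation and the resulting coverage gap can be sketched as follows. The mapping dictionary, the score values, and the function names are hypothetical; only the two entity counts are taken from the slide.

```python
def transform_to_wikidata(dbpedia_ranks, same_as):
    """Re-key PageRank scores from DBpedia IRIs to Wikidata URIs.

    Entities without a mapping are dropped, which is one reason the
    transformed dataset covers only part of Wikidata.
    """
    return {
        same_as[iri]: score
        for iri, score in dbpedia_ranks.items()
        if iri in same_as
    }


def coverage(ranked_entities, total_entities):
    """Fraction of all entities that received a rank."""
    return ranked_entities / total_entities


# Illustrative input: made-up scores and a hand-written mapping.
ranks = {
    "http://dbpedia.org/resource/Tim_Berners-Lee": 12.3,
    "http://dbpedia.org/resource/Unmapped_Page": 0.4,
}
mapping = {
    "http://dbpedia.org/resource/Tim_Berners-Lee":
        "http://www.wikidata.org/entity/Q80",
}
wikidata_ranks = transform_to_wikidata(ranks, mapping)

# Counts from the slide: roughly 36.5% of Wikidata entities are addressed.
english_only_coverage = coverage(5_789_754, 15_862_673)
```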


Towards Wikidata PageRank

DBpedia 2016-04 provides the page links dataset with Wikidata URIs for each language edition.
We merged the Wikidata-URI page links data of the ten biggest* language editions: en, es, fr, de, zh, ru, pt, it, ar, ja

Increased coverage: 10,364,840 (Wikidata 15,862,673).

Particularity: Now we have duplicate links.

Can be leveraged – Hypothesis: reduce language-specific bias.

* The Wikipedias with the most users that also have > 1M users.
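One way to picture the merge and its duplicate links is sketched here. The slides do not say how duplicates enter the actual computation, so treating the duplicate count as an edge weight is an assumption, and the function names and sample links are hypothetical.

```python
from collections import Counter


def merge_page_links(links_by_language):
    """Merge per-language (source, target) link lists, keeping duplicates.

    A link present in several language editions is counted once per
    edition, so its count reflects cross-language agreement.
    """
    merged = Counter()
    for links in links_by_language.values():
        merged.update(links)
    return merged


def transition_weights(merged):
    """Turn duplicate counts into each source's share of outgoing weight."""
    out_total = Counter()
    for (source, _target), count in merged.items():
        out_total[source] += count
    return {
        (source, target): count / out_total[source]
        for (source, target), count in merged.items()
    }


# Illustrative input using Wikidata identifiers as node names.
links_by_language = {
    "en": [("Q80", "Q466")],
    "de": [("Q80", "Q466"), ("Q80", "Q183")],
}
merged = merge_page_links(links_by_language)
weights = transition_weights(merged)
```

Under this assumption, a link that appears in both the en and de editions carries twice the weight of a link found in only one of them, which is how duplicates could dampen the influence of any single language edition.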


[Figure labels: TimBL, WWW, de, en]


Where would Wikidata PageRank be useful? (1)

(Cross-lingual) entity summarization:


Further information: http://km.aifb.kit.edu/services/link/


Where would Wikidata PageRank be useful? (2)


Questions?

andreas.thalhammer@kit.edu @thalhamm
