talk of europe – linking european parliament proceedings

Post on 24-Jun-2015

615 Views

Category:

Data & Analytics

0 Downloads

Preview:

Click to see full reader

DESCRIPTION

Talk held in DHBenelux 2014 by Astrid van Aggelen. Presents the project Talk of Europe, which makes available the plenary debates in the European Parliament, including all translations, as linked open data. Focus of the talk is on the data sources used and how this information is modelled, as well as the possibilities of the resulting RDF dataset in humanities research.

TRANSCRIPT

Talk of EuropeLinking European Parliament Proceedings

Astrid van Aggelen - VU University AmsterdamMax Kemman (@MaxJ_K) - Erasmus University Rotterdam

About the project

● Astrid van Aggelen● Laura Hollink

● Max Kemman● Martijn Kleppe● Henri Beunders

● Marnix van Berchum

● Johan Oomen● Jaap Blom

● Steven Krauwer● Jan Odijk

● 2014-2015● Funded by CLARIN-NL & CLARIN

ERIC

Primary goals

plenary sessions 1996 - (present)● Represent in Resource Description

Framework

Primary goals

plenary sessions 1996 - (present)● Publish as linked open data

Primary goals

plenary sessions 1996 - (present)● Promote

applications

The European Parliament

sessionsessionDay

agendaItemspeech

The European Parliament

Plenary sessions are NOT

● strictly role-based● mirror of law-making● interactive

Datasets

1. Europarl debate registrydate, debates, speakers, speeches

who said what in which debate on which day?

Data model

Data model

Enriching the data (1)

?What is a member’s political background?

Datasets

● Europarl debate registrydebates, speakers, speeches

● Europarl MEP databaseparties, committee, country, delegation

Data model

Data model

Data model

Enriching the data (2)

How to categorise debates?

?

Enriching the data (2)● Foreign Affairs● Human Rights● Security and Defence● Development● International Trade● Budgets● Budgetary Control● Economic and Monetary Affairs● Employment and Social Affairs● Environment, Public Health and Food Safety● Industry, Research and Energy● Internal Market and Consumer Protection● Transport and Tourism● Regional Development● Agriculture and Rural Development● Fisheries● Culture and Education● Legal Affairs● Civil Liberties, Justice and Home Affairs● Constitutional Affairs● Women's Rights and Gender Equality● Petitions

Datasets

● Europarl debate registrydebates, speakers, speeches, texts*

● Europarl MEP databaseparty, committee, country, delegation

● Europarl report registrycommittee / theme

Data model

Data model

Enriching the data (3)

In which role is this person speaking?

Enriching the data (3)

Heuristic processing!

Ideas for applications

● data enrichment:geographical dataset, encyclopedia, voting info (Eur-Lex)

● applications: topic preference of speakers by country / geography / party

cross-lingual language use

sentiment analysis

Creative camp

● Bring together developers and academic researchers from across Europe

● Promoting inventive use of the EP dataset, exploiting web and natural language processing techniques to add new knowledge and functionality to the dataset

● 6-10 October 2014 at NISV (Hilversum, The Netherlands)

● Submissions due: Friday 20 June

www.talkofeurope.eu/cfp

More info

General info: www.talkofeurope.eu

Creative camp: www.talkofeurope.eu/cfp/

Astrid a.e.van.aggelen@vu.nl

Max kemman@eshcc.eur.nl / @MaxJ_K

top related