shebanq gniezno

Data Archiving and Networked Services. SHEBANQ. INNET conference, Gniezno – 2013-09-07. Dirk Roorda - researcher @ DANS, TLA. System for HEBrew Text: ANnotations for Queries and Markup

Upload: dirk-roorda

Post on 26-Jun-2015


DESCRIPTION

Save queries as annotations. A method for the digital preservation of queries on a Hebrew Text database with linguistic information in it. These queries form the data for interpretations by biblical scholars. Sharing those queries as Open Annotation enables researchers to communicate their (intermediate) results.

TRANSCRIPT

Page 1: Shebanq gniezno

Data Archiving and Networked Services

SHEBANQ

INNET conference, Gniezno – 2013-09-07

Dirk Roorda - researcher @ DANS, TLA

System for HEBrew Text: ANnotations for Queries and Markup

Page 2: Shebanq gniezno

Overview

1. Context: text, data, research in Hebrew Bible

2. Problem: sharing the research process

3. Solution idea: queries as annotations

Project: CLARIN-NL: SHEBANQ

4. (A) Curation

5. (B) Demonstrator

Page 3: Shebanq gniezno

1 (of 5) Context

Text, data and research in the Hebrew Bible

Page 4: Shebanq gniezno

VU Amsterdam

Eep Talstra Centre for Bible and Computer

text + linguistic features => database

database + research questions => publications


Page 5: Shebanq gniezno

2 (of 5) Problem

Sharing the research process

Page 6: Shebanq gniezno

Lock-in

scholarly-bibles.com

Stuttgart Electronic Study Bible

⇒ massive dissemination

But

⇒ not the right dynamics for tool development

Page 7: Shebanq gniezno

Leiden: international workshop on biblical scholarship

Desiderata:

new tool development

text transmission (variants)

linguistic analysis (features)

even combined!

a short history: 2012

leiden lorentz

Page 8: Shebanq gniezno

Hebrew Text in the Archive

urn:nbn:nl:ui:13-ikjj-ek

Page 9: Shebanq gniezno

Hebrew Text in the Archive

urn:nbn:nl:ui:13-ikjj-ek

how can people annotate our work?

Page 10: Shebanq gniezno

Research Data Cycle

Page 11: Shebanq gniezno

Research Data Cycle

Text transmission, tradition, editorial processes

Free University, theology faculty, server department, WIVU project

NWO projects

NWO projects

religious communities

theol. scholars

theol. scholars

enlightened lay people

scholarly-bibles.com

Page 12: Shebanq gniezno

Research Data Cycle

Text transmission, tradition, editorial processes

Free University, theology faculty, server department, WIVU project

NWO projects

NWO projects

Research Data Archiving

DANS

religious communities

theol. scholars

theol. scholars

CLARIN SHEBANQ

linguists

Wider public: Annotation, Query Saving, via Linked Data

dig. hum

comp. hum

enlightened lay people

scholarly-bibles.com

Page 13: Shebanq gniezno

3 (of 5) Solution idea

Queries As Annotations

Page 14: Shebanq gniezno

queries-as-annotations

model | query | example

body | query instruction | SELECT ALL OBJECTS WHERE [Word FOCUS part_of_speech = verb AND lexeme = "שים"]

targets | query results in context | ו ישכם יעקב ב בקר ו יקח את ה אבן אשר שם מראשתיו ו ישם אתה מצבה ו יצק שמן על ראשה

annotation | published query | qu123 (just an identifier)

metadata | researcher, date created, date last run, research question | Janet Dyk; 2004-02-16; 2012-01-27; Can the verb שים have a double object? (article in Foundations for Syriac Lexicography)
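The four rows of this model (body, targets, annotation, metadata) can be sketched as a small data structure. The following is only an illustration in Python, not the SHEBANQ implementation: the class name `QueryAnnotation` and its field names are assumptions that mirror the rows of the slide, and the values are the example values shown on it.

```python
import json
from dataclasses import dataclass, field, asdict


@dataclass
class QueryAnnotation:
    """One published query, following the four rows of the slide (hypothetical class)."""
    identifier: str                                      # the annotation itself, e.g. "qu123"
    body: str                                            # the query instruction
    targets: list[str] = field(default_factory=list)     # query results in context
    metadata: dict[str, str] = field(default_factory=dict)


annotation = QueryAnnotation(
    identifier="qu123",
    body='SELECT ALL OBJECTS WHERE [Word FOCUS part_of_speech = verb AND lexeme = "שים"]',
    targets=[
        "ו ישכם יעקב ב בקר ו יקח את ה אבן אשר שם מראשתיו ו ישם אתה מצבה ו יצק שמן על ראשה",
    ],
    metadata={
        "researcher": "Janet Dyk",
        "date_created": "2004-02-16",
        "date_last_run": "2012-01-27",
        "research_question": "Can the verb שים have a double object?",
    },
)

# Serialise to JSON so the annotation can be stored or exchanged.
serialized = json.dumps(asdict(annotation), ensure_ascii=False, indent=2)
print(serialized)
```

Keeping the query text as the body and the matched passages as targets means the same record can later be mapped onto an annotation vocabulary such as Open Annotation without changing its content.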

Page 15: Shebanq gniezno

OpenAnnotation

openannotation.org

Page 16: Shebanq gniezno

provenance

Page 17: Shebanq gniezno

motivation

Page 18: Shebanq gniezno

demonstrator

datanetworkservice.nl/qaa

Page 19: Shebanq gniezno

demonstrator

datanetworkservice.nl/qaa

Page 20: Shebanq gniezno

demonstrator

datanetworkservice.nl/qaa

Page 21: Shebanq gniezno

demonstrator

datanetworkservice.nl/qaa

Page 22: Shebanq gniezno

demonstrator

Page 23: Shebanq gniezno

demonstrator

Page 24: Shebanq gniezno

demonstrator

Page 25: Shebanq gniezno

demonstrator

still missing:

saving queries

semantic-web enablement

sustainability

Page 26: Shebanq gniezno

4 (of 5) Project

CLARIN-NL: SHEBANQ: (A) Curation

Page 27: Shebanq gniezno

SHEBANQ

System for Hebrew Text: ANnotations for Queries

CLARIN-NL project

data curation: LAF

demonstrator: query saver

#!/etc bc

s/g$/q/

Page 28: Shebanq gniezno

Linguistic Annotation Framework

ISO 24612:2012

Nancy Ide, Laurent Romary

Page 29: Shebanq gniezno
Page 30: Shebanq gniezno
Page 31: Shebanq gniezno
Page 32: Shebanq gniezno
Page 33: Shebanq gniezno

feature definitions

Page 34: Shebanq gniezno

feature definitions

Page 35: Shebanq gniezno

TEI ISO-FS schema

Page 36: Shebanq gniezno

dcr:datcat on <fDecl> versus <f>

26,225,966 <f>s

2.5 GB redundant attribute material
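The two figures on this slide imply a rough per-element overhead. The following back-of-the-envelope check in Python assumes that the 2.5 GB is spread evenly over all <f> elements; the slide does not say whether GB means 10^9 or 2^30 bytes, so both readings are computed.

```python
F_ELEMENTS = 26_225_966   # number of <f> elements, from the slide
REDUNDANT_GB = 2.5        # redundant attribute material in GB, from the slide


def bytes_per_element(gigabytes: float, count: int, bytes_per_gb: int) -> float:
    """Average redundant bytes per element, assuming an even spread."""
    return gigabytes * bytes_per_gb / count


decimal_estimate = bytes_per_element(REDUNDANT_GB, F_ELEMENTS, 10**9)
binary_estimate = bytes_per_element(REDUNDANT_GB, F_ELEMENTS, 2**30)

print(f"{decimal_estimate:.1f} bytes per <f> (GB = 10^9 bytes)")
print(f"{binary_estimate:.1f} bytes per <f> (GB = 2^30 bytes)")
```

Under either reading the overhead is on the order of a hundred bytes per <f>, which is the cost of repeating an attribute on every <f> instead of stating it once on the corresponding <fDecl>.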

Page 37: Shebanq gniezno

5 (of 5) Project

CLARIN-NL: SHEBANQ: (B) Demonstrator

Page 38: Shebanq gniezno

select all objects where

[clause [phrase phrase_function = Objc [word FOCUS tense = infinitive_absolute] ]]

Execute

Query executed

Passage

בראשית ברא אלהים את השמים ואת הארץ׃

ויאמר חזקיהו מה אות כי אעלה בית יהוה׃

Controls

ויאמר חזקיהו מה אות כי אעלה בית יהוה׃

Gen 1:1

2Chron 3:4

Gen 1:1 בראשית ברא אלהים את השמים ואת הארץ׃

ויאמר חזקיהו מה אות כי אעלה בית יהוה׃

Text

1Sam 12:4

Ex 23:2

Query results

Prev ... Next

313 results

Executing query ...

view in context

Save this query

Name: valency ברא

Researcher: Oliver Glanz

Date created: 2013-08-25

Date last run: 2013-08-25

Project: Data and Tradition

Institute: VU/Eep Talstra Centre for Bible and Computer

Reason: irregular valency of ברא

Comments: needs to be combined with query on אלהים

Save | Publish | Cancel

Edit Query
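The query at the top of this mock-up nests three object blocks: a word inside a phrase inside a clause, with FOCUS marking the block whose matches are reported. As a rough sketch of that nesting, the Python below assembles the same query text from a small tree. The `Block` class and `select_all` helper are hypothetical names introduced for illustration; they only build a string and neither validate it against the MQL grammar nor talk to a database.

```python
from dataclasses import dataclass, field


@dataclass
class Block:
    """One object block: an object type, optional constraints, nested blocks."""
    object_type: str
    constraints: str = ""
    focus: bool = False
    children: list["Block"] = field(default_factory=list)

    def render(self) -> str:
        # Order inside the brackets: type, FOCUS marker, constraints, nested blocks.
        parts = [self.object_type]
        if self.focus:
            parts.append("FOCUS")
        if self.constraints:
            parts.append(self.constraints)
        parts.extend(child.render() for child in self.children)
        return "[" + " ".join(parts) + "]"


def select_all(block: Block) -> str:
    """Prefix the rendered block tree with the select clause used on the slide."""
    return "select all objects where " + block.render()


query = select_all(
    Block("clause", children=[
        Block("phrase", "phrase_function = Objc", children=[
            Block("word", "tense = infinitive_absolute", focus=True),
        ]),
    ])
)
print(query)
```

Building the text from a tree keeps the brackets balanced by construction, which is the part of such queries that is easiest to get wrong by hand.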

Page 39: Shebanq gniezno

Passage

בראשית ברא אלהים את השמים ואת הארץ׃

ויאמר חזקיהו מה אות כי אעלה בית יהוה׃

Controls

ויאמר חזקיהו מה אות כי אעלה בית יהוה׃

Gen 1:1

2Chron 3:4

Gen 1:1 בראשית ברא אלהים את השמים ואת הארץ׃

ויאמר חזקיהו מה אות כי אעלה בית יהוה׃

Text

1Sam 12:4

Ex 23:2

Saved Query Results

Prev ... Next

313 results

view in context

Information on this query

Name: valency ברא

Researcher: Oliver Glanz

Date created: 2013-08-25

Date last run: 2013-08-25

Project: Data and Tradition

Institute: VU/Eep Talstra Centre for Bible and Computer

Reason: irregular valency of ברא

Comments: needs to be combined with query on אלהים

Query Info

MQL query text:

select all objects where

[clause [phrase phrase_function = Objc [word FOCUS tense = infinitive_absolute] ]]

Persistent Identifier: urn:nbn:nl:ui:13-scpm-ji

http://www.persistent-identifier.nl/?identifier=urn...
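A saved query is cited through its persistent identifier, which on this slide is a URN. As a small sketch, the Python below splits such an identifier into the two parts defined by the generic URN syntax, the namespace identifier (NID) and the namespace-specific string (NSS). The `parse_urn` helper is a hypothetical name; it deliberately leaves the NSS uninterpreted and does not build or contact any resolver URL.

```python
from typing import NamedTuple


class Urn(NamedTuple):
    nid: str   # namespace identifier, e.g. "nbn"
    nss: str   # namespace-specific string, left uninterpreted here


def parse_urn(text: str) -> Urn:
    """Split 'urn:<NID>:<NSS>' into its two parts; raise ValueError otherwise."""
    parts = text.split(":", 2)
    if len(parts) != 3 or parts[0].lower() != "urn" or not parts[1] or not parts[2]:
        raise ValueError(f"not a URN: {text!r}")
    return Urn(nid=parts[1].lower(), nss=parts[2])


# The persistent identifier of the saved query shown on the slide.
query_id = parse_urn("urn:nbn:nl:ui:13-scpm-ji")
print(query_id.nid, query_id.nss)
```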

Page 40: Shebanq gniezno

datanetworkservice.nl/qaa

Page 41: Shebanq gniezno

SHEBANQ: implementing Q-a-A

Page 42: Shebanq gniezno

the benefits of infrastructure: increasing involvement

Page 43: Shebanq gniezno

thank you


slideshare.net/dirkroorda/

s/g$/q/

#!/etc bc

Eep Talstra Centre for Bible and Computer