Work Package 3 - Month 6 by Christian Morbidoni

DM2E PROJECT - WP3 M6 - REPORT. WP Leader: Christian Morbidoni - Net7. Thursday, 21 June 2012


Page 1: Work Package 3 - Month 6 by Christian Morbidoni

DM2E PROJECT - WP3 M6 - REPORT

WP Leader: Christian Morbidoni - Net7

Thursday, 21 June 2012

Page 2: Work Package 3 - Month 6 by Christian Morbidoni

WP3 TASKS

• Task T3.1 Initial functional specifications of the prototyping platform (Net7, UBER, EAJC, MPIWG, ÖNB (task leader))

• Task T3.2 Building of the prototype platform (Net7 (task leader), MPIWG)

• Task T3.3 Tutorials and Documentation (Net7 (task leader), MPIWG)

• Task T3.4 Background research on Scholarly Primitives (UBER (task leader), MPIWG, ÖNB, UiB)


Page 3: Work Package 3 - Month 6 by Christian Morbidoni

PARTNERS AND EFFORT

• UBER 14 PM Task 3.1 - 3.3

• EAJC 3 PM Task 3.1

• MPIWG 12 PM Task 3.1 - 3.2 - 3.3 - 3.4

• Net7 22 PM Task 3.1 - 3.2 - 3.3

• OKFN 8 PM Task 3.1 - 3.2 - 3.3

• ÖNB 12 PM Task 3.1 - 3.4

• UiB 6 PM Task 3.1 - 3.4


Page 4: Work Package 3 - Month 6 by Christian Morbidoni

TIMELINE (M0 to M36)

• D3.1 - M6: Initial Specification Report for the platform (LEAD: ONB)

• D3.2 - M11: Prototyping Platform Implemented (LEAD: NET7)

• D3.3 - M24: E-Learning Courses published (LEAD: NET7)

• D3.4 - M3: Research Report on DH Scholarly Primitives (LEAD: UBER)

• MS2 - M18: Draft E-Learning Courses (LEAD: NET7)

• MS2 - M24: Intermediary research Report on DH Scholarly Primitives (LEAD: UBER)


Page 5: Work Package 3 - Month 6 by Christian Morbidoni


Page 6: Work Package 3 - Month 6 by Christian Morbidoni

STATUS: TASK 3.1

• Task T3.1 Initial functional specifications of the prototyping platform (Net7, UBER, EAJC, MPIWG, ÖNB (task leader))

• Small deviation from the DoW:

• Discussed during the kick-off (not formalized until now: my fault)

• From the DoW: “The requirements will be collected by organizing a workshop jointly with WP1 (ONB/EAJC) and by submitting online questionnaires to the scholarly community (UBER/OKF)”

• Due to time constraints, development of the prototype has started based on the draft specifications posted in the wiki: https://dm2e.hu-berlin.de/redmine/projects/wp3/wiki/Prototyping_platform

• The goal has been:

• to reach a first draft prototype by M6 (July)

• to stabilize features and possibly add new ones by M11 (December)


Page 7: Work Package 3 - Month 6 by Christian Morbidoni

DELIVERABLE 3.1

• Due for M6 - End of July (?)

• Proposal

• Demonstration of the current state of the prototype + Discussion and validation of results (today 15:45 – 16:45 session or tomorrow 09:00 – 10:30 session)

• List possible additional features to be implemented (Net7 + ONB + EAJC)

• Input from the DHAB meeting and from this project meeting

• Assign priorities to features via an online poll

• Net7 to select high priority features feasible within month 11

• Remaining features as input for the challenges in WP4, to be built on top of the prototyping platform


Page 8: Work Package 3 - Month 6 by Christian Morbidoni

TASK 3.2

• Task T3.2 Building of the prototype platform (Net7 (task leader), MPIWG)

• Korbo platform first version online at http://korbo.org

• To test and provide feedback, please request a login from Romeo Zitarosa [email protected]

• NOTE: it is very much an alpha: ugly GUI :-) and under continuous development

• Pundit annotation tool (http://thepund.it) enhanced and integrated with Korbo

• Screencasts shown at the DHAB:

• Korbo: http://youtu.be/evyhX89oL-Y

• Korbo + Pundit: http://youtu.be/NUIhhduKJP8


Page 9: Work Package 3 - Month 6 by Christian Morbidoni

TASK 3.2

[Figure: Korbo architecture diagram. Labels recovered from the figure:]

• Search drivers: Search SPARQL endpoint; Search Muruca DLs

• Data import and transformation drivers: Standard Linked Data Import; Europeana Data Model Import

• Data storage layer: Baskets; SPARQL Endpoints; SYNC

• Augmentation tools connectors: Pundit connector; Others (to come)

• Interfaces: LOD Interface (based on ELDA Linked Data API implementation); Korbo REST API

• Metadata providers / Digital Content providers: Search items; Get metadata; Get digital content

• Augmentation tools (e.g. Pundit): Instantiate / configure; Store augmentations; Others (to come)

• Linked Data visualization/browsing tools (e.g. Lodlive.it): Get RDF representations; SPARQL queries


Page 10: Work Package 3 - Month 6 by Christian Morbidoni

STATUS: TASK 3.1

• Task T3.1 Initial functional specifications of the prototyping platform (Net7, UBER, EAJC, MPIWG, ÖNB (task leader))

• A questionnaire gathering the technical specifications needed to make content and metadata usable in Korbo has been submitted to content partners (ÖNB)

• A first draft of the requirements that provided content must meet to be usable in Korbo is in the wiki

• Some feedback received


Page 11: Work Package 3 - Month 6 by Christian Morbidoni

CONTENT QUESTIONNAIRES

Columns: Imgs / Text / HTML pages, Digital Library

• HUB
  • Imgs: Format: PNG. Stable URL for plain imgs: ???
  • Text: Format: TEI 5. Stable URL for plain text: ???
  • HTML pages / Digital Library: Stable URL: YES, http://dingler.culture.hu-berlin.de/

• JDC
  • Imgs: Format: PDF. Stable URL for plain imgs: ???
  • Text: Format: PDF, XML. Stable URL for plain text: ???
  • HTML pages / Digital Library: Stable URL: ???, http://search.archives.jdc.org/

• MPIWG
  • Imgs: Format: JPG. Stable URL for plain imgs: YES?
  • Text: Format: Plain text. Stable URL for plain text: NO?
  • HTML pages / Digital Library: Stable URL: YES, http://echo.mpiwg-berlin.mpg.de

• ONB ABO
  • Imgs: Format: JPG / JPG 2000. Stable URL for plain imgs: ?
  • Text: Format: METS / HOCR. Stable URL for plain text: ?
  • HTML pages / Digital Library: Stable URL: ?

• ONB CAB
  • Imgs: Format: JPG (Thumb only?). Stable URL for plain imgs: YES?
  • HTML pages / Digital Library: Stable URL: YES, http://aleph.onb.ac.at/F?func=file&file_name=login&local_base=ONB06

• SBB
  • Imgs: Format: JPG. Stable URL for plain imgs: YES?
  • HTML pages / Digital Library: Stable URL: YES, http://digital-b.staatsbibliothek-berlin.de/digitale_bibliothek/index.html

• UIB
  • Imgs: Format: JPG. Stable URL for plain imgs: YES
  • Text: Format: Plain text. Stable URL for plain text: YES
  • HTML pages / Digital Library: Stable URL: YES, http://wab.uib.no/wab_hw.page/ and http://www.wittgensteinsource.org/

• UBFFM
  • Imgs: Format: JPG. Stable URL for plain imgs: YES?
  • HTML pages / Digital Library: Stable URL: YES, http://sammlungen.ub.uni-frankfurt.de/mshebr/nav/index/all


Page 12: Work Package 3 - Month 6 by Christian Morbidoni

MINIMAL REQUIREMENTS FOR CONTENT PROVIDERS

• From the questionnaire:

• A simple REST API for getting digital objects. Requirements: a stable URL for each digital object; support for standard HTTP methods (such as GET and HEAD) and headers (such as last-modified, content-length, content-type, content-encoding).

• Feasible for almost all partners (except ONB)

• Minimal requirements:

• Plain images available from a stable URL (no application or HTML, just the plain image)

• Plain text (XML, txt) available from a stable URL

• Such URLs have to be included in the EDM via the edm:object property
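
A provider could check its own endpoints against the header list above with a few lines of Python. This is only a sketch under stated assumptions: the function names `missing_headers` and `head` are hypothetical, and treating all four headers as "expected" is an illustration, not a rule taken from the questionnaire.

```python
import urllib.request
from typing import Mapping

# Headers named in the questionnaire, lower-cased for case-insensitive matching.
EXPECTED_HEADERS = (
    "last-modified",
    "content-length",
    "content-type",
    "content-encoding",
)


def missing_headers(headers: Mapping[str, str]) -> list[str]:
    """Return the expected header names that are absent from a response.

    Note: servers only send Content-Encoding when the body is actually
    encoded (e.g. gzip), so its absence is not necessarily a problem.
    """
    present = {name.lower() for name in headers}
    return [name for name in EXPECTED_HEADERS if name not in present]


def head(url: str, timeout: float = 10.0) -> dict[str, str]:
    """Issue an HTTP HEAD request and return the response headers."""
    request = urllib.request.Request(url, method="HEAD")
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return dict(response.headers.items())
```

For a real object, something like `missing_headers(head(url))` would list what the server does not yet expose, where `url` is whatever stable URL the provider publishes via edm:object.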


Page 13: Work Package 3 - Month 6 by Christian Morbidoni

MINIMAL REQUIREMENTS FOR CONTENT PROVIDERS

• Annotable HTML chunks available at a stable URL

• Example HTML chunk:

<html>
  <body>
    <div class="pundit-content" about="http://example.org/21345">
      <!-- arbitrary HTML, possibly simple markup, no JavaScript -->
      <img src="http://example.org/imgs/21345.jpg"/>
      <p>caption, transcription, whatever, bla bla</p>
    </div>
  </body>
</html>

• The value of the about attribute (e.g. http://example.org/21345) should resolve to the HTML chunk itself.

• The EDM should contain a triple like the following:

• ex:proxy/provider/12334 <http://purl.org/net7/korbo/vocab#hasAnnotableVersionAt> <http://example.org/12345>

• Is it really necessary?

• If it can’t be done, we will live without it

• However, it would bring benefits
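
The link between the HTML chunk and the EDM triple on this slide can be sketched in Python using only the standard library. The sketch assumes the chunk is marked with class "pundit-content" and an about attribute, as in the example; the helper names and the record URI (an expansion of the ex: prefix to http://example.org/) are hypothetical.

```python
from html.parser import HTMLParser

# Property URI taken from the slide.
KORBO_PROPERTY = "http://purl.org/net7/korbo/vocab#hasAnnotableVersionAt"


class PunditContentFinder(HTMLParser):
    """Collect the about URLs of elements carrying class "pundit-content"."""

    def __init__(self) -> None:
        super().__init__()
        self.about_urls: list[str] = []

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        classes = (attributes.get("class") or "").split()
        if "pundit-content" in classes and attributes.get("about"):
            self.about_urls.append(attributes["about"])


def find_annotable_urls(html: str) -> list[str]:
    """Return every about URL declared on a pundit-content element."""
    finder = PunditContentFinder()
    finder.feed(html)
    finder.close()
    return finder.about_urls


def annotable_triple(record_uri: str, chunk_url: str) -> str:
    """Format the EDM link to the annotable version as one N-Triples line."""
    return f"<{record_uri}> <{KORBO_PROPERTY}> <{chunk_url}> ."


CHUNK = """<html><body>
<div class="pundit-content" about="http://example.org/21345">
<img src="http://example.org/imgs/21345.jpg"/>
<p>caption, transcription, whatever</p>
</div>
</body></html>"""

# Hypothetical record URI, used for illustration only.
RECORD_URI = "http://example.org/proxy/provider/12334"

for url in find_annotable_urls(CHUNK):
    print(annotable_triple(RECORD_URI, url))
```

Deriving the triple's object from the about attribute keeps the two URLs identical, which is what lets the URL in the EDM resolve to the chunk itself.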


Page 14: Work Package 3 - Month 6 by Christian Morbidoni

MINIMAL REQUIREMENTS FOR CONTENT PROVIDERS


THIS IS TO BE DISCUSSED: NOT DEFINITIVE


Page 15: Work Package 3 - Month 6 by Christian Morbidoni

TASK 3.3

• Next steps

• Stabilize current version

• Integration of MPIWG tools as add-on/REST services

• In addition, work will be done on an experimental connection between Europeana and the ECHO collaborative research platform developed by MPIWG

• Open question:

• How are different versions of the same object modelled in the EDM produced in WP2?

• This might be relevant to Korbo (Import and augmentation)
