bulgariana collectionsineuropeana 19032012
DESCRIPTION
This presentation describes Bulgariana, an aggregator for Europeana. It is also a community-building initiative that aims to coordinate activities in the digitization, preservation and presentation of cultural heritage in Bulgaria.
TRANSCRIPT
Bulgariana Collections in Europeana
Mariana Damova, PhD
Veliko Tyrnovo
March 2012
Ontotext Corp
• Who is Ontotext?
– The leading semantic technology company in Eastern Europe and one of the leaders worldwide
– 55 people: Bulgaria (Sofia, Varna), Austria, USA
– Worked in this area since 2000
– Venture funding and commercial clients since 2008
– Bulgaria's most successful participant in EU FP5, FP6 and FP7 research
– www.ontotext.com
• 360-degree semantic technology:
– Semantic Repository (OWLIM), ETL
– Text Mining, Semantic Annotation and Search (KIM, U.Sheffield GATE, Teamware, MIMIR)
– Web Mining and Crawling
– Ontology Engineering and Exploitation
– Master Data and Linked Data Management
2
Ontotext Clients (selected)
• British Broadcasting Corporation (BBC)
– Runs its World Cup 2010 sites on top of OWLIM
– Next is BBC Sports (2011) and the 2012 Olympics
• The National Archives (UK): the UK Government’s official archive contracted Ontotext to implement semantic search for the Government Web Archive
• British Museum (UK): the ResearchSpace project, funded by the Andrew W. Mellon Foundation, supports collaborative web-based research and information sharing for the cultural heritage scholarly community; the consortium is led by Ontotext
• LODAC (Linked Open Data in Academia), at Japan’s National Institute of Informatics: aggregates various information across multiple Japanese resources as LOD
• The Polish Digital National Museum aggregates artifacts from cultural institutions in the Digital Libraries Federation PIONIER Network: over 70 contributing institutions including universities, libraries, museums, archives, research.
• The Gothenburg City Museum provided close to 9K museum objects from two collections to build a use case within the MOLTO FP7 project for a knowledge representation infrastructure that allows querying RDF and presenting RDF results in natural language.
• Bibliothek, The Hague, aggregation of data from 150 library databases
3
Outline
• Europeana
• bulgariana.eu
• Collections
• Europeana Data Standards
• Metadata mapping, conversion and ingestion
• Digital repository
• Conclusion
4
Europeana
http://www.europeana.eu
• Launched in 2008
• Project funded by the European Commission
• Based in the National Library of the Netherlands, the Koninklijke Bibliotheek
• Goal: to make Europe's cultural and scientific heritage accessible to the public
• Over 180 heritage and knowledge organizations and IT experts across Europe
• Europeana Collection: 5M objects in 2009, 10M in 2010, 20M at present
• Endorsed by the European Parliament in 2010
• 2011: the "Comité des Sages" makes recommendations about Europeana: to put online the collections held by Europe's libraries, archives, museums and audiovisual archives – vast numbers of books and periodicals (there are some 2.5bn items in Europe's libraries alone), and millions of hours of film and video covering the whole of Europe's diverse history and culture.
5
Europeana
• Collection types: Image, Sound, Video, Text
• Present Europeana Architecture
[Diagram: system context of the present Europeana architecture, with the elements provider, ingestion, back office, SolrDB, Europeana portal and visitor]
• Europeana data standards
• Europeana aggregators (by country or cultural heritage sector)
• Process of ingesting content (4-6 weeks)
6
bulgariana.eu
7
bulgariana.eu
• Main purpose: Bulgarian (BG) aggregator for Europeana
• Secondary purpose: networking and special interest group for Bulgarian cultural heritage
8
Collections
9
Collections
Golden Pages from the Bulgarian Renaissance (Златни страници от Българското Възраждане)
Unique manuscripts of Bulgarian folk songs collected in the 19th century by the Miladinov Brothers, renowned Bulgarian folklorists; published in 2008 by Dr. Luchia Antonova, Institute of Bulgarian Language, Bulgarian Academy of Sciences
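МАРКО КРАЛЕВИКИ БОЛЕН СЕ КАИТ И СЕ ИСПОВЕДВИТ
Поболил се Марко Кралевике,
що си лежал токму три години,
от нищо се иляч (1) не на’ож’ал.
И му рече негва стара майќа:
“Ай ти, Марко, ай ти, синко милий;
не си болен, синко, от господа,
тук си болен, синко, от гре’о’и,
да ти викна попой (2), ду’овници,
лепо да се синко исповедиш,
да си кажиш твоите гре’о’и!”….
English translation:
MARKO KRALEVIKI, ILL, REPENTS AND CONFESSES
Marko Kralevike fell ill,
he lay for exactly three years,
and no cure (1) could be found in anything.
And his old mother said to him:
“Come, Marko, come, my dear son;
you are not ill, son, by God's will,
you are ill, son, from your sins,
let me call priests (2), confessors,
so that you may confess well, son,
and tell your sins!”….
10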
Collections
Prehistoric and Thracian Civilizations (Праисторическа и Тракийска цивилизация)
Unpublished Thracian archeological objects collected by Prof. Valeria Fol, Center of Thracology at the Institute for Balkan Studies at the Bulgarian Academy of Sciences
11
Europeana Data Standards
12
Europeana Data Standards
• Unified metadata
• ESE – Europeana Semantic Elements
• Dublin Core & Europeana fields
• 36 fields: flat structure, limited ability to express semantic links

Dublin Core fields    Europeana fields
dc:title              europeana:provider
dc:creator            europeana:dataProvider
dc:subject            europeana:rights
dc:description        europeana:type
dc:publisher          europeana:isShownBy and/or europeana:isShownAt
…                     …

• EDM - Europeana Data Model
[Diagram: EDM basic data model; two contextual classes]
13
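To make the flat ESE structure concrete, here is a minimal sketch that serializes a handful of the fields named on this slide into one XML record. The field values, the example.org URL and the helper name build_ese_record are hypothetical, and the europeana:record wrapper element is an assumption about how a single record is wrapped; the two namespace URIs are the published Dublin Core and ESE namespaces.

```python
import xml.etree.ElementTree as ET

# Namespace URIs for Dublin Core elements and Europeana Semantic Elements.
DC_NS = "http://purl.org/dc/elements/1.1/"
ESE_NS = "http://www.europeana.eu/schemas/ese/"
NAMESPACES = {"dc": DC_NS, "europeana": ESE_NS}

# Register prefixes so the output uses dc: and europeana: instead of ns0:, ns1:.
for prefix, uri in NAMESPACES.items():
    ET.register_namespace(prefix, uri)


def build_ese_record(fields):
    """Build a flat record from (prefixed field name, value) pairs.

    The wrapper element name "record" is an assumption for this sketch.
    An unknown prefix raises KeyError.
    """
    record = ET.Element(f"{{{ESE_NS}}}record")
    for qualified_name, value in fields:
        prefix, local_name = qualified_name.split(":", 1)
        child = ET.SubElement(record, f"{{{NAMESPACES[prefix]}}}{local_name}")
        child.text = value
    return record


# Hypothetical values, for illustration only.
example_fields = [
    ("dc:title", "Golden Pages from the Bulgarian Renaissance"),
    ("dc:creator", "Miladinov Brothers"),
    ("europeana:provider", "Bulgariana"),
    ("europeana:type", "TEXT"),
    ("europeana:isShownAt", "http://example.org/object/1"),
]

record_xml = ET.tostring(build_ese_record(example_fields), encoding="unicode")
print(record_xml)
```

Every field is a direct child of the record with a plain text value, which is what "flat" means here: there is no place to link a creator or subject to a richer description of its own.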
Metadata mapping, conversion and ingestion
14
Metadata conversion
DELVING tool used to convert entries into ESE
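The conversion step can be pictured as a field-by-field mapping from a provider's own record layout to ESE field names. The sketch below is not the Delving tool and does not use its API; it is a plain-Python illustration, and the source field names (title, author, keywords, summary, object_url), the constant values and the function name to_ese are all hypothetical.

```python
# Hypothetical mapping from a provider's local field names to ESE field names.
FIELD_MAP = {
    "title": "dc:title",
    "author": "dc:creator",
    "keywords": "dc:subject",
    "summary": "dc:description",
    "object_url": "europeana:isShownAt",
}

# Values that are the same for every record in a collection (assumed here).
CONSTANTS = {
    "europeana:provider": "Bulgariana",
    "europeana:type": "TEXT",
}


def to_ese(source, field_map=FIELD_MAP, constants=CONSTANTS):
    """Convert one source record (a dict) into a dict of ESE field -> list of values."""
    ese = {}
    for source_name, ese_name in field_map.items():
        value = source.get(source_name)
        if value is None:
            continue
        # Accept either a single value or a list of values for repeatable fields.
        values = value if isinstance(value, list) else [value]
        cleaned = [str(v).strip() for v in values if str(v).strip()]
        if cleaned:
            ese.setdefault(ese_name, []).extend(cleaned)
    for ese_name, value in constants.items():
        # Only fill in a constant when the source did not supply that field.
        ese.setdefault(ese_name, [value])
    return ese


# Hypothetical source record, for illustration only.
source_record = {
    "title": " Marko Kralevike ",
    "keywords": ["folk songs", "manuscripts", ""],
    "summary": "",
    "object_url": "http://example.org/object/1",
}
ese_record = to_ese(source_record)
```

Empty values are dropped rather than emitted as empty ESE fields, and repeatable fields such as dc:subject keep all of their values.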
15
Europeana ingestion – OAI-PMH
16
Metadata ingestion
• administrative steps
• OAI-PMH access provided to Europeana
• ingestion tests
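As a rough illustration of what OAI-PMH access involves on the harvesting side, the sketch below builds ListRecords request URLs and reads record identifiers and the resumption token out of a response. The base URL, the metadata prefix "ese", the set name and the sample response are assumptions for illustration; the verb, the argument names and the response namespace come from the OAI-PMH 2.0 protocol.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

OAI_NS = "http://www.openarchives.org/OAI/2.0/"


def list_records_url(base_url, metadata_prefix=None, set_spec=None, resumption_token=None):
    """Build a ListRecords request URL.

    In OAI-PMH, resumptionToken is an exclusive argument: when it is present,
    metadataPrefix and set must not be sent again.
    """
    if resumption_token:
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
        if set_spec:
            params["set"] = set_spec
    return f"{base_url}?{urlencode(params)}"


def parse_list_records(xml_text):
    """Return (record identifiers, resumption token or None) from a response."""
    root = ET.fromstring(xml_text)
    ns = {"oai": OAI_NS}
    identifiers = [el.text for el in root.findall(".//oai:header/oai:identifier", ns)]
    token_el = root.find(".//oai:resumptionToken", ns)
    token = token_el.text.strip() if token_el is not None and token_el.text else None
    return identifiers, token


# Hypothetical, heavily shortened response used only to exercise the parser.
SAMPLE_RESPONSE = """
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record><header><identifier>oai:example.org:1</identifier></header></record>
    <record><header><identifier>oai:example.org:2</identifier></header></record>
    <resumptionToken>batch-2</resumptionToken>
  </ListRecords>
</OAI-PMH>
""".strip()

first_url = list_records_url("http://example.org/oai", metadata_prefix="ese", set_spec="2021501")
identifiers, token = parse_list_records(SAMPLE_RESPONSE)
next_url = list_records_url("http://example.org/oai", resumption_token=token)
```

A harvester repeats the request with each returned resumption token until a response arrives without one, which is how an ingestion test would walk through a whole collection.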
17
Digital Repository for Cultural Heritage
18
Digital Repository for Cultural Heritage
• Elaboration of the metadata properties in accordance with the content providers' requirements
• Migration of databases and digitized artifacts from available online resources and cultural bodies' collections
• Training of users to work with the admin panel of the digital repository – metadata input and editing, media file upload
• Publication of the digitized collections on the web – UI layer enabling rich visualization, various search options, browsing by thematic categories, etc.
developed by Sirma Media
19
Links
• http://bulgariana.eu
• http://bulgarianheritage.bulgariana.eu
• http://www.europeana.eu
– europeana_collectionName: 20215*
– for the individual sets use europeana_collectionName: 2021501* (or 2021502*)
• http://britishmuseum.ontotext.com
Sofia,
13
IMI
Review
20
Community Building
• Google group
– http://groups.google.com/group/cultural-heritage-digitalisation (35 members)
• Collaboration with IMI and UNIBIT
• Meeting in Sofia 30.01.2012 (75 participants)
• Intense networking as a result
• Broadcast at Bulgarian National Radio
• Working group for the Ministry of Culture
• Upcoming meeting in Veliko Tyrnovo 19.03.2012
• About 5 project ideas for the upcoming FP7 and PSP calls
21
Conclusion
Aggregator for Bulgarian Cultural Heritage to Europeana
22