the library behind the scene how does it work ? the library behind the scenes 1 jinr / cern grid and...

36
The Library behind the scene How does it work ? The Library behind the scenes 1 JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

Upload: blaise-wilson

Post on 17-Dec-2015

222 views

Category:

Documents


0 download

TRANSCRIPT

1

The Library behind the sceneHow does it work ?

The Library behind the scenes

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

2

Outline

• 1- Introduction – definitions and context• 2- Information systems in particle physics• 3- Standards• 4- Tools• 5- Conclusions and outlook

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

3

1- Introduction

Do you speak “Librarian”?

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

4

What’s a Library for you ?

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

5

What’s a Library for you ?

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

6

Some definitions

• Library (Oxford Reference Online)• A building or room containing collections of books,

periodicals, and sometimes films and recorded music for use or borrowing by the public or the members of an institution

• Digital Library (Wikipedia)– is a library in which collections are stored in digital formats

(as opposed to print, microform, or other media) and accessible by computers. The digital content may be stored locally, or accessed remotely via computer networks. A digital library is a type of information retrieval system.

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

7

Context

• Towards a digital world: print vs online– Traditional print/physical collections: • Books, journals, theses, reports, standards…• Physical item, description, location

– E-resources:• E-books, e-journals, multimedia (videos, photos…), e-

document…• File, description, link, content ++

Evolution of information retrieval

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

8

2. Information system at CERN and in particle physics

CDS and Inspire

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

9

Particle physics and CERN

• Particle physics– Aims to understand how the Universe works– Small but tightly organized worldwide community– Experimental vs theoretical

• CERN: Research Institute in Particle Physics– LHC (Large Hadron Collider)– 2500 staff + 10,000 users coming from everywhere in the world

• Need fast communication to distribute research result– Publication in journals, too long process– Preprint communication– From mail to arxiv.org

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

10

Open access

• Traditional publication model: – Subscription, purchase, controlled access

• Open Access: – Open Access (OA) literature is digital, online, free

of charge to the reader, and free of most copyright and licensing restrictions.

– Green OA, ex. Institutional repositories (CDS, JINR..)

– Gold OA, OA to published articles

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

11

CERN Scientific Information Service

• Mission– Provide information resources in ALL fields of

relevance to CERN– Ensure scientific information produced at CERN is

safeguarded and made publicly available.– Distribute CERN publications

• Audience– Particle physicists (from CERN and from outside),

Engineers, technicians, Computer scientists, Administrative staff

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

12

CERN Document server: institutional repository and Library catalogue

http://cdsweb.cern.chPowered by Invenio

CERN Library collections: (e)books, (e)journals, (e)standardsCERN Institutional repository: preprints, articles, theses (fulltext)…Multimedia collections. CERN publication

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

13

Inspire: HEP literature database

http://inspirehep.net/Powered by Invenio

Worldwilde repository:CERN, Fermilab, SLAC and Desy

All HEP literature (since 1960)

Citation extraction, author, affiliation analysis..

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

14

JINR Document server

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

15

Where do first search for information?

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

16

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

17

What system do your prefer?Arxiv 0804.2701v2, Gentil-Beccot et al.

2007 survey9% of HEP scholars use Google as preferred information system

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

18

Google generationArxiv 0804.2701v2, Gentil-Beccot et al.

What do you do when you are looking for an information?

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

19

Why?

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

20

2- Standards

Why we need them

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

21

Library catalogue

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

22

Library catalogue

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

23

Marc 21

• Metadata: data about data, record description• MARC: MAchine Readable Cataloguing,

international standard for representing and communicating bibliographic records, developed in the 60s

• MARC21: redesigned MARC for the 21st century– Is based on the ANSI standard Z39.2, which allows

users of different software products to communicate with each other and to exchange data.

• XML-MARC: XML schema based on MARC21

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

24

Marc 21

Author Title

Identifier

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

25

MARCxml

Title

Author

Identifier

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

26

Other standards

• Metadata: – MARC, Dublin Core, BibTex…

• Identifiers: – DOI, ISBN, Barcodes…

• Data exchange protocols:– Z39.50, OAI-PMH

• Full text and coding:– Xml, PDF, PDF/a…

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

27

Why is this important?

• Retrieve information– Identification and searchability

• Preservation– Ensure the information will be readable by

another machine / in 20 years time (?)• Interoperability / Information integration– Transfer (convert) data easily to another catalogue– Extract information and re-use it

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

28

4- Tools

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

29

A Library system

Bibliographic record

Physical Item

Electronic file

Borrower

LibrarianBibliographic record

Physical Item

Electronic file

Bibliographic record

Physical Item

Electronic file

BorrowerBorrower

Bibliographic record

Bibliographic record

Publisher Other Source

Loan

Create/edit

Ingest

Ingest

LIBRARYCATALOGUE

Access

Search / Find

Author

Submit

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

30

CERN Document server

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

31

Circulation and statistics

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

32

Record edition: BibEdit

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

33

Record edition: Multi-record editor

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

34

Records ingestion

Library catalogue

Conversion

Matching

-> New records-> Update records

MARCXML

XML

XMLMARCXML

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

35

• Importance of structured information:– Standards– Automatic procedures as much as possible

• Why? – Users find what they need (and even more)– In the digital era, new challenges and opportunities:

• Build new services on top of the catalogue • Integrate information resources• Communicate!

• More in the next session!

Conclusions and outlook

JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS

36

[email protected]

Спасибо!