metadata towards an e-research cyberinfrastructure
DESCRIPTION
Metadata towards an e-research cyberinfrastructure. The case of French ETDs. Summary. Introducing ARTIST and authors of this collective work Main actors operating on French ETDs Their roles; Their metadata 3 case studies Creating metadata: thinking about reusability - PowerPoint PPT PresentationTRANSCRIPT
Introduction DC 2006 1
Metadata towards an e-research
cyberinfrastructure
The case of French ETDs
Introduction DC 2006 2
Summary Introducing ARTIST and authors of this collective work
Main actors operating on French ETDs
Their roles; Their metadata
3 case studies
Creating metadata: thinking about reusability
Thematic survey around biodiversity: thinking vocabularies
Institutional survey: thinking ontologies
Conclusion
Introduction DC 2006 3
Authors Jacques DUCLOY – INIST, ARTIST Yann Nicolas – ABES Diane Le Hénaff - INRA Muriel FOULONNEAU – now CCSD Luc GRIVEL – Univ. Paris 1 Jean-Paul DUCASSE – Univ. Lyon 2 + several little interventions and
comments
Introduction DC 2006 4
ARTIST members Networked workshop Appropriation
by Research Communities of Technologies of Scientific & Technical Information
Community of: Researchers & Engineers Information Science & Information Technologies working in research world
http://artist.inist.fr/
Introduction DC 2006 5
ARTIST topics How to build
Digital Library or scholarly publishing applications dealing with e-science?
New approaches to make research or science in a cyberinfrastructure
Introduction DC 2006 6
ARTIST e-Science experimentations Scientific forum
Sample: cooperative linguistic discussion Carl Lagoze paper about DL
What is a Digital Library ? Scientific journal: AMETIST
Appropriation, Mutualisation, Experimentation Digital writing
Richer on-line version Experience becomes “reproductible”
Paper view Scientific focus and evaluation purpose
Digital Library experimentations (metadata -> DL) Cooperative writing: this article
Introduction DC 2006 7
This article: metadata for e-science
Not only for: (scientific) information retrievalBut also for: + research evaluation + federative digital libraries + research policy oriented studies + scientific surveys
Introduction DC 2006 8
main entities dealing with French ETDs Universities
And their national organization EPST (National Research Institute)
Sample CNRS : 30,000 people European framework
Each country gets its own organization + European actions (Delos, DRIVER…)
Francophone framework International framework
Networked Digital Library of Theses and Dissertations (NDLTD),
ePrints application profile
Introduction DC 2006 9
Translation difficulties showing different approaches Thèse = ETD
Thesis (English context) Dissertation (US context)
Veille scientifique Scientific survey (using informetric tools) In order to discover innovations
Pilotage de la recherche research policy oriented studies Strong role of French administration
Actors DC 2006 10
Actor: Cyberthèses
CyberdocsOpenOffice + XSLT
Word + style
Xml / TEI
Jury
ETD
Actors DC 2006 11
Cyberthèses metadata Thesis document: Xml, TEI-Lite
Xml version must be readable by a human being
Metadata: DC ETD-ms (Electronic Thesis & Dissertation Metadata
Standard) Further related axis:
TEI header, Latex -> MathML
Actors DC 2006 12
Actor: ABESMinistry of Education
Agency
Star
University: ETD
Union catalog
Persistent Identifier
Sudoc portal
Preservation
Dissemination(CINES)
Introduction DC 2006 13
Abes / Star metadata TEF (Thèses Electroniques Françaises) AFNOR standard (French member of
ISO) Dublin Core Qualified With several ETD adaptations (jury…) METS Rights Using Schematron
Actors DC 2006 14
Actor: CCSD Centre pour la Communication Scientifique
Directe
HAL: NationalArchive
InternationalArchive (Arxiv, Driver…)
LocalArchive
ThematicArchive
TEL …Inserm
Researcher author
Researcher reader
Introduction DC 2006 15
CCSD metadata At the beginning: local schema Strong relationship between:
Author University, laboratory, research team
DC export / OAI - PMH
Introduction DC 2006 16
Hal: institutional repository A French advantage related to open
archive « Protocole d’accord Universités
EPST sur les dépots par les chercheurs » In CNRS each researcher must produce
an activity report which in generated by Hal/CCSD
Some scientific headers can request a CCSD deposit
Actors DC 2006 17
Actor: INIST/CNRS Institut de l’Information Scientifique et
Technique http://www.inist.fr
Pascal & Francis bibliographic data bases 15,000,000 XML records
Scientific Portals, Scientific information analysis Vocabularies: termSciences
Introduction DC 2006 18
INIST - metadata Bibliographic records
Exodic Origin: CCF based MARC format Translated in SGML in 92 now with a DCQ approach Strong links between authors and affiliations
termSciences ISO 16642 (TMF)
Actors DC 2006 19
Thesis
Cyberthèses
STAR
Local archive
CCSD
Articles
OAI-PMH
The landscape we would like to have
INIST
Introduction DC 2006 20
Case study: sharing theses and their metadata Thinking about reusability by several actors
which interoperate during ETD life cycle
Inra unit
Ecole doctorale
Inra
UniversityAbes/star
ETD
Univ. Lab.
Introduction DC 2006 21
Sharing metadata Administrative metadata are
requested for a quite complex workflow
Contents must be matched A given person could have different
names… Different ways of naming units… Different classification schemes…
Introduction DC 2006 22
Case study 2: BiodivERsA BiodivERsA: European research policy
network about biodiversity x * 10 funding agencies,
y*100 research program z * 1000 projects x1 * 10000 results …
distributed network of CRIS CRIS: Current Research Information System
Introduction DC 2006 23
Vocabulary adaptations
CRIS…
Archive
DL
Thematic CRIS
Thematic Archive
Thematic DL
Global DL
Classification schemas must be matched for computation purpose (funding evaluation)
Introduction DC 2006 24
Case study 3 Affiliation must be managed with ontologies
UHP CNRS INRIA
CRINLoria Inria Lorraine Inria Sophia
YT Cortex OmégaOrpailleur
Introduction DC 2006 25
A technical conclusion Metadata (DCQ) is good but not
sufficient We need
vocabulary adaptations, ontologies Sharing several repositories
(vocabularies, affiliations…) Managing metadata history etc
Introduction DC 2006 26
The very conclusion We need to help people working
altogether and doing compromises We need researcher becoming owners
of their scientific information system… We need librarians appropriating
technologies and helping researcher to appropriate librarian feeling
We need engineers in computer science appropriating library and edition issues
That is what we try to help to do with ARTIST
Introduction DC 2006 27
Thank you for your listening Thank you for your questions…