icpsr collection development: lessons learned & moving forward

28
ICPSR Collection Development: Lessons Learned & Moving Forward NDIIPP Partner Meeting Collecting Digital Content Going Forward: Lessons Learned and New Initiatives Washington, D.C. 10 July 2008

Upload: morty

Post on 22-Jan-2016

45 views

Category:

Documents


0 download

DESCRIPTION

ICPSR Collection Development: Lessons Learned & Moving Forward. NDIIPP Partner Meeting Collecting Digital Content Going Forward: Lessons Learned and New Initiatives Washington, D.C. 10 July 2008. Overview. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: ICPSR Collection Development: Lessons Learned & Moving Forward

ICPSR Collection Development: Lessons Learned & Moving Forward

NDIIPP Partner Meeting

Collecting Digital Content Going Forward:

Lessons Learned and New Initiatives

Washington, D.C.

10 July 2008

Page 2: ICPSR Collection Development: Lessons Learned & Moving Forward

Overview

1. We have been successful at preserving important social science data for decades.

2. Yet, important social science data have been lost despite our best efforts.

3. We want to leverage the processes we developed in the Data-PASS project to avoid future loss.

Page 3: ICPSR Collection Development: Lessons Learned & Moving Forward

Social Science Data Preservation Successes

Page 5: ICPSR Collection Development: Lessons Learned & Moving Forward

ICPSR’s Data-PASS Efforts

Page 6: ICPSR Collection Development: Lessons Learned & Moving Forward

Collection Development Successes

• High response rate (44%)

• Most PIs indicated that they wanted to be “Good Citizens” and help.

“This sounds like an exciting project.”

“I hope your project is successful because I think that it is important.”

“Good luck with the project.”

Page 7: ICPSR Collection Development: Lessons Learned & Moving Forward

Collection Development Successes

• Many of the data we have received are important mid-sized collections, which might not otherwise be collected.

• Around half of PIs still have access to data (49%)

Page 8: ICPSR Collection Development: Lessons Learned & Moving Forward

57 Commitments to Deposit• “U.S.-Sweden Cooperative Science:

Demographic Change in Nineteenth-Century Sweden” (Katherine Lynch, 1989, NSF #8912792)

• “Studies of the Economic Impacts of Global Environmental Change” (Nordhaus, 1991, NSF #9024860)

• “Social Support and Mental Health in a Black Community” (Dressler, 1980, NIH #1R01MH033943)

Page 9: ICPSR Collection Development: Lessons Learned & Moving Forward

Recent Acquisitions

• “Massachusetts Occupational Survey: 1953-1954” (Kahl)

• “Hurricane Andrew: Its Impact on Law and Social Control” (Akers)

• “Attitudes Toward Women and Work” (Huber)

Page 10: ICPSR Collection Development: Lessons Learned & Moving Forward

Challenges

• “Good Citizens” = high willingness but no time, money, or resources to submit to us.

Page 11: ICPSR Collection Development: Lessons Learned & Moving Forward

Challenges

• A good number (24%) of PIs no longer could access their data.

Page 12: ICPSR Collection Development: Lessons Learned & Moving Forward

Archival Status/Availability of Identified Data (n=1,668)

Page 13: ICPSR Collection Development: Lessons Learned & Moving Forward

Data Discarded“It was just too long ago, I generally keep data for something like 10

years beyond the last time I do something with them.”

“Data cards were destroyed long after several articles were published from them.”

“Destroyed, in accord with APA 5-year post-publication rule.”

Hardware Problems“Some data were collected, but the data file was lost in a technical

malfunction.”

“Electronic version was lost in a computer failure.”

Page 14: ICPSR Collection Development: Lessons Learned & Moving Forward

Destroyed for Confidentiality Reasons“The material…was considered sensitive data. Institutional review

boards, and the state registry board from which some data were derived required us to promise to destroy the data after a certain period of time, which was done a year or so after publication.”

“When the project ‘hibernated’ it was eventually decided to destroy the records in order to protect the identities of the subjects. I destroyed the records about a year ago when the building that housed them was torn down….Since I could no longer assure the confidentiality of the records, I destroyed them.”

Acts of Nature“The data from the studies were on punched cards that were destroyed

in a flood in the department in the early 80s.”

Page 15: ICPSR Collection Development: Lessons Learned & Moving Forward

Discarded or Lost in a Move“The data were on punched cards, which probably were discarded

(after a decade or more of non-use) in one of several office moves.”

“They were all published in journals, and when I moved from [University A] to [University B], I discarded them.”

“As I retired and gave up my office last year, I disposed of the data tape at that time. Unfortunately, I simply didn’t have the room to store these data sets at my house.”

“I lost track of them during several moves over the past 20 years.”

“I had the data (ASCII and SPSS system files) on old floppy discs and they seem to have been lost in a move.”

Page 16: ICPSR Collection Development: Lessons Learned & Moving Forward

Obsolescence

“….a good deal of…data from the study were also entered (sometime in the late 1970s) onto the [University’s] mainframe in an early version of SPSS. I sincerely doubt that they are still accessible in electronic form—is there even such a thing as a mainframe any more?”

“The data were backed up several times using different media, but eventually, the media became obsolete and I did not transfer it to newer technologies.”

“Speech recordings stored on a LISP Machine…, an experimental computer which is long obsolete.”

Page 17: ICPSR Collection Development: Lessons Learned & Moving Forward

Simply Lost

“This was a long time ago, university has changed its computer systems several times since. I am not able to locate the data.”

“I really cannot recall. It was decades ago, and I subsequently moved to new positions in various universities. The records of the responses obtained are forever lost.”

“I don’t honestly know where they would be now. For all I know, they are on a [University] server, but it has been literally years and years since the research was done, and my files are long gone.”

Page 18: ICPSR Collection Development: Lessons Learned & Moving Forward

Challenges

• Some PIs have data, but data are in obsolete formats, without sufficient documentation, or require massive clean-up.

Page 19: ICPSR Collection Development: Lessons Learned & Moving Forward

Proprietary Formats of Data Not Archived

0 50 100 150 200 250 300

Filemaker Pro

Matlab

PDF

Hard Copies

.WAV

Access

Stata

Word

ASCII text

SAS

Excel

SPSS

And Also… .aiff, ArcView (ESRI), ARLEQUIN, ATLAS.ti, AVI, Binary (UWAR format), BMPD, Brainvoyager, Bvh, Coldfusion database, .DAT, Dbase, DICOM, Digital Voice Files, .DSS, E-PRIME, EQS, EVIEWS workfile, FLASH, FSL files, GAUSS, HTML, iMovie, JPG, JMP databases, MAXQDA, Molecular workbench database, MP3, MySQL, NEXUS, .NSP, NVivo, Paradox data files, PHYLIP, QDS, REFLEX, Rich Text Format, SALT transcription format, Sawtooth WinCati, Sequencer, SQL, Statistica, Statmost, STRUCTURE, Systat, Teleform, Trimble Pathfinder spatial coordinate raw data files, TSP, Viso, WordPerfect. Last updated 8/17/2007

Page 20: ICPSR Collection Development: Lessons Learned & Moving Forward

Storage Media of Data Not Archived

0 50 100 150 200 250 300 350 400

DisksPunched CardsMainframe ComputerDVDVideo CassetteAudio TapeZip disksFlash DriveFloppy Disk (5 1/4", 3 1/2 ")ServerTapeHard CopyCDHard Drive

Last updated 8/17/2007

And Also… cassette tape, DAT tape, digital video tape, McBee cards, minidisks, mini digital video tapes, network drive, QIC-80 tape, reel-to-reel audio tape, reel-to-reel videotapes, Sun UNIX network, 16mm film.

Page 21: ICPSR Collection Development: Lessons Learned & Moving Forward

Collection Development Policy Update

Page 22: ICPSR Collection Development: Lessons Learned & Moving Forward

Specific Updates

• Regular harvests of NIH & NSF awards.

• Early, ongoing contact with PIs.

• Outreach & Education.

• Peer Network.

Page 23: ICPSR Collection Development: Lessons Learned & Moving Forward
Page 24: ICPSR Collection Development: Lessons Learned & Moving Forward
Page 25: ICPSR Collection Development: Lessons Learned & Moving Forward

Outreach & Education

Page 26: ICPSR Collection Development: Lessons Learned & Moving Forward

Peer Network

Page 27: ICPSR Collection Development: Lessons Learned & Moving Forward

Collection Development Policy Update

Page 28: ICPSR Collection Development: Lessons Learned & Moving Forward

Jared LyleICPSR

(734)[email protected]