personal born-digital in what environments do we...

22
3/6/20 1 Archiving the Non-Organiza8onal Born- Digital: The Challenges Posed by Material from Individuals, Communi8es, & Events Howard Besser Moving Image Archiving & Preserva8on New York University hQp://besser.tsoa.nyu.edu/howard/Talks/ hQp://www.nyu.edu/8sch/preserva8on/ Besser-Berkeley 3/6/2020 1 Archiving the Non-Organiza8onal Born- Digital: The Challenges Posed by Material from Individuals, Communi8es, & Events Background & The Problem of Personal, Community, & Event- based Digital Archiving (for both text and image) The par8cular problem of Messaging The PDA Conferences My Projects that provided key concepts & Challenges InterPARES Preserving Digital Public Television Ac8vist Archivists & the Occupy Movement Important Lessons & Approaches Learned (From Ac8vist Archivists & Other projects) Other projects to Monitor Further Applied Research needed Besser-Berkeley 3/6/2020 2 Personal born-digital (from PDA Conferences) Correspondence/email Personal photos/movies and group collec8ons Manuscript dra\s, camera original footage, rough cuts Personal documents Diaries Home movies And has been extended to encompass: Family history Community/Ethnic history & Movements Genealogy Digital humani8es Besser-Berkeley 3/6/2020 3 In what environments do we find this type of material? Archives and Library Special Collec8ons Collec8ons documen8ng a community Collec8ons documen8ng an ethnic group Collec8ons documen8ng a social movement Collec8ons documen8ng the work of any other type of group (a group of Architects, a set of law-makers, etc.) Collec8ons documen8ng an event- Besser-Berkeley 3/6/2020 4 Documen8ng an Event Besser-Berkeley 3/6/2020 5 Documen8ng an Event Besser-Berkeley 3/6/2020 6

Upload: others

Post on 13-Aug-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Personal born-digital In what environments do we findbesser.tsoa.nyu.edu/howard/Talks/20buckland-pda.pdf · extend this censorship to totally non-poli8cal venues like this Bloomington,

3/6/20

1

ArchivingtheNon-Organiza8onalBorn-Digital:TheChallengesPosedby

MaterialfromIndividuals,Communi8es,&Events

HowardBesserMovingImageArchiving&Preserva8on

NewYorkUniversityhQp://besser.tsoa.nyu.edu/howard/Talks/hQp://www.nyu.edu/8sch/preserva8on/

Besser-Berkeley3/6/2020 1

ArchivingtheNon-Organiza8onalBorn-Digital:TheChallengesPosedbyMaterialfromIndividuals,Communi8es,&Events

•  Background&TheProblemofPersonal,Community,&Event-basedDigitalArchiving(forbothtextandimage)

•  Thepar8cularproblemofMessaging•  ThePDAConferences•  MyProjectsthatprovidedkeyconcepts&Challenges

–  InterPARES–  PreservingDigitalPublicTelevision–  Ac8vistArchivists&theOccupyMovement

•  ImportantLessons&ApproachesLearned(FromAc8vistArchivists&Otherprojects)

•  OtherprojectstoMonitor•  FurtherAppliedResearchneeded

Besser-Berkeley3/6/2020 2

Personalborn-digital(fromPDAConferences)

•  Correspondence/email•  Personalphotos/moviesandgroupcollec8ons•  Manuscriptdra\s,cameraoriginalfootage,roughcuts•  Personaldocuments•  Diaries•  HomemoviesAndhasbeenextendedtoencompass:•  Familyhistory•  Community/Ethnichistory&Movements•  Genealogy•  Digitalhumani8es

Besser-Berkeley3/6/2020 3

Inwhatenvironmentsdowefindthistypeofmaterial?

•  ArchivesandLibrarySpecialCollec8ons•  Collec8onsdocumen8ngacommunity•  Collec8onsdocumen8nganethnicgroup•  Collec8onsdocumen8ngasocialmovement•  Collec8onsdocumen8ngtheworkofanyothertypeofgroup(agroupofArchitects,asetoflaw-makers,etc.)

•  Collec8onsdocumen8nganevent-

Besser-Berkeley3/6/2020 4

Documen8nganEvent

Besser-Berkeley3/6/2020 5

Documen8nganEvent

Besser-Berkeley3/6/2020 6

Page 2: Personal born-digital In what environments do we findbesser.tsoa.nyu.edu/howard/Talks/20buckland-pda.pdf · extend this censorship to totally non-poli8cal venues like this Bloomington,

3/6/20

2

Documen8nganEvent

Besser-Berkeley3/6/2020 7

Well-funded“an8-racist”ac8vists,withbackingfrombillionaireJewsandgovernmentins8tu8ons,preventracially-awareWhitepeoplefromengaginginonlinecommerce.TheNa8onalAllianceandCosmotheistChurch,forexample,havebeeneffec8velybannedfromonlinecommercefornearlytwoyears.NowtheJewsandtheirpawnsaretryingtoextendthiscensorshiptototallynon-poli8calvenueslikethisBloomington,Indianafarmers’market.

https://nationalvanguard.org/2019/06/indiana-pro-white-farmers-threatened-persecuted-by-jewish-funded-leftists/

Documen8ngaCommunityGroup

Besser-Berkeley3/6/2020 8

Documen8ngaSocialMovement

Besser-Berkeley3/6/2020 9

Señal3LaVictoria(San8ago,CL)

10Besser-Berkeley3/6/2020

Señal3LaVictoria(San8ago,CL)

11Besser-Berkeley3/6/2020

Señal3LaVictoria(San8ago,CL)

12Besser-Berkeley3/6/2020

Page 3: Personal born-digital In what environments do we findbesser.tsoa.nyu.edu/howard/Talks/20buckland-pda.pdf · extend this censorship to totally non-poli8cal venues like this Bloomington,

3/6/20

3

Copwatch(DA?)

13Besser-Berkeley3/6/2020

ArchivingSocialMovementstoday(Video)hQps://www.ny8mes.com/video/world/asia/100000006970549/coronavirus-chinese-ci8zens.html

Besser-Berkeley3/6/2020 14

HomeVideoDay—NYCChinatown

15Besser-Berkeley3/6/2020

HomeVideoDay—NYCChinatown

16Besser-Berkeley3/6/2020

CenterforAsianAmericanMediaCAAMchannel“MemoriestoLight”

•  “HomemoviesoccupyauniqueplaceinAmericanculture.Thoughgenerallydismissedfortheiramateurquali8es,homemoviesprovideuswithexceedinglyauthen8candhonestmovingimages.Premisedonthehistoric,cultural,andar8s8cvalueofthehomemovie,MemoriestoLight:AsianAmericanHomeMoviesisana8onalpar8cipatoryartsprojectthatcollec8velyandaesthe8callyconstructssharedsocial,cultural,andpoli8calrepresenta8onsofAsianAmericadirectlyfromthecommunityitself.Sincethemainstreammediahasgivenussofewauthen8cimagesoftheAsianAmericanexperience,homevideosbecomethemostrealwaytoseehowourgrandparents,mothers,fathers,auntsanduncleslivedtheirlives.”

17Besser-Berkeley3/6/2020

CAAMchannel“MemoriestoLight”

18Besser-Berkeley3/6/2020

Page 4: Personal born-digital In what environments do we findbesser.tsoa.nyu.edu/howard/Talks/20buckland-pda.pdf · extend this censorship to totally non-poli8cal venues like this Bloomington,

3/6/20

4

Interna8onalDigitalEphemeraProject

19Besser-Berkeley3/6/2020

GENERALPROBLEMSOFTHISTYPEOFBORN-DIGITALCONTENT

Besser-Berkeley3/6/2020 20

Intheanalogworld

•  Tradi8onally,wehavecometounderstandtheworkofwriters,scien8sts,filmmakersbyscholarsstudyingtheirpapersandrough-cutsinSpecialCollec8onsandArchives

•  Theircorrespondenceandprogressivelydifferentdra\sofpapersandrough-cutsrevealtheirchangingthoughtsandcra\

•  ButhowdowegathertheseintheDigitalAge?

Besser-Berkeley3/6/2020 21

AlasdairGray'sLanark(GlasgowULibrary)

Besser-Berkeley3/6/2020 22

Correspondence

Besser-Berkeley3/6/2020 23

Wherecanwefindthesetoday?

•  DopeoplewriteleQersonpaper?Canweseetheitera8onsofchangesonmanuscripts?DopeoplesavetheirEDLs?

•  Wherecanwefindtoday’sequivalentofthese?

Besser-Berkeley3/6/2020 24

Page 5: Personal born-digital In what environments do we findbesser.tsoa.nyu.edu/howard/Talks/20buckland-pda.pdf · extend this censorship to totally non-poli8cal venues like this Bloomington,

3/6/20

5

Stagesoftheproblem

•  Stage#1:Peoplerecordondigitalmediainsteadofanalog

•  Stage#2:Peoplenolongerstoretheirdigitalworksinplacesoverwhichtheyhaveabsolutecontrol– Emailservices(gmail,yahoo)– Cloudstoragefordocuments(googledocs)– Socialnetworkservices(Vimeo,YouTube,Instagram)

Besser-Berkeley3/6/2020 25

Stage#1Issues-digitalreplacesanalog

Besser-Berkeley3/6/2020 26

Stage#1Issues-digitalreplacesanalog

•  Thiswillrequire– newinterven8ons(likechangingcreators’workflow,savingEDLs,orinterveninginemailhandlingso\ware)

– Newtools(likeforanalyzingemail)– newapproacheslikedigitalarcheology,forensics

Besser-Berkeley3/6/2020 27

Stage#2Issues-contentnolongeronharddisk

•  RiseofOnlineServicesandSocialMediaischangingwherethiscontentresides(andisimposingTOSrestric8onsthatgobeyondtherightsholder)

Besser-Berkeley3/6/2020 28

MuchContentisonFacebook

Besser-Berkeley3/6/2020 29

Iden8tarianPlarormonlyonWeb

Besser-Berkeley3/6/2020 30

Page 6: Personal born-digital In what environments do we findbesser.tsoa.nyu.edu/howard/Talks/20buckland-pda.pdf · extend this censorship to totally non-poli8cal venues like this Bloomington,

3/6/20

6

CoreMul8-loca8onProblems

•  It’sdifficultenoughwhensomeone’sphotosormoviesarespreadthroughouttheirharddisk.Buttodaysomeimagesthere,butothersontheirphone(s),YouTube,Vimeo,Instagram,Flickr,Facebook,inTweets,etc.

•  Similarproblemsplagueemail•  MostSocialNetworkTOSpoliciesprohibittheownerfromgivingtheirpasswordtoanyoneelse(evenLibrary)

Besser-Berkeley3/6/2020 31

Andhowdowehandledona8onsa\eranimportantpersondies?

Besser-Berkeley3/6/2020 32

AndtheseissuesarealsotrueforCommunityGrps&Assns

•  w/SocialMedia,groupac8vityismoreimportantthanever

•  Buteachpersoninthegroupisanindividualcollector.Andfrequentlyasetofindividualcollec8onsformsthegroupcollec8on.

Besser-Berkeley3/6/2020 33

Documen8ngProtests

Besser-Berkeley3/6/2020

-photo from Activists Guide to Archiving Video

34

Whenaggregated,manydifferentpersonalcollec8onsformanimportant

pictureof:•  Anevent•  Anethnicgroup•  Acommunity•  Asocialmovement•  Asetofarchitects•  Asetoflaw-makers

•  Whatisimportanttothem,howtheygoabouttheirbusiness,…

Besser-Berkeley3/6/2020 35

Andweknowfrompastworksthataggrega8onscreatenewmeanings

•  Aggrega8ngallthephotosandhomemoviesoftheDigitalDiasporaishugelymoremeaningfulthanasinglephoto-

•  OnetweetsaysveryliQle,butthousandsoftweetscanshowtrendsordepictapar8culareventorday

Besser-Berkeley3/6/2020 36

Page 7: Personal born-digital In what environments do we findbesser.tsoa.nyu.edu/howard/Talks/20buckland-pda.pdf · extend this censorship to totally non-poli8cal venues like this Bloomington,

3/6/20

7

DigitalDiasporaFamilyReunion

Besser-Berkeley3/6/2020 37

DigitalDiasporaFamilyReunion

Besser-Berkeley3/6/2020 38

Andweknowfrompastworksthataggrega8onscreatenewmeanings

•  Aggrega8ngallthephotosandhomemoviesoftheDigitalDiasporaishugelymoremeaningfulthanasinglephoto

•  OnetweetsaysveryliQle,butthousandsoftweetscanshowtrendsordepictapar8culareventorday

Besser-Berkeley3/6/2020 39

Butaggrega8ngitemsfromdisparatesourcescausessignificantproblems

•  Vastquan8tyofuser-contributedmaterial•  RightsIssues•  Noeasywaytocontrolforquality,fileformat,metadata(notevenanyconsistencyforanyofthese)-

Besser-Berkeley3/6/2020 40

EveryImageCollectorhasaDifferentApproach

•  Differentfile-namingconven8ons•  Differentfileformats•  Differentcompressionschemes•  Differentmetadata•  Storedindifferentarrangements/hierarchies•  Storedindifferentplaces(cellphone,personalharddisk,YouTube,Vimeo,Facebook,…)

Besser-Berkeley3/6/2020 41

THEPARTICULARPROBLEMOFMESSAGING

Besser-Berkeley3/6/2020 42

Page 8: Personal born-digital In what environments do we findbesser.tsoa.nyu.edu/howard/Talks/20buckland-pda.pdf · extend this censorship to totally non-poli8cal venues like this Bloomington,

3/6/20

8

Born-DigitalMessaging•  Increasingly,importantrecordsetshavemovedtoemailsandtextmessaging

•  Muchoftheback-and-forthincollabora8onhappensthroughmessaging–  “let’strythisinstead”–  “increasebudgetcategoryXby$12K,anddecreasecategoryYbythesameamount”

–  “shouldweinsertthisphotobelowparagraph3?”–  “reviewthedocumentIsentyouthismorning”

•  IncreasinglythistypeofmessagingisonphoneservicesorTwiQer

Besser-Berkeley3/6/2020 43

Born-DigitalCorrespondence

•  Emailarchivingisbadenough– Anddiscussionsmightshi\backandforthbtwnworkandpersonalemailaccounts

•  Butwhathappenswhenemaildiscussionsswitchbackandforthtotextmessaging(likewhensomeoneisaskedtosendaphotothatresidesontheirphone)?

•  Andphonemessagingthreadscankeepjumpingaroundbtwnservices-

Besser-Berkeley3/6/2020 44

Phonemessagingthreadscankeepjumpingaroundbtwnservices

•  Somemessagessenttodistribu8ongroupwillbeansweredone-on-one(sotheoriginalques8on/contextisinacompletelydifferentthread)-

•  Mightjumpfromoneservicetoanother,thenbackagain– Standardtextmessagingnotworkingwithindepartmentstore,soswitchovertoWhatsAppondeptstoreWifi,thenbacktostandardtextmessagingwhenoutside

Besser-Berkeley3/6/2020 45

Howard’sTextMessaging

Besser-Berkeley3/6/2020 46

Howard’sTextMessaging

Besser-Berkeley3/6/2020 47

Phonemessagingthreadscankeepjumpingaroundbtwnservices

•  Somemessagessenttodistribu8ongroupwillbeansweredone-on-one(sotheoriginalques8on/contextisinacompletelydifferentthread)

•  Mightjumpfromoneservicetoanother,thenbackagain– Standardtextmessagingnotworkingwithindepartmentstore,soswitchovertoWhatsAppondeptstoreWifi,thenbacktostandardtextmessagingwhenoutside

Besser-Berkeley3/6/2020 48

Page 9: Personal born-digital In what environments do we findbesser.tsoa.nyu.edu/howard/Talks/20buckland-pda.pdf · extend this censorship to totally non-poli8cal venues like this Bloomington,

3/6/20

9

TextMessagetoWhat’sApp

Besser-Berkeley3/6/2020 49

Phonemessagingthreadscankeepjumpingaroundbtwnservices

•  Somemessagessenttodistribu8ongroupwillbeansweredone-on-one(sotheoriginalques8on/contextisinacompletelydifferentthread)

•  Mightjumpfromoneservicetoanother,thenbackagain– Standardtextmessagingnotworkingwithinconfhall(ordeptstore),soswitchovertoWhatsApponconfcntr(deptstore)Wifi,thenbacktostandardtextmessagingwhenoutside

Besser-Berkeley3/6/2020 50

Phonethreadssome8messwitchtoEmail

•  Forexample,retrievingandsendingdocumentsorphotosthatareoncomputer,notonphone

Besser-Berkeley3/6/2020 51

IssueswithPrivacy-orientedMessagingApps--Signal

•  Privacygoesbeyondend-to-endencryp8on•  Nocloudstorageofmessagecopies•  Disables“screencapture”func8on•  AllmessagesstoredinencryptedDB(30characterkey?)

Besser-Berkeley3/6/2020 52

RUponreceipt,differentphonechannelsare

joinedtogetherthroughAlerts…

Besser-Berkeley3/6/2020 53

Butnotthreadedtogetherforlaterviewing

•  withinphonefiles•  withinmessagecaptureu8li8es•  btwnphoneandemail

Besser-Berkeley3/6/2020 54

Page 10: Personal born-digital In what environments do we findbesser.tsoa.nyu.edu/howard/Talks/20buckland-pda.pdf · extend this censorship to totally non-poli8cal venues like this Bloomington,

3/6/20

10

THEPDACONFERENCES

Besser-Berkeley3/6/2020 55

PersonalDigitalArchiving2015hQp://personaldigitalarchiving.com/

Besser-Berkeley3/6/2020 56

PDA:WhoAQends&Presents

•  Ci8zenArchivists–  Peoplewhowanttostepinandrescuecontentinperil–  Peoplewholiketocreateso\ware/Apps/Guidelinestohelpothersfacingsimilarproblems

•  CommunityorEthnicgroupsandac8vistswan8ngtosavepor8onsoftheirheritage

•  Professionallibrarians&archivists(andtheirprogrammingsupportstaff)

•  Regularso\waredevelopers•  Researchers(bothacademicandcomputerindustry)

Besser-Berkeley3/6/2020 57

PDAGoals—Sharingknowledge

•  Whatworkedandwhatdidn’t;whatpartsturnedouttobemoredifficultthanan8cipated

•  Newanddifferenttypesofcontenttocollect•  Guidelines,procedures,workflows,methodologies

•  So\ware

Besser-Berkeley3/6/2020 58

PDAHistoryIni8allystartedbyInternetArchivewithco-sponsorshipfromNetherlandsSound&Vision,LC/NDIIPPandCNI•  2010InternetArchive•  2011InternetArchive•  2012InternetArchive•  2013UnivofMaryland•  2014IndianaStateLibrary&ISU•  2015NewYorkUniversity•  2016UniversityofMichigan•  2017StanfordUniversity•  2018UniversityofHouston•  2019UniversityofPiQsburgh

Besser-Berkeley3/6/2020 59

SamplePDATalks-Indianapolis2014hQp://visions.indstate.edu/pda2014/conference-program.html

•  TheSocialLifeofPersonalInforma8on,•  DefiningthePersonalinDigitalArchivingandCommuni8es•  PersonalArchivingasaGatewaytoDataLiteracy•  FindingRoots,Gems,andInspira8on:UnderstandingtheUl8mateUsesofDigital

Materials•  Opportuni8esandChallengesinAccessioningPersonalDigitalArchives•  PublicLibrariesandPersonalDigitalArchiving:OutreachLessonsLearned•  ClearasGlass:ControllingtheDataGeneratedbyWearableTechnology•  PersonalArchivingWithintheScholarlyWorkflow:ZoteroasConnectorfor

Collec8ngFacultyWork•  PreservingYourDigitalPhotosUsingFreeorLow-CostSo\ware•  Collabora8veDataManagementandProvenanceasCorePillarsofPersonalDigital

Cura8on•  FamilyArchiveasaNarra8veOrganiza8on•  DesigningaPersonal&FamilyArchiveforthe21stCentury

Besser-Berkeley3/6/2020 60

Page 11: Personal born-digital In what environments do we findbesser.tsoa.nyu.edu/howard/Talks/20buckland-pda.pdf · extend this censorship to totally non-poli8cal venues like this Bloomington,

3/6/20

11

PDA2015hQp://blogs.loc.gov/digitalpreserva8on/2015/08/report-on-the-personal-digital-

archiving-2015-conference/

Besser-Berkeley3/6/2020 61

PDA2015

Besser-Berkeley3/6/2020 62

PDA2015Topics

•  PreservingDigitalPhotos•  DigitalPreserva8onofArt•  Crea8veuseofOpenSourcetools-•  CollegeLibrarysponsoredPersonalDigitalArchivingDays

•  CommunityEngagement•  VideoPreserva8on•  DigitalHumani8es&SocialScience•  Workshops-

Besser-Berkeley3/6/2020 63

Crea8veuseofOpenSourcetools•  JasonScoQ,“WhentheEmulatorsBrokeFree.”ScoQ’sbasicmessagewasthattheInternetArchive

hasmadesignificantadvanceswithemula8onofoldgamesonline.•  Jus8nMckinney,MarkSimonHaydnandAshleyBlewer,“

DoesBitTorrent+PrivateTrackers=TheNewFilmArchive?”(PDF).MckinneysaidBitTorrentisnotinherentlyevil,it’sjustatool,andpeople—especiallyprofessionalarchivistsandlibrarians—shouldbemoreopenmindedaboutusingBitTorrentasatested,effec8vetooltotransferlarge(legal)filesquicklyonline.

•  CalLeeandKamWoods,“BitstreamConfiden8al:Considera8onsandApproachestoCura8ngPoten8allySensi8vePersonalDatainCollec8ons”(PDF)

•  WendyHagenmaier,“PDAasanOpportunityforCollabora8veAdvocacyandMurderMysteryIntrigue”(PDF)and“AnExplora8onofthePoten8alImpactofWearableCompu8ngTechnologiesonDigitalArchivingandPreserva8on”(PDF)

•  CheyenneJansdaQer,“BuildingDigitalCollec8onsInOmekaForTheLayperson”•  AshleyBlewer,“Don’tknowaboutyou,butI’mfeelinglikeSHA-2!”•  PeterChan,“5.25inchfloppydisks”(PDF).

Besser-Berkeley3/6/2020 64

WorkshoponDo-It-YourselfPersonalDigitalArchiving

Besser-Berkeley3/6/2020 65

WorkshoponArchiveMa8ca

Besser-Berkeley3/6/2020 66

Page 12: Personal born-digital In what environments do we findbesser.tsoa.nyu.edu/howard/Talks/20buckland-pda.pdf · extend this censorship to totally non-poli8cal venues like this Bloomington,

3/6/20

12

WorkshoponEmailArchiving

Besser-Berkeley3/6/2020 67

SOMEKEYCONCEPTSFROMINTERPARES,PDPTV,ACTIVISTARCHIVISTS Besser-Berkeley3/6/2020 68

KeyConcepts&ChallengesfrommypriorworkwithothertypesofDigitalContent

•  InterPARES—Ifwehopetopreserveelectronicrecords,archivistsneedtobeinvolvedearlyinthelife-cycleofthatrecord,longbeforetherecordentersthearchive

•  PreservingDigitalPublicTelevision—Pushingmetadatagatheringupstreamintotheproduc8oncycle-

Besser-Berkeley3/6/2020 69

Preserving Digital Public Television Workflow in Production Process-

•  Site Visits to productions •  Interview Production staff •  Diagrams of Workflow-

Besser-Berkeley 3/6/2020

Pushing Metadata Gathering Upstream: The Problem

TRADITIONALLY… •  Very little metadata required for

preservation accompanies an object to a repository.

•  Archives, libraries and other repositories must create (or re-create) most of the necessary metadata.

•  This requires many manual hours, and significant resources - both time and money.

IN THE DIGITAL WORLD… •  This doesn’t scale up. Repositories

will be unable to continue in this manner, as more metadata than ever is required.

Besser-Berkeley 3/6/2020

But much of the necessary metadata has already been gathered during production

•  For each element/clip, production team usually notes source, date, place, people, and other descriptive info

•  But this is treated as internal information, and often various parts of the info are distributed among the personal notebooks of different production assistants

•  There is seldom a central location for this info, and the info is seldom turned over to the archive (which later tries to recreate much of it)

Besser-Berkeley 3/6/2020

Page 13: Personal born-digital In what environments do we findbesser.tsoa.nyu.edu/howard/Talks/20buckland-pda.pdf · extend this censorship to totally non-poli8cal venues like this Bloomington,

3/6/20

13

When the Archive tries to re-create this info, it is seldom successful

Producers know much more about the content of their productions than the archivists do. Archivists wanting accurate info must go back to the production staff (often years later) to start brainstoriming over the info

“Once the (television) program is finished, it is passed on to the archive or library for safe keeping. Librarians will catalog and classify the content, possibly using a proxy copy, and enter the resulting informative metadata in their database so they can retrieve it in the future. However, rarely if ever is the metadata from the rest of the process passed onto them, except, perhaps, for the title, tape number, and basic technical information about recording formats. It has to be re-created, with all the associated risk of errors and lack of accuracy--not to mention the work and time involved.”

- Cox, Tadic, and Mulder, Descriptive Metadata for Television (2006)

Besser-Berkeley 3/6/2020

We need to find ways to push metadata access upstream

•  Digital requires even more metadata than Analog –  As the workflow becomes file-based, the need for robust and accurate

metadata will become critical. File relationships, video codecs, bit rates, and rights information must be explicit, accurate, and immediately accessible. This will require a much deeper level of metadata than is currently captured in tape-based archives.

–  We can’t continue to supply this metadata at ingest; that won’t scale •  Obtaining the necessary metadata at the end of production and

broadcast life cycle is not feasible. Metadata will need to be systematically gathered during the production lifecycle and submitted with the programs to the preservation repository.

Besser-Berkeley 3/6/2020

Besser-Berkeley 3/6/2020

Examined Potential Points of Metadata Capture

Besser-Berkeley 3/6/2020

Examined Potential Points for Metadata Capture •  Much of the necessary metadata for preservation is already

generated by the production unit, but discarded after their internal use. This needs to be captured throughout the workflow.

•  “Those in the production unit are the creators and have first

hand knowledge of who, what, where, when, and why the content was created.” -- Mary Ide and Leah Weisse, WGBH Archivists.

Proposed Solutions…?

•  Preservation becoming a shared responsibility between content

creators, distributors, curators, and preservationists.

•  Partnerships are needed to come to unified solutions.

•  Preservationists seek reliable metadata back upstream in the production workflow...

Besser-Berkeley 3/6/2020

WorldFocus •  Nightly news program begun Oct 2008 •  We began working with Workflows six months before program

began •  Had ability to engineer metadata gathering into the creation/

production process

Besser-Berkeley 3/6/2020

Page 14: Personal born-digital In what environments do we findbesser.tsoa.nyu.edu/howard/Talks/20buckland-pda.pdf · extend this censorship to totally non-poli8cal venues like this Bloomington,

3/6/20

14

Ac8vistArchivistshQp://ac8vist-archivists.org/(useWaybackMachine)

hQps://www.facebook.com/Ac8vistArchivists/

•  NYUMIAPstudentsandgradsoriginallyworkingonarchivingmediafromtheOccupymovement

•  Guidelinesbothac8vistcreatorsandarchives•  Developednewerlow-impactmethods

Besser-Berkeley3/6/2020 79

HowOccupymaterialresembleswhatwe’llbefacinginthefuture

•  Vastquan8tyofuser-contributedmaterial•  Noeasywaytocontrolforquality,fileformat,metadata–  noenforcingguidelinesaswithorganiza8onalrecords–  nosemi-consistencyasinasingleindividual’spersonalrecords

•  MuchofthematerialcanmosteasilybefoundonSocialNetworks

•  …weneedtofindsmartwaystoharvestmetadataandanalyzefiles,aswellastoinfluencebehaviorofpoten8alcontributors

Besser-Berkeley3/6/2020 80

Ac8vistArchivistWebsite

Besser-Berkeley3/6/2020 81

Ac8vistArchivistsProjects-

•  “WhyArchive”postcard&video•  7TipstoEnsureYourVideoIsUsableintheLongTerm

•  Studyofmetadatalossthroughuploadingtoservices•  BestPrac8cesforCreators/Collectors•  “Toolkit”forOccupyarchiving•  Coordina8ngdiscussionsamongvariousgroupsarchivingdifferentpartsofOccupy

•  Exploringmethodsforobscuringiden88es

Besser-Berkeley3/6/2020 82

IMPORTANTLESSONSFROMACTIVISTARCHIVISTS(&OTHERS)

Besser-Berkeley3/6/2020 83

LessonsLearnedforArchivists-

•  CommunicatewellwithyourfutureContributors•  DevelopCoopera8veRela8onships•  Makeiteasyforfuturecontributorstocreate“archival-friendly”works

•  ForCoopera8veProjects,allowforinstruc8onsnotbeingfollowed

•  FindsmartwaystodealwithScale•  HandlePrivacy&Securityresponsibly

Besser-Berkeley3/6/2020 84

Page 15: Personal born-digital In what environments do we findbesser.tsoa.nyu.edu/howard/Talks/20buckland-pda.pdf · extend this censorship to totally non-poli8cal venues like this Bloomington,

3/6/20

15

CommunicatewellwithyourfutureContributors-

•  Learntospeaktheirlanguage•  Helpthemtorealizetheimportanceofarchiving

Besser-Berkeley3/6/2020 85

“WhyArchive”video

Besser-Berkeley3/6/2020 86

“WhyArchive”postcard•  ACCOUNTABILITY.Archivescollectevidencethatcanholdthoseinpower

accountable.•  SELF-DETERMINATION.Wedefineourownmovement.Weneedto

createandmaintainourownhistoricalrecord.•  SHARE.Archivesareapointofentrytoourmovement’srichrecord.We

canusethemtoensuretransparency,generatediscussion,andenabledirectac8on.

•  EDUCATE.Today’svideos,flyers,web-pages,andsignsarematerialfortomorrow’sskill-shares,classes,andmobiliza8ons.

•  CONTINUITY.Justaspastmovementsinspireus,newac8vistswilllearnfromtheexperienceswedocument.

•  RECORD&COLLECTwhat’shappeningaroundyou.•  PRESERVEtherecord.

Besser-Berkeley3/6/2020 87

DevelopCooperaWveRelaWonships-

•  TrytobeQerunderstandwhattheiraimsare;getinvolvedintheirac8vi8es

•  Developpartneringrela8onships

Besser-Berkeley3/6/2020 88

par8cipatedinSelf-helpac8vi8es:Skill-sharesforOccupiers

Besser-Berkeley3/6/2020 89

Self-helpac8vi8es:

OtherArchiveShare-DayandHackathonac8vi8es

•  BatchdownloadfromFLICKRwithselectedaQributes(#OWS,Crea8veCommons,EXIFmetadata,tagged-textmetadata)

•  Re-mixingofolderfootage•  Crea8ngavisual8meline•  Miningmaterialfordata(eg.numberofco-loca8onsofanofficer’snamewith“pepperspray”)

Besser-Berkeley3/6/2020 90

Page 16: Personal born-digital In what environments do we findbesser.tsoa.nyu.edu/howard/Talks/20buckland-pda.pdf · extend this censorship to totally non-poli8cal venues like this Bloomington,

3/6/20

16

Makeiteasyforfuturecontributorstocreate“archival-friendly”works-

•  Low-hangingfruit•  Easyinstruc8onalmaterialthatappealstowhattheythinkisimportant

•  Instruc8onsforredundantmetadatacollec8on(tomakesurethatitiscaptured)

Besser-Berkeley3/6/2020 91

Low-Hangingfruit

•  TurnGPSon•  Developstrategiesforautoma8ngaprofileanduploads(ouridealApp)

Besser-Berkeley3/6/2020 92

7TipstoEnsureYourVideoIsUsableintheLongTerm

•  Collectdetailswhilefilming•  Keepyouroriginalrawfootage,unaltered•  Makeyourvideodiscoverable•  Contextualizeit•  Makeitverifiable•  Allowotherstocollectandarchive•  Orarchiveityourself

Besser-Berkeley3/6/2020 93

BestPrac8cesforContentCreators

•  Security–  Hiddencameralaws,par8es’consentlaws

•  CapturingContent–  Highestquality,setdateand8me-stamps,noteloca8on

•  OffloadingContent–  Rawfilesdirectlyontocomputer,keepmaterialorganized

•  UploadingContent–  Importanceoftagging,reviewofdiffservices

•  Deposi8ngwithanArchive•  Copyright

Besser-Berkeley3/6/2020 94

OccupyArchivingKit•  WhyArchive?•  Whatisan“archive”?HowdoIcreateanarchive?•  Crea8ngarchiving-friendlycontent•  HowcanIcollectmaterialsforthearchive?•  WhatshouldIsave?•  HowshouldIorganizemymaterials?HowdoIgetitintothearchive?•  Descrip8on/Metadata•  MediaManagement•  Storage&Preserva8on•  Access•  Exhibi8onandPresenta8on/Outreach•  RightsandRe-Use

Besser-Berkeley3/6/2020 95

WITNESS:Ac8vists’GuidetoArchivingVideo,YvonneNg

hQp://archiveguide.witness.org/

Besser-Berkeley3/6/2020 96

Page 17: Personal born-digital In what environments do we findbesser.tsoa.nyu.edu/howard/Talks/20buckland-pda.pdf · extend this censorship to totally non-poli8cal venues like this Bloomington,

3/6/20

17

Collec8ng–ThinkTank

Besser-Berkeley3/6/2020 97

ThinkTankmetadataredundancies•  Guideliness8pulatethatpersonholdingrecordingdevicewillchecktoseethat8meanddatestamparecorrectbeforebeginningrecording(mostlydidn’thappen)

•  Guideliness8pulatethatascriptbereadverba8matthebeginningoftherecording,withdate,8me,proposedsubject,etc.(andwouldeventuallyallowvoice-recogni8onso\waretocreateappropriatemetadata).Scriptalsostatedthatallpar8cipantsagreedtoCrea8veCommonslicensingoftherecording

•  Guidelinesrequestedthatdate/8mebeembeddedintheappliedfile-name

Besser-Berkeley3/6/2020 98

FindsmartwaystodealwithScale-

Besser-Berkeley3/6/2020 99

Collec8ngStreamingMediaTheNYUMellonComposersProject

•  Tradi8onalWebCrawlers(Heritrix) follow links and capture most web content. But they are less successful with streaming video and dynamic content executed in the browser (like JavaScript).

•  NYU collaborated with IA to create a combined crawler and browser-

Besser-Berkeley3/6/2020 100

BROZZLER!

“browser” | “crawler” = BROZZLER

Logo: Noah Levitt 101Besser-Berkeley3/6/2020 102Besser-Berkeley3/6/2020

Page 18: Personal born-digital In what environments do we findbesser.tsoa.nyu.edu/howard/Talks/20buckland-pda.pdf · extend this censorship to totally non-poli8cal venues like this Bloomington,

3/6/20

18

StreamcapturereliesonYoutube-dlhQps://rg3.github.io/youtube-dl/supportedsites.html

103Besser-Berkeley3/6/2020

TamimentYouTubecollec8ng

•  TamimentArchivewasselec8velybrowsingthroughYouTubeOccupyvideos,tryingtochoosewhichonestokeep,thencatalogingthemwith–  Title,Creator,Crea8onDate,UploadDate,Descrip8on,URL,YoutubeUsername,License,Format,Codec,SourceMedia,OnInternetArchive,CCLicensetype

•  Buttheydidn’trealizethatthiswouldn’tscale!

Besser-Berkeley3/6/2020 104

March24,2012YouTubestats(just6monthsa\erstartofmovement)

•  “#Occupy”169,000•  “OccupyWallStreet”98,400•  “OccupyProtest”70,500•  “OccupyMovement”54,800•  “#OWS”50,300•  “OccupyOakland”13,400•  “Zuco�Park”6,690

Besser-Berkeley3/6/2020 105

Alterna8veapproachtoYouTubeSelec8onprocess

•  DevelopcategoriesofimportantYouTubevideos– Celebrityvisits,Internalworkings(library,kitchen,media),Confronta8onswithpolice,Labor,Housing,etc.

•  HaveOccupiersfillinanonlineformlis8ngthe5mostimportantvideosineachcategory

Besser-Berkeley3/6/2020 106

AdvantagesofYouTubeCollabora8veFilteringSelec8onProcess

•  Scalableandmanageable•  ConsistentwithOccupyideasofinclusivenessandofmanagingownstory

•  Tamimentcans8llchoosetobeselec8veincollec8ngonlyapor8onofwhatisvotedin,butthetotalsetforreviewisamanageablescale

Besser-Berkeley3/6/2020 107

HandlePrivacy&Securityresponsibly-

Besser-Berkeley3/6/2020 108

Page 19: Personal born-digital In what environments do we findbesser.tsoa.nyu.edu/howard/Talks/20buckland-pda.pdf · extend this censorship to totally non-poli8cal venues like this Bloomington,

3/6/20

19

“Inaneffortto

todevelopmethodologyandanapproachandwillredacttheemailaddressesorotherpersonallyiden8fiableinforma8onfrombroadpublicpresenta8on.”Formoreseelibrary.ucla.edu/service/scl/rights-toolkit

UCLA Deed of Gift template

Besser-Berkeley3/6/2020 109

Promo8ngPrivacyProtec8onExamplefromWITNESS

•  “ObscuraCamisavisualprivacyappforphotoandvideo,thatgivesyouthepowertobeQerprotecttheiden8tyofthosecapturesinyourphotos,beforeyoupostthemonline”

•  DevelopedbyGuardianProjectinconjunc8onw/HumanRightsgroupWITNESS-

Besser-Berkeley3/6/2020 110

ObscuraCam

Besser-Berkeley3/6/2020 111

DiscussissuesaroundcommercialserviceswithCreators/Recorders-

•  DisappearanceofembeddedmetadatafromYouTube&Vimeo

•  MoregeneralRightsissues•  GivearchivestheIPrighttodownload

Besser-Berkeley3/6/2020 112

Studyofmetadatalossthroughuploadingtoservices

Besser-Berkeley3/6/2020 113

Evenwithgoodrecords,RightsIssuesremain1999WTOSeaQleProtest20thanniversaryVideoPreserva8on

Besser-Berkeley3/6/2020 114

Page 20: Personal born-digital In what environments do we findbesser.tsoa.nyu.edu/howard/Talks/20buckland-pda.pdf · extend this censorship to totally non-poli8cal venues like this Bloomington,

3/6/20

20

Evenwithgoodrecords,RightsIssuesremain1999WTOSeaQleProtest20thanniversaryVideoPreserva8on

Besser-Berkeley3/6/2020 115

Evenwithgoodrecords,RightsIssuesremain1999WTOSeaQleProtest20thanniversaryVideoPreserva8on

Besser-Berkeley3/6/2020 116

YouTubeUserAgreement

•  5B“YoushallnotdownloadanyContentunlessyouseea‘download’orsimilarlinkdisplayedbyYouTubeontheServiceforthatContent.”

Besser-Berkeley3/6/2020 117

Crea8veCommonsGuidance•  Crea8veCommonsletsyoumix-and-matchfourdifferent

condi8ons:–  AQribu8on:Youletotherscopy,re-useanddistributeyourvideo,butthey

mustcredityou.–  Share-Alike:Youletotherscopy,re-useanddistributeyourvideo,onlyifthey

dothesamewiththeworktheycreate.–  Non-Commercial:Youletotherscopy,re-useanddistributeyourvideofor

non-commercialpurposesonly.–  NoDeriva8veWorks:Youletotherscopyanddistributeyourvideo,butnotto

createnewworksusingit.•  Youcanusethesecondi8onsindifferentcombina8onstoshareyourwork

inacontrolledway.Crea8veCommonslicensesarelegaltoolsthatdependonpre-exis8ngcopyrightlaws.HavingaCrea8veCommonslicenseonyourworkmaygiveyoulegalrecourse,butitmaynotactuallypreventpeoplefromdownloadingandre-usingyourvideoillegally.

Besser-Berkeley3/6/2020 118

MarkingCrea8veCommonslicenses•  ThereareafewwaystomarkyourvideowithaCrea8veCommons

license.OnewayistoincludeaCrea8veCommons“bumper”ortextcardinyourvideo.Crea8veCommonshascreatedsomewithgraphicsthatyoucandownloadfromtheirwebsite.Thismethodisusefulifyourvideoisgoingtobesharedoffline(e.g.onDVD,livescreenings),asthelicenseinforma8onisaQachedtothevideoitself.

•  AnotherwaytomarkyourvideowithaCrea8veCommonslicenseistopublishyourvideoonplarormsthatareCrea8veCommons-enabled,suchasYouTube,Vimeo,orInternetArchive.Theseplarormsallowyoutoeasilyselectalicenseduringtheuploadprocess.Thismethodisusefulbecausethelicenseismachine-readable.Asearchengine,forexample,candetectthelicense.

Besser-Berkeley3/6/2020 119

TipsforArchivistsonOutreachtoCommuni8es

•  Buildtrust•  Speakintheirlanguage(notarchive-speak)•  Iden8fywaysyoucanmeetneedstheyalreadyperceive

•  Approachprojectsascollabora8onwheneverpossible

•  Don’tonlyfocusoncontentandmetadata,butalsorightsthatcanbeanimpedimenttopreserva8on

Besser-Berkeley3/6/2020 120

Page 21: Personal born-digital In what environments do we findbesser.tsoa.nyu.edu/howard/Talks/20buckland-pda.pdf · extend this censorship to totally non-poli8cal venues like this Bloomington,

3/6/20

21

OtherProjectstomonitor-

•  Documen8ngtheNowhQps://www.docnow.io/

•  Interna8onalDigitalEphemeraProjecthQp://idep.library.ucla.edu/

•  ePADDhQps://library.stanford.edu/projects/epadd

Besser-Berkeley3/6/2020 121

Documen8ngtheNowhQps://www.docnow.io/

•  Documen8ngtheNowrespondstothepublic'suseofsocialmediaforchroniclinghistoricallysignificanteventsaswellasdemandfromscholars,students,andarchivists,amongothers,seekingauser-friendlymeansofcollec8ngandpreservingthistypeofdigitalcontent.Documen8ngtheNowhasastrongcommitmenttopriori8zingethicalprac8ceswhenworkingwithsocialmediacontent,especiallyintermsofcollec8onandlong-termpreserva8on.ThiscommitmentextendstoTwiQer'sno8onofhonoringuserintentandtherightsofcontentcreators.Theprojectisacollabora8veeffortbetweenShi\Design,Inc.,theUniversityofMaryland,andtheUniversityofVirginia.

Besser-Berkeley3/6/2020 122

Interna8onalDigitalEphemeraProjecthQps://idep.library.ucla.edu/

Besser-Berkeley3/6/2020 123

ePADDhQps://library.stanford.edu/projects/epadd

•  “ePADDisfreeandopensourceso\waredevelopedbyStanfordUniversity'sSpecialCollec8ons&UniversityArchivesthatsupportstheappraisal,processing,preserva8on,discovery,anddeliveryofhistoricalemailarchives.ePADDincorporatestechniquesfromcomputerscienceandcomputa8onallinguis8cs,includingmachinelearning,naturallanguageprocessing,andnameden8tyrecogni8ontohelpusersaccessandsearchemailcollec8onsofhistoricalandculturalvalue.”

Besser-Berkeley3/6/2020 124

RemainingAppliedResearchQues8ons

•  HowdodealwithScalewiththetsunamiofborn-digitalcontent(appraisal,descrip8on,discovery,workflow,…)

•  Howtoavoidourcollec8onscon8nuingtoreflecttheworldoftherich,well-known,andpowerful

•  BeQermethodsforcollec8ngsocialmediaandthreadedphonemessages

•  Tensionbtwnpreserva8on&privacy(&handlingmassiveamountsofredac8on)

•  ImprovingpublicpolicyandTOSinareaslikeIP,privacy,andwhathappensupondeath

Besser-Berkeley3/6/2020 125 Besser-Berkeley3/6/2020

•  hQp://besser.tsoa.nyu.edu/howard/Talks•  hQp://idep.library.ucla.edu/•  hQp://ac8vist-archivists.org/(useWayback)•  hQps://www.facebook.com/Ac8vistArchivists/•  hQps://archive.org/details/personaldigitalarchiving•  hQp://www.docnow.io/•  hQp://blogs.loc.gov/digitalpreserva8on/2015/08/report-on-the-personal-digital-

archiving-2015-conference/

ArchivingtheNon-Organiza8onalBorn-Digital:TheChallengesPosedbyMaterialfromIndividuals,Communi8es,&Events

126

Page 22: Personal born-digital In what environments do we findbesser.tsoa.nyu.edu/howard/Talks/20buckland-pda.pdf · extend this censorship to totally non-poli8cal venues like this Bloomington,

3/6/20

22

ArchivingtheNon-Organiza8onalBorn-Digital:TheChallengesPosedbyMaterialfromIndividuals,

Communi8es,&Events

Besser-Berkeley3/6/2020 127