historic language and historic newspapers: strategies for breaking the language barrier christopher...

72
Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom, University of Wyoming Nancy Chaffin, Colorado State University Colorado Association of Libraries Conference November 11, 2005

Post on 19-Dec-2015

219 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier

Christopher Cronin, University of Colorado at BoulderMartha Hanscom, University of WyomingNancy Chaffin, Colorado State University

Colorado Association of Libraries ConferenceNovember 11, 2005

Page 2: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Agenda

Introduction to the collection Issues around using historical newspapers General search strategies Keyword searching Stop List Searching broad and narrow topics

Page 3: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 4: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 5: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Colorado’s Historic Newspaper Collection http://www.cdpheritage.org/collection/chnc.cfm Collaborative project between the CDP,

Colorado State Library, and Colorado Historical Society Library Services & Technology Act

(LSTA)/Institute of Museum and Library Services (IMLS) grant to digitize the microfilm of Colorado newspapers

Started with 48 newspapers from 1859-1880, but grew to 85 newspapers though the year 1923

Page 6: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Using Historic Newspapers

Technological: Optical Character Recognition (OCR)Accuracy is dependent on the quality of the

microfilm, and quality of the original when it was microfilmed

Cultural: changes in publishing practices, journalistic practice, and language usage.

Page 7: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 8: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Differences in Journalistic Practice

Shorter articles Mixture of fact and opinion Variant language usage: from obsolete and archaic to

derogatory and offensive Additional resources:

Newspapers by Anne Rubenstein, on the Center for History and New Media web site (George Mason University). Available: http://chnm.gmu.edu/worldhistorysources/unpacking/newsmain.html

Uncovering our history: teaching with primary sources by Susan H. Veccia (American Library Association, 2004)

Page 9: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

General Search Strategies – 1

Broad topics: if you’re interested in transportation, search for specific modes of transportRailroads, stage coach, horses, wagons,

automobiles Even more specific: names of transportation

companies like Union Pacific, or Ford and Buick

Page 10: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

General Search Strategies – 2

Limit searches for common terms to specific dates or newspapers

Be creative when searching for geographic areas

Broaden and narrow searches appropriately based on results: if a newspaper doesn’t list “Ford”, search under “cars” or automobiles”

Page 11: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Keyword Searching

See pages 3-5 of the Searching Colorado’s Historic Newspaper Collection handout Connectors: and, or, not Phrase: quotation marks (plus connectors if needed) Only case-sensitive when an initial capital is used Truncation/wildcard: asterisk (*) Punctuation: do not use apostrophes (’) Spelling: be aware of alternate spellings Abbreviations: be aware of different practices in abbreviations

(e.g., Wm. for William)

Page 12: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Stop List

Do not search with common words: a, an, at, the

Words that are too common will generate an error message:

“The following errors occurred: The query string is empty. The following words are very common and were not included in your search: [term]”

Use asterisks creatively: “Gre* War”

Page 13: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Searching for a broad topic – Immigration

Page 14: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Strategy

Choose a topic Read secondary sources Use a thesaurus to find words to search

Page 15: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Strategy

Break up topic into smaller elements Search one aspect

Examples: Education or language Limit search by date range or geographic

area

Page 16: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Strategy

Use wild cards (*)*migration or immigra*

Combine terms Immigrants and literate

Try alternate spellingsTheater or theatre

Page 17: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Strategy

Test the search strategy See what results you get Get ideas for other search termsRefine the search

Try the search again

Page 18: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Searching Colorado’s Historic Newspaper Collection

Page 19: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 20: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 21: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 22: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 23: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 24: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 25: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 26: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 27: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 28: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 29: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 30: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Examples of search results

Page 31: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 32: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 33: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 34: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 35: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 36: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 37: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 38: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 39: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 40: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 41: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 42: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 43: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 44: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 45: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 46: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 47: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 48: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 49: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 50: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 51: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 52: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 53: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 54: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 55: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 56: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Colorado’s Historic Newspapers Collection:

Germans from Russia:

Searching for a specific immigrant group

Page 57: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Searching strategies: Background information Dates

CHNC: 1859-1923 Germans from Russia: 1885-1924

Terminology Volga Germans Russians Black Sea Germans

Specific locations Globeville Larimer, Weld, Morgan, Logan Counties (South Platte watershed) Pueblo

Source: Rock, Kenneth W., “Unsere Leute:” The Germans from Russia in Colorado. Colorado History Magazine, v. 54, no. 2 (1977) pp 154-183.

Page 58: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 59: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 60: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 61: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 62: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 63: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 64: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 65: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 66: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 67: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 68: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 69: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 70: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 71: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Questions?

Page 72: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Conclusion