Transcript
Page 1: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier

Christopher Cronin, University of Colorado at BoulderMartha Hanscom, University of WyomingNancy Chaffin, Colorado State University

Colorado Association of Libraries ConferenceNovember 11, 2005

Page 2: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Agenda

Introduction to the collection Issues around using historical newspapers General search strategies Keyword searching Stop List Searching broad and narrow topics

Page 3: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 4: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 5: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Colorado’s Historic Newspaper Collection http://www.cdpheritage.org/collection/chnc.cfm Collaborative project between the CDP,

Colorado State Library, and Colorado Historical Society Library Services & Technology Act

(LSTA)/Institute of Museum and Library Services (IMLS) grant to digitize the microfilm of Colorado newspapers

Started with 48 newspapers from 1859-1880, but grew to 85 newspapers though the year 1923

Page 6: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Using Historic Newspapers

Technological: Optical Character Recognition (OCR)Accuracy is dependent on the quality of the

microfilm, and quality of the original when it was microfilmed

Cultural: changes in publishing practices, journalistic practice, and language usage.

Page 7: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 8: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Differences in Journalistic Practice

Shorter articles Mixture of fact and opinion Variant language usage: from obsolete and archaic to

derogatory and offensive Additional resources:

Newspapers by Anne Rubenstein, on the Center for History and New Media web site (George Mason University). Available: http://chnm.gmu.edu/worldhistorysources/unpacking/newsmain.html

Uncovering our history: teaching with primary sources by Susan H. Veccia (American Library Association, 2004)

Page 9: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

General Search Strategies – 1

Broad topics: if you’re interested in transportation, search for specific modes of transportRailroads, stage coach, horses, wagons,

automobiles Even more specific: names of transportation

companies like Union Pacific, or Ford and Buick

Page 10: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

General Search Strategies – 2

Limit searches for common terms to specific dates or newspapers

Be creative when searching for geographic areas

Broaden and narrow searches appropriately based on results: if a newspaper doesn’t list “Ford”, search under “cars” or automobiles”

Page 11: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Keyword Searching

See pages 3-5 of the Searching Colorado’s Historic Newspaper Collection handout Connectors: and, or, not Phrase: quotation marks (plus connectors if needed) Only case-sensitive when an initial capital is used Truncation/wildcard: asterisk (*) Punctuation: do not use apostrophes (’) Spelling: be aware of alternate spellings Abbreviations: be aware of different practices in abbreviations

(e.g., Wm. for William)

Page 12: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Stop List

Do not search with common words: a, an, at, the

Words that are too common will generate an error message:

“The following errors occurred: The query string is empty. The following words are very common and were not included in your search: [term]”

Use asterisks creatively: “Gre* War”

Page 13: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Searching for a broad topic – Immigration

Page 14: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Strategy

Choose a topic Read secondary sources Use a thesaurus to find words to search

Page 15: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Strategy

Break up topic into smaller elements Search one aspect

Examples: Education or language Limit search by date range or geographic

area

Page 16: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Strategy

Use wild cards (*)*migration or immigra*

Combine terms Immigrants and literate

Try alternate spellingsTheater or theatre

Page 17: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Strategy

Test the search strategy See what results you get Get ideas for other search termsRefine the search

Try the search again

Page 18: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Searching Colorado’s Historic Newspaper Collection

Page 19: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 20: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 21: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 22: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 23: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 24: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 25: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 26: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 27: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 28: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 29: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 30: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Examples of search results

Page 31: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 32: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 33: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 34: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 35: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 36: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 37: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 38: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 39: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 40: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 41: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 42: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 43: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 44: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 45: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 46: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 47: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 48: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 49: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 50: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 51: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 52: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 53: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 54: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 55: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 56: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Colorado’s Historic Newspapers Collection:

Germans from Russia:

Searching for a specific immigrant group

Page 57: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Searching strategies: Background information Dates

CHNC: 1859-1923 Germans from Russia: 1885-1924

Terminology Volga Germans Russians Black Sea Germans

Specific locations Globeville Larimer, Weld, Morgan, Logan Counties (South Platte watershed) Pueblo

Source: Rock, Kenneth W., “Unsere Leute:” The Germans from Russia in Colorado. Colorado History Magazine, v. 54, no. 2 (1977) pp 154-183.

Page 58: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 59: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 60: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 61: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 62: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 63: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 64: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 65: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 66: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 67: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 68: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 69: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 70: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Page 71: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Questions?

Page 72: Historic Language and Historic Newspapers: Strategies for Breaking the Language Barrier Christopher Cronin, University of Colorado at Boulder Martha Hanscom,

Collaborative Digitization Programwww.cdpheritage.org

©2005

__________________________________________

Conclusion


Top Related